Gene PHATRDRAFT_40416 details


Gene Information

Locus tag: PHATRDRAFT_40416
Symbol:
ID: 7198304
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011691
Strand:
Start bp: 341072
End bp: 343174
Gene Length: 2103 bp
Protein Length: 700 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002184348
Protein GI: 219128288
COG category:
COG ID:
TIGRFAM ID:
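
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length(341072, 343174)
print(length)                           # 2103
print(expected_protein_length(length))  # 700
```

Both results agree with the listed Gene Length (2103 bp) and Protein Length (700 aa).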


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.541093
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGTCCA CGTACATTGT CTCCGTAATT CTTTGGACCG CAAGCGTAAC AGCTGTTGTG 
TGTACGGTTG GTATTTGCGC CGCGGAGGTA ACCTTCTGGA TTGGTCATGG TCTCGGAATT
CCCTGGGATC GGGTCCCACT CGCTTGGTAT CGTTGGGAAG AATTCCAAAG CTTGATTTCC
TCCAAGGCCG AAGAGGGGGC GCTCAAGGTG TGGATGGTTA TTATGGGACA AATGTCCTGG
ACGATCTTGT TGAGCTTGTT TCCATTTCCT TGTCATAAAG GATCATCCCG TCGCCAACAA
AAAGACCGAA TTACTGCCAC AACAAGCAGA CACCTGCCCG GTACCACAAG TGCGCCAAGT
CCTGCTAGAG GATTTACTCA TTTACTCACC GCAAAGAAGA GCGTGGCCGC ACTTTTGACT
ATTTACGGCA TATTAGTTCT CCTGCTACGA CCGTCTATTC CATACACAAG GCTTTCGACA
ACACCCTTGC TTATGGTGTC ACTGGAAGTC ACAAAGGGCT TTCAACAGTA TCAAGAGTTG
CTCCACGTAG CAAATAGCAA AGACGACTCA GGAGATCAAA ACAAGTGGTG GCGAGCTGAA
GTTAACGCAA TGAAACAGCT TGTTCAAAGA GCCCCGTTCG AGCGTGACTA TACGGATGAG
CCGATCAACG TTGTTCTGGT ATTTTTGGAA TCTGTTCGAG CAGATATGAT GCCGTTTGAT
GGCTCTACAC CATGGGCGAG GCGATTTGTG CCCAATATCA CCATCCATGA CAAGATAACA
CCATTTTACG ATCAATGGGT CAGAGACTCC AATTCAACAC TTTACGTTCC TCATATGAAA
TCGGCTTCTG GTTTGACACA CAAAAGCCTT GCAAGTACGC TTTGCTCATT GCATGCCTTG
CCTTTCCACG GCACCGTTGA GCATGCACAA AATCTTTATC ATCCCTGTCT TCCGCAGATA
CTTGACCGAT TAGGGTATGA GAGTCAGTTT TTCAAATCCT TGACCGAAAC CTTTGACCAC
CAAAACCACC TTATGCGAAA CATTGGATAC CCCAGAATGT ATGGGCGAGA GAGCTATGAT
CGTGCGAACC ATCCAACCGC AGCATTCCAA CAAAACCACA CGGCCAATTA TTTTGGTTAT
GAAGACATTG TGTTGCTGGA TCCTGTCACA GAATGGGTGG ATAGCCAGAC CAAACCCTTT
TTCCTTTCGT ATCTGAGCGG TATCACACAT GATCCGTATG CCATTCCACC TTCTGGAGGG
GGTTGGAAAC CACAATCGTT CAGTTTGGAT GAGAAAGTCA ACGGCTTTCT GAACGAAGTG
TCCTACCTTG ACACGTGGCT AGATCTGTTG GTCAAGTCGT TTGAAGATCG CCATTTGATG
AACTCAACTC TGTTTGTCTT TTTGGGAGAC CACGGCGGTC ACTTTAAGGA CCGAGATTCC
CAGTTTACCA CATTCGAGCA AAATTACGAA GAGGCTTTTG ACGTCGGCGT TACGTTTCAC
AGTCGCAATC CGCGTATCCA GAAGCTACTT CAAAAATCAC AGGCATTTGT GAGCGGCAAC
TGGACTTCGC TCGACATTGC TCCGACTTTG CTAGAACTGC TCTTTGGAAG AGCAATAAAC
CCAATGCGAC CAGGATCAGA GAAAGGATCG ATAAACAGTC CATATTCGAA GTACTTTGAT
AGCCGCGCGA GCGCAAGCTG GGTCGACGGT CGGTCCATGC TACGCGAATC AGGCTCCCGA
CTACGCTTGA GTATAGGTAA CCCTGGAGAG AGCTTGATGC TCCGAGATCA ATGTTTTGTG
CTGGTCTTTC CTCTAAAAGA AGATGGCCGT GCTCACCCCG AAGTGTTCAA TATTTGTGCG
GACCCGGGTC AGAATCAACC GTTGCAGCTT CTGCCAGTAT CGTCGTTCTC GATCCCAAAG
TCGGAACTCG AAAGATGGGG CCGAAAAGCC ATGTCATTCT GTCTACAAGT CAAATCAGAT
TTAGTACATG CTCACAAAAC GGGTATGCGT TGCCAAAAGT GCGCACTCGA GGAGTTGGTG
ACATTGGAGA CTTTGGAGAG TTGGTCTCCC ACTGTTGGGG GGGACAAACG TATCCGACGA
TAA
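
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. A minimal sketch of that cleanup, together with a GC-fraction calculation of the kind behind the GC content field, is given here. The helper names are hypothetical, and only the first line of the sequence is used for illustration; the reported 48% applies to the full 2103 bp gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence only, as an illustration.
first_line = "ATGAGGTCCA CGTACATTGT CTCCGTAATT CTTTGGACCG CAAGCGTAAC AGCTGTTGTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this fragment: {gc_fraction(seq):.2f}")
```

To check the whole gene, paste every sequence line into one string and pass it through `clean_sequence` before calling `gc_fraction`.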
 
Protein sequence
MRSTYIVSVI LWTASVTAVV CTVGICAAEV TFWIGHGLGI PWDRVPLAWY RWEEFQSLIS 
SKAEEGALKV WMVIMGQMSW TILLSLFPFP CHKGSSRRQQ KDRITATTSR HLPGTTSAPS
PARGFTHLLT AKKSVAALLT IYGILVLLLR PSIPYTRLST TPLLMVSLEV TKGFQQYQEL
LHVANSKDDS GDQNKWWRAE VNAMKQLVQR APFERDYTDE PINVVLVFLE SVRADMMPFD
GSTPWARRFV PNITIHDKIT PFYDQWVRDS NSTLYVPHMK SASGLTHKSL ASTLCSLHAL
PFHGTVEHAQ NLYHPCLPQI LDRLGYESQF FKSLTETFDH QNHLMRNIGY PRMYGRESYD
RANHPTAAFQ QNHTANYFGY EDIVLLDPVT EWVDSQTKPF FLSYLSGITH DPYAIPPSGG
GWKPQSFSLD EKVNGFLNEV SYLDTWLDLL VKSFEDRHLM NSTLFVFLGD HGGHFKDRDS
QFTTFEQNYE EAFDVGVTFH SRNPRIQKLL QKSQAFVSGN WTSLDIAPTL LELLFGRAIN
PMRPGSEKGS INSPYSKYFD SRASASWVDG RSMLRESGSR LRLSIGNPGE SLMLRDQCFV
LVFPLKEDGR AHPEVFNICA DPGQNQPLQL LPVSSFSIPK SELERWGRKA MSFCLQVKSD
LVHAHKTGMR CQKCALEELV TLETLESWSP TVGGDKRIRR
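
The protein sequence can be compared against a translation of the gene sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is blank. It translates the forward strand from the first base and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored; a codon with a non-ACGT character raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGGTCCA CGTACATTGT CTCCGTAATT"))  # MRSTYIVSVI
```

The result matches the first 10 residues of the protein sequence listed above. The same function can be run on the full gene sequence.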