Gene Tery_2372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2372 
Symbol 
ID4245020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3663405 
End bp3664739 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content42% 
IMG OID638107465 
Producthypothetical protein 
Protein accessionYP_722065 
Protein GI113476004 
COG category[S] Function unknown 
COG ID[COG3597] Uncharacterized protein/domain associated with GTPases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAACTG AAATGACTGA AACAAATGAT ATTAACTCCC CGGCAGCTAA TTCTGAACAG 
TCTACCCAAA CTACTCAGGA TGTAAAAACA AAATCAAAAA TAGACTCTTG GCAAAGTCAT
GTCAGTGGAA TTTGGAATGA TACCAACAAA CGTTTCATGA AACTTCTCCC CATTGGTAAG
GCAAGGCAAA CAATAATGCA ATGGTTTACT GTGGATGAAG CTGAAGTTGC AAAGATATTA
GTTAAGGTAC GCGCCGAACT TCCTACCACT GAGGCTATTT TAATTGGTAA ACCCCAAACA
GGTAAAAGTT CCATAGTACG CGGTTTGACA GGGGTTTCAA AAGAAATAAT CGGTCAAGGT
TTTCGCCCCC AGACTCAACA CACACAACGT TATGCTTATC CTTCTGATGA TTTGCCATTG
TTGATTTTTA CTGATACTGT GGGGTTAGGA GATGTGAAAC AGGAAACTCC TATTATTATT
CACGAGCTTT TGGCAGAGTT GCAGCAGGAA AGCAACCGCG CCAGAATCTT GATTTTGACT
GTAAAAATTA ATGATTTTGC TACGGATACT TTACGACAGG TAGCACAACA GTTGCGACAG
GAATATCCAA ATATTCCTTG TTTGTTAGCT GTGACTTGTC TGCACGAAGT TTACCCTCAC
TCCACTGAGG ACCATCCCGA ATATCCTCCA GAATATGAGG AGCTCCAGCG CGCTTTTGCT
ACTATTAAAG AAGATTTTGC CGAGTTGTAT AATAGTTCTG TATTGCTCGA CTTTACTTTG
GAAGAGGATG GGTATAACCC GGTGTTTTAT GGTTTAGAAG CCTTTAGAGA TAATTTAGCA
CAATTACTCC CAGAAGCAGA AGCACGAACA ATTCATCAAT TATTAGATGA GCAAACAAGT
AAACAACTGG GGAATATTTA CCGAGATGTT GCGAGGCGTT ACATTTTGTC ATTTACTATT
ATGGCAGCTA CGGCAGCAGC AGTACCTTTG CCTTTTGCAA CTATGCCAGT ATTGACAGCG
TTGCAAGTGT CGATGGTGGG TTTATTGGGA AATTTGTATG GGCAAACAAT TTCTCCATCA
CAAGCTGGAG GGGTTGTTAG TGCTATTGGT GGCGGTTTTT TAGCTCAAGC AGTGGGGCGA
GAGTTGATTA AATTTGTGCC TGGGTTGGGG AGCGCGATCG CTGCTTCTTG GGTAGCCGCT
TATACTTGGG CTTTAGGTGA AAGTGCTTGT GTTTATTTTG GAGATTTGAT GGGAGGAAAA
AAGCCAGATC CACAAAAAAT TCAGGGAGTA ATGCAGGAGG CTTTTGAGTC GGCACAGGAA
AGATTTAAAG GCTAG
 
Protein sequence
MITEMTETND INSPAANSEQ STQTTQDVKT KSKIDSWQSH VSGIWNDTNK RFMKLLPIGK 
ARQTIMQWFT VDEAEVAKIL VKVRAELPTT EAILIGKPQT GKSSIVRGLT GVSKEIIGQG
FRPQTQHTQR YAYPSDDLPL LIFTDTVGLG DVKQETPIII HELLAELQQE SNRARILILT
VKINDFATDT LRQVAQQLRQ EYPNIPCLLA VTCLHEVYPH STEDHPEYPP EYEELQRAFA
TIKEDFAELY NSSVLLDFTL EEDGYNPVFY GLEAFRDNLA QLLPEAEART IHQLLDEQTS
KQLGNIYRDV ARRYILSFTI MAATAAAVPL PFATMPVLTA LQVSMVGLLG NLYGQTISPS
QAGGVVSAIG GGFLAQAVGR ELIKFVPGLG SAIAASWVAA YTWALGESAC VYFGDLMGGK
KPDPQKIQGV MQEAFESAQE RFKG