Gene Tery_4208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4208 
Symbol 
ID4245860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6489249 
End bp6490973 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content42% 
IMG OID638109105 
ProductGTP-binding protein, HSR1-related 
Protein accessionYP_723683 
Protein GI113477622 
COG category[R] General function prediction only 
COG ID[COG2262] GTPases 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR03156] GTP-binding protein HflX 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.552873 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTATCG AAACTATCTA CGGAAATCTG AAAGGTCTAA AATCTAGCCA ACTTAAACAA 
CTGCAAAGGT TGTATCATCA GCGACTAACA AGCGATCGCC TAACTACCTC TGAGTTTGCT
CAAAGAGTCG CCTCCATCAG TACAGAAGTT GGCCAATCAA TCTGCGTATA TATTAATCGT
AGGGGACAAA TTATCCGTGT GGGAGTTGGT ACACCCCACC AAACCCAAAT TCCACCCTTG
GAATTACCTC GTTATGGTAA TGGTCGTCTT AGTGGTATTC GTTGTATTGC TACTAACCTG
AAAGCAGAAG ATCCTTCAGA AGCTGCTCTA ACTGCAATGG TAATTCAACG ACTTGATGCT
CTAGTGATAC TGACGCTAAC CGGTGAAGGA TTTAAACGTC GAGGTGGAGG AGCAACAGGT
TACATTAAAC AAGCTTATCT AGCTCACCTA TTACCCATTC CCACACACGA AGGTGTGAAT
ACTGATATTC AAAAGCTGAA TTTTTGGAGC GTATCACCTC CTCTTAGTCT TGATGCTTTG
GCCAAACAAG ACTTCATTGA CTTAATTGAT GGACTAGAAG CAGAATTTCA ACGAGAATTT
ATTGCTCAAG AAGTTGATTC TGACCAAGAC CTAGTCCTTT TGGTGGGGCT AAAAATAGAT
AAAATGACCA ACCAGCAATT TGAAGATGGA TTAGCTGAAT TAGCTCGACT TGTTGAAACG
GCTGGAGGTA AGGTCCTAAA AACAGTGGAT CAGAAGCGAT CGCATCCCCA TCCCCAAACA
GTTGTAGGAG AAGGTAAAGT TCAAGAAATT GCTCTGAGTG TTCAAACGGT CGGAGCGAAC
CTAGTTGTAT TTGACCAAGA CCTTTCCCCT GCTCAAGTAC GCAACCTAGA AACAAAAATT
GGTGTGAGAG TTGTAGACCG CACCGAAGTT ATTCTGGACA TCTTTGCCCA ACGCGCTCAA
TCTCGCGCTG GAAAGTTGCA AGTGGAATTA GCTCAATTAA AATATACTAT TCCCCATCTT
ACAGGCCAAG GCGAAGCAAT GTCTCGTCTG GGTGGTGGTA TTGGTACGAG GGGACCTGGT
GAAACTAAGT TAGAAACAGA GCGTCGGGCC ATTAACGCTA GGATATCTCG CTTGCAGAAA
GAAGTTAATC AATTACAAGC ACACCGTTCC CGCCTAAGAC AACATCGCCA ACATCAAGAA
GTGCCAACAG TGGCAATAGT GGGATACACG AATGCTGGTA AATCTACATT ATTGAATACA
CTTACCAATT CAGAAGTTTA TGCAGCAGAT AAACTATTTG CTACTCTAGA TCCTATTACT
CGTCGTTTGA ATGTTCCTGA TACTGTTACT GGAAAACCAA CTACAATTGT GATTACTGAC
ACGGTAGGGT TTATTCATGA ACTCCCTCCT ACTTTGATGA ATGCTTTTCG AGCAACTCTA
GAAGAAGTTA CAGATGCAGA CGCTTTACTT CATGTAGTAG ATCTGTCTCA CCCGGCTTGG
CAAAGTCAAA TACGCTCTGT GATGACTATT TTGCAGGAAA TGCCAGAAAC TCCAGGACCT
GCTTTAGTTG CTTTTAACAA AATTGATGAA GTAGATGGAG ATACTTTAAA ATTTGCTCAT
GAAGAATATC CGATGGCAGT ATTTATTTCT GCTGCAAAAG CTCTAGGTTT GGAAACCTTA
CGTCAACGAC TCGCACAGCT TATTGATTAT GTTGTTGCTT CTTGA
 
Protein sequence
MPIETIYGNL KGLKSSQLKQ LQRLYHQRLT SDRLTTSEFA QRVASISTEV GQSICVYINR 
RGQIIRVGVG TPHQTQIPPL ELPRYGNGRL SGIRCIATNL KAEDPSEAAL TAMVIQRLDA
LVILTLTGEG FKRRGGGATG YIKQAYLAHL LPIPTHEGVN TDIQKLNFWS VSPPLSLDAL
AKQDFIDLID GLEAEFQREF IAQEVDSDQD LVLLVGLKID KMTNQQFEDG LAELARLVET
AGGKVLKTVD QKRSHPHPQT VVGEGKVQEI ALSVQTVGAN LVVFDQDLSP AQVRNLETKI
GVRVVDRTEV ILDIFAQRAQ SRAGKLQVEL AQLKYTIPHL TGQGEAMSRL GGGIGTRGPG
ETKLETERRA INARISRLQK EVNQLQAHRS RLRQHRQHQE VPTVAIVGYT NAGKSTLLNT
LTNSEVYAAD KLFATLDPIT RRLNVPDTVT GKPTTIVITD TVGFIHELPP TLMNAFRATL
EEVTDADALL HVVDLSHPAW QSQIRSVMTI LQEMPETPGP ALVAFNKIDE VDGDTLKFAH
EEYPMAVFIS AAKALGLETL RQRLAQLIDY VVAS