Gene Tery_0499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0499 
Symbol 
ID4242347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp798757 
End bp800184 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content37% 
IMG OID638105813 
Productsmall GTP-binding protein 
Protein accessionYP_720427 
Protein GI113474366 
COG category[R] General function prediction only 
COG ID[COG1100] GTPase SAR1 and related small G proteins 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.362987 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.723657 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTCG ACCGAGAACT AGACGAAACA ATATCAAGCT TTGCAGATAT TCAAGCAGAA 
CTTAATATCT GTAATGCGAA AGATGCCTTA AAAGAAATCG TAATTAATTT AGACTTAACT
CCGGAGGAAA GGGCAGGATT AGACTCTGAA ATTTCTGGGT TAGAGTCAAT GTTAGAAAAG
TTAGAAAAAG CTTCTGTTTA TATTGCTGTA TTTGGCATGG TAGGCCGAGG TAAGTCTTCA
CTTCTTAATG CTTTATTAGG AGAAAAAGTA TTTGAAACTG GTCCAATTCA TGGTGTTACT
AAAACGACAA AAATTAGACA ATGGCAATTA GGAAGAGATA ATTTAACGGA AATAGAATCA
TCATCTACAA ATTCTTTAAC TTCCTACAGA ATATCTCAAG TGGAATTAAT TGATACTCCT
GGTATAGATG AAGTAGACGG AGAAACTAGG GAATTAATGG CCCGACAGGT GGCAAAACAG
GCAGATTTAC TTTTATTTGC TGTAGCGGGA GATATTACCC AAGTAGAGTA TGAAGCTCTT
TCTTATTTAA GAGATGCTGG TAAACCAATT TTACTTGTGT TTAACAAAAT AGACCAGTAT
CCAGAAACAG ATAGGTTAGC TATATATAAT AAAATTCGCA ATGAGCGAGT TCGTGAATTA
CTCTCACCAA ATGAAATAGT TATGGCTGCT GCTTCTCCAC TACTTCCTAA AGCGGTGCCA
GCTTCTGATG GTAGTCTCAG AGTAGAGATG GTGCGAGGAG AACCTCAAGT AGAAGACTTA
AAGCTGAAAA TATTGGAAAT TTTAGATGGA GAAGGTAAGT CTTTGGTGGC CCTCAATAGT
ATGCTCTATG CTGATGATGT GAATGAACAG TTAGTACGCC GGAAAATGGA AATTCGCTCT
CAAAGTGCAG ATGGGGTTAT ATGGAAGGGG GTGATGACAA AAGCAATGGC GATCGCTCTC
AACCCTGTGA CTGTGGTAGA TATTCTCAGT GGGGCGATTA TTGATGTATC TATGATATTG
ACTTTATCAA AGTTATATGG TATAAAAATG AATCAAAGAG GAGCATTAGA CTTATTACAA
AAGATAGCTA TTAGTATGGG TGGTATAACT GCGAGTGAAT TAGTGGCAAA TCTAGGATTA
AGTTCTGTGA AAGGATTATT GGGTTTAGCT GCACCTGTTA CTAGTGGGTT ATCTTTAGGG
CCCTATTTGT CTGTAGCAGC GACTCAAGCA GCAGTAGGAG GTGTATCATC TTATGGAATT
GGGCAAATAA CTAAAACATA TCTTGCTAAT GGTGCTTCTT GGGGAGAGGA AAGTCCGAAA
ACAGTAGTTA ATAATATTTT GGCATCTTTG GATGAAAATT CGATTATGGG TCGTATTAAG
GAAGAATTAA TAGCTAAATT AAATATCAAA AATTTTAGTA ATCAATAA
 
Protein sequence
MNLDRELDET ISSFADIQAE LNICNAKDAL KEIVINLDLT PEERAGLDSE ISGLESMLEK 
LEKASVYIAV FGMVGRGKSS LLNALLGEKV FETGPIHGVT KTTKIRQWQL GRDNLTEIES
SSTNSLTSYR ISQVELIDTP GIDEVDGETR ELMARQVAKQ ADLLLFAVAG DITQVEYEAL
SYLRDAGKPI LLVFNKIDQY PETDRLAIYN KIRNERVREL LSPNEIVMAA ASPLLPKAVP
ASDGSLRVEM VRGEPQVEDL KLKILEILDG EGKSLVALNS MLYADDVNEQ LVRRKMEIRS
QSADGVIWKG VMTKAMAIAL NPVTVVDILS GAIIDVSMIL TLSKLYGIKM NQRGALDLLQ
KIAISMGGIT ASELVANLGL SSVKGLLGLA APVTSGLSLG PYLSVAATQA AVGGVSSYGI
GQITKTYLAN GASWGEESPK TVVNNILASL DENSIMGRIK EELIAKLNIK NFSNQ