Gene Tery_1459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1459 
SymbolproX 
ID4245773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2207730 
End bp2208740 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content31% 
IMG OID638106612 
Productglycine betaine transporter periplasmic subunit 
Protein accessionYP_721222 
Protein GI113475161 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0408344 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAA AAAACCTAAA AATTATCTTA GGAACAATTT TAACGAGCAT ATTAATAATT 
TTTACATCTT GTCAACAAGT TCCTAATTCT CAAACGGTAA CTATTCGGAC TGCCCATAGT
ACTTGGATAG AAGAGTCATT TCAAACTAAT ATTGTTAATA TTGGTCTAGA AAAACTCGGG
TATAAGGTCG AAGAACCAAA ACAAATTGAA TACACAGCTA TCTATATTTC TATTGCTAAT
GGAGACTTAG AATATAGTAC TATTTACTAC CAACCATCTC ATGAAAAATT CTTTGAAAAA
GCTGGTGGTG AAGAAAAGTT AGAGGCAGTT GGAATTTTGA CACCTGATGG GATAGCTGGA
TATCAAATAG ACAAAAAAAC AGCGGATAAA TATCAAATTA CTAACATTAA AGAATTAAAA
AATCCAGAAA TAGCCAAGCT TTTTGACTGG GATAAAAACG GTAAGGCAAA TTTAGTTGGT
TGTAATCCTG GTTGGGCTTG TGAGTTAAAT ATAGACCATC ACATCGAAGC TTATGGACTA
GAAAATACAG TAGAACATAG TCGAGGTCAA TATAATATTC TATTAGTAGA TGCTATAACT
CGCTATAAAC AAGGGCAACC TATTCTTTAT TTTGCCTATA ATCCTCATTG GATATCTGCT
ATCTTAAAAC CAGGTAAAGA TGTAGTTTGG TTAGAAGTGC CTTTTACCTC TTTACCAGAT
ATGAGAAATA TCACAAAAAA AGATACATTA CTAGATGGAA AAAATATTGG TTTTTCTAGA
ACTCAACAGA GCATTGTGGC TAATAAAAAA TTTTTAGAGT CTAACCCAGT GGCTAAAAGG
TGGTTTGAAT TAGTAGAAAT TCCAGTTGCA GATATGAATA CTGAAAGTTT ACGAATTAAA
GAAGGTGAAG ACAAAGAAGA AGATATTTTA CGTCATGCTC AAGAGTGGAT AAAAAATAAT
CAAGAAAAAT ATAATAATTG GTTAGAAATA GCAAAAAAAG CAGCTAATTA A
 
Protein sequence
MKQKNLKIIL GTILTSILII FTSCQQVPNS QTVTIRTAHS TWIEESFQTN IVNIGLEKLG 
YKVEEPKQIE YTAIYISIAN GDLEYSTIYY QPSHEKFFEK AGGEEKLEAV GILTPDGIAG
YQIDKKTADK YQITNIKELK NPEIAKLFDW DKNGKANLVG CNPGWACELN IDHHIEAYGL
ENTVEHSRGQ YNILLVDAIT RYKQGQPILY FAYNPHWISA ILKPGKDVVW LEVPFTSLPD
MRNITKKDTL LDGKNIGFSR TQQSIVANKK FLESNPVAKR WFELVEIPVA DMNTESLRIK
EGEDKEEDIL RHAQEWIKNN QEKYNNWLEI AKKAAN