Gene Tery_1458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1458 
SymbolproX 
ID4245772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2206086 
End bp2207102 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content32% 
IMG OID638106611 
Productglycine betaine transporter periplasmic subunit 
Protein accessionYP_721221 
Protein GI113475160 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.104794 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA CACCGCTAAA AATAATCTTA GGCACAATTA TAACTAGCTT ATTAATAATT 
TTTACCTCCT GTCAACAAGT ACCCAATCCC TCAAGTGTCA TTATTAGTAG TGCTTATAAT
AGTGGTTGGA TAGAAGAATT ATTTCAAACT GAAATTGTTA ATATTGGTCT AAAACAATTA
GGCTACAAAG TCAATAAACC AAAACAAAGT GATTATCCAG TAATTTATAT TTCTATTGCC
AATGGAGACT TAGAATACAG CACAGTTTAC TATCAACCAG GGCATGAAAA ATTCTTCACA
AATGCTGGTG GAGAAAAAAA ATTAGAGGGA GTTGGATTTT TAACACCCCC AGGAAAGCAA
GGATATCAAA TTGACAAAAG AACGGCAGAT AAATATAATA TTACTAATCT TAAACAATTA
AAAGATCCAG AAATAGCCAA ACTTTTTGAT TCAGATAAAA ATGGCAAAGC AAATTTAGTT
GGTTGTAATC CTGGTTGGGC TTGTGAGTTA AATATAGACC ATCAACTTGA AAAGTATGAA
CTGGAAGATA CAGTAGAACA TAATAGTGGA CAATATACAG TGCTTCTGGC AGATGCTATT
ACTCGTTATA AGAAAGGCGA ATCTATTCTC TATTATGCCT ATAATCCCCA CTGGATATCT
GCAGTTTTAA AACCAGGGGA AGATGTAGTC TGGTTAACAG TTCCTTTCAC ATCTTTACCA
AATAATATGG CAAGTCTAAG TGAAAAAGAT ACGTCAGTAG ATGGAAAAAA TCTTGGTTTT
CCAAGAAGTA AACAGAGCAT TGTGGCTAAT AAAAAGTTTA TAGAATCTAA TCCAGTAGCT
AAAAAGTGGT TTGAATTAGT AGAAATTCCA ATAGCAGATA TGAATAGGGA AAGTTTACGC
ATTAAAGAGG GTGAAGACAA AGAAGAAGAT ATTTTGCGTC ATGCTCAAGA ATGGGTAAAA
AATAATCAGG AAAAATATGA TACTTGGTTA AAAATTGCTA GACAAGCATC TAATTAA
 
Protein sequence
MKKTPLKIIL GTIITSLLII FTSCQQVPNP SSVIISSAYN SGWIEELFQT EIVNIGLKQL 
GYKVNKPKQS DYPVIYISIA NGDLEYSTVY YQPGHEKFFT NAGGEKKLEG VGFLTPPGKQ
GYQIDKRTAD KYNITNLKQL KDPEIAKLFD SDKNGKANLV GCNPGWACEL NIDHQLEKYE
LEDTVEHNSG QYTVLLADAI TRYKKGESIL YYAYNPHWIS AVLKPGEDVV WLTVPFTSLP
NNMASLSEKD TSVDGKNLGF PRSKQSIVAN KKFIESNPVA KKWFELVEIP IADMNRESLR
IKEGEDKEED ILRHAQEWVK NNQEKYDTWL KIARQASN