Gene Tery_4188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4188 
Symbol 
ID4245840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6453948 
End bp6455345 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content32% 
IMG OID638109087 
Productextracellular solute-binding protein 
Protein accessionYP_723665 
Protein GI113477604 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTACGA AGAGTAAATT ATGTAAATTA ATTGGGTTAT TTATACTTGC TTTAATATTA 
GTTACTTGTA GGGTTAATCC AAATAAGGAT AATATCCAAA AGCCTCAATC AGATCAAGTA
TTAACAATTT GGTGGAATCG AGGATACTAT CCGGAGCAAG AAGAAGCTCT TAAAAAAGTA
GTTGTTGACT GGGAGGAAAA AACAAATAAC AAAGTTAAAC TTTTGTTTTT CAGTGAGGAT
GATATATTAC AAGCAGCTAT TGATGCTTTG GAAGTAGGTA AAACACCTGA TATTCTCTTT
TCCGAAAGAG CAGAGTTTAC TTTAATTCCC CAGTGGGCTA GAGAAGGAAA ATTAGTAGAT
GTTTCAGATG TAATTAAGCC TGTAAAAAAA TCCTATGATA CAACTGCACT GAACTCTTCT
TATTTGTACA ACCAAGTTAA GTCTAAATCT TCTAATTATA CTGTGCCAAT AATGCAGCAA
ACTCTCCATG TTCATTACTG GCTTGATTTA ATTAGTGAAG CTGGTTTGGG TGAAGAAATA
CCAGAAGAGT GGGACGAGTT TTGGCAGTTT TGGCAAAAGG CACAAAAAGT TTTGCGTGAA
AAAGGTCAAG ATAATATTTA TGCTTTAGGT TTGCCTATGT CTATTAATAG TACAGATACT
TACATTATAT TTAAACAAAT CTTGGAAGCT AACAATTTAC AAATAGTAGA TAAACAGGGA
AAATTACAAG TCGATAAACC AGAAATAAGA CAAAAAATTA TTGATATTTT AGACTGGTAT
ACTAGCTTTT ATAAAAATGG GTATGTGCCA CCTAAAGCAG TTAACTGGAG TAATTCTGAT
AATAATATTA GTTTCCTCAA TCAGAATACT TTAATGACAA TTAATCCAAG TATGTCAATT
CCTGGTTCTC AACAAGAAGA TGAAGAAATT TACTTAAATA AAATGAGAAC TATAGAGTTT
CCGAATAATC CTAATGGAGG TGCTCCAACA TATTTAGTAT CTGTCAAAGA ACCTATAATT
TTTACATCTT CTCCTAATCC ATTATTGGCG AAAAATTTTC TTTCATACTT AGTAAAACCA
GATAATTTAG GACCTTATAT TAAAGGTGCA AAAGGTCGTT ATTTTCCAAT TATGCCTAAG
CTCTGGAAAG ACCCTTTTTG GAGTAATATA AAAGATCCCC ATATTTCTGT TGCTTCTCAA
CAATTTACTA AATCTCAGAC TCGTTTACTT CACAATTCTA TCAACCCAGC TTATTCTCAA
ATAGATTCAG AAAATATTTG GGGAAAAGCC ATGGCAAAAG TGTTAATTGA AGGCTTATCA
CCAACTGCTG CAACAGACCA AGCTATTAAT CAAATTAAAG AAATATTTGC TCAGTCAAAA
ACTCAGAATG AAAGGTAA
 
Protein sequence
MITKSKLCKL IGLFILALIL VTCRVNPNKD NIQKPQSDQV LTIWWNRGYY PEQEEALKKV 
VVDWEEKTNN KVKLLFFSED DILQAAIDAL EVGKTPDILF SERAEFTLIP QWAREGKLVD
VSDVIKPVKK SYDTTALNSS YLYNQVKSKS SNYTVPIMQQ TLHVHYWLDL ISEAGLGEEI
PEEWDEFWQF WQKAQKVLRE KGQDNIYALG LPMSINSTDT YIIFKQILEA NNLQIVDKQG
KLQVDKPEIR QKIIDILDWY TSFYKNGYVP PKAVNWSNSD NNISFLNQNT LMTINPSMSI
PGSQQEDEEI YLNKMRTIEF PNNPNGGAPT YLVSVKEPII FTSSPNPLLA KNFLSYLVKP
DNLGPYIKGA KGRYFPIMPK LWKDPFWSNI KDPHISVASQ QFTKSQTRLL HNSINPAYSQ
IDSENIWGKA MAKVLIEGLS PTAATDQAIN QIKEIFAQSK TQNER