Gene Tmel_1849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmel_1849 
Symbol 
ID5297874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermosipho melanesiensis BI429 
KingdomBacteria 
Replicon accessionNC_009616 
Strand
Start bp1834363 
End bp1836207 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content37% 
IMG OID640770117 
Productextracellular solute-binding protein 
Protein accessionYP_001307069 
Protein GI150021715 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000095247 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAAGA TTTTGGTATT TTTGATGGTA GTTTTTGCTG TAGCATTATT TGCGGCGGAT 
CCAAACGTAT TAGTGGATGC AACTATTGGT GAACCAGATA CTCTTGATCC ACATTTGGCA
TACGATACGG CAAGCGGTGA AGTCATTTAT CAGGTGTATG AGAATCTTGT TCAGTACGAT
GGTTCAAGTC TTAACAAATT CTTACCAAGG CTTGCTGCAG AAGTTCCAAG TGTTGAAAAT
GGTTTAATTA GAGATGGTGG TAAAACATAC GTGTTCCCAA TTAGAAAAGG GGTTAAATTC
CACAACGGTA ATGATTTAAC GCCAGAAGAT GTAGAATACA GTTTTGAAAG AGGTTTGCTT
TACAATCCAG CAGGTGGACC TGTGTGGATG CTCTGGTATT CAATATTTGG TGTTTGGAGT
GTAGAGGAAT TTGTAGAAGA AGTTTCAGGA AAATCATATG ATGAGTTATT TGATGAAAAT
GGGAATCCAC TTCCAGGTGC CGAAGAAGTA TTCCAAAAAG TTTACGAGGA AGTTGACAAA
GCTATTGAAG TAGATGGGGA TAATGTTGTA TTCCATCTTA GAAGACCATA TGGGCCATTT
CTAAACGTCA TAGCACAAGG TGGGCATTGG GGTGCAATAT TGGATAAAGA ATGGTGTATC
GAACAAGGAC TTTGGGATGG TCAACCAACC ACCTGGTGGA AATGGCACAA TCAAAGGAAA
GAAGATTCTC CATTGTATCA GAACGCAATG GGAACAGGTC CATACAAATT TGTTGAGTGG
GATAGAGCAC AACAAAAGGT TATATTAGAA GCAAATGAAA ATTACTGGAA AGAACCTGCA
AAGATCAAAA AAGTAGTAAT TTGGGGGATT GATGAATATT CCACAAGAAA GGCTATGCTT
GAAAAAGGAG ATGCCGATAT TGCATATATT CCAACACAAT ATCTTGATCA AGTAAGAGGA
AATCCTGACA TTGAAATTAT AGAAGGACTT CCAACGCTTT CTATTACAGT ATTAGCATTT
AATTGGTCAA TTAGAGAAGA TAGTAAATAT ATTGGAAGCG GAAAATTAGA TGGTAATGGT
ATTCCAGTTG ATTTCTTTAA CGATGTACAC GCAAGAAAAG CAATAGCACA TGTAATTTAT
TATGATGCGC TTATCAATGA TGTGTTGAAA GGCTTTGGAA AAAGAATACC AACGGCGTTA
CCAGAAGGTT TACTTGGATT TGATCCATCA TTACCACTTT ATGATTTTAA TTTGATTAAA
GCAAGACAAG AATTAATGCA AGCATGGAAT GGTGAAGCAT GGAAAAAAGG ATTTAAATTT
TCAGTTGCAT ATAACTTGGG TAACGAAGCA AGACAAAGAA CTGCTGAAAT GGTTAAAATG
TATCTTGAAA TGTTAAATCC AAAGATAAAG GTTGATGTTG TTGGATTGCA ATGGCCAACA
TTCTTAGATG CGACAAAACG TGGTGAACTT CCAATATTCA TTCTTGGATG GCTTGCAGAC
TATCCCGACC CAGATAACTT CATATTCACG TATTACGATT CGAAAGGTGA TTATAGCTCA
AGACAAGGTA AAAACTTCCA AGTGTTTGTA AGTACACCGC GTCCAGAACT TGGTGGTAAG
AGCTTAGATG ATCTTATCGA AGAAGCAGCA GCTGAAACTG ATGCTGCAAA AAGGGCGGAA
TTGTATGCAA AGGTACAAAA ATTTGTAGTT GAAAATGCAA TTAGTGTTCC ACTTTATCAA
CCTATTGGTG TGAGGGTTCA CAGAAAATGG CTCAAAGGTT GGTATCCAAA CGCAATGAGA
CCAGGTGACG ATTATTACGC ATACTGGTTT GAAGGAAAAG AATAA
 
Protein sequence
MKKILVFLMV VFAVALFAAD PNVLVDATIG EPDTLDPHLA YDTASGEVIY QVYENLVQYD 
GSSLNKFLPR LAAEVPSVEN GLIRDGGKTY VFPIRKGVKF HNGNDLTPED VEYSFERGLL
YNPAGGPVWM LWYSIFGVWS VEEFVEEVSG KSYDELFDEN GNPLPGAEEV FQKVYEEVDK
AIEVDGDNVV FHLRRPYGPF LNVIAQGGHW GAILDKEWCI EQGLWDGQPT TWWKWHNQRK
EDSPLYQNAM GTGPYKFVEW DRAQQKVILE ANENYWKEPA KIKKVVIWGI DEYSTRKAML
EKGDADIAYI PTQYLDQVRG NPDIEIIEGL PTLSITVLAF NWSIREDSKY IGSGKLDGNG
IPVDFFNDVH ARKAIAHVIY YDALINDVLK GFGKRIPTAL PEGLLGFDPS LPLYDFNLIK
ARQELMQAWN GEAWKKGFKF SVAYNLGNEA RQRTAEMVKM YLEMLNPKIK VDVVGLQWPT
FLDATKRGEL PIFILGWLAD YPDPDNFIFT YYDSKGDYSS RQGKNFQVFV STPRPELGGK
SLDDLIEEAA AETDAAKRAE LYAKVQKFVV ENAISVPLYQ PIGVRVHRKW LKGWYPNAMR
PGDDYYAYWF EGKE