Gene Tmel_1386 details


Gene Information

Locus tag: Tmel_1386
Symbol:
ID: 5298262
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermosipho melanesiensis BI429
Kingdom: Bacteria
Replicon accession: NC_009616
Strand:
Start bp: 1388719
End bp: 1390455
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 35%
IMG OID: 640769660
Product: extracellular solute-binding protein
Protein accession: YP_001306618
Protein GI: 150021264
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
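
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function and variable names are hypothetical and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1388719
end_bp = 1390455


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length(start_bp, end_bp)
aa_len = protein_length(gene_len)
print(gene_len, aa_len)  # 1737 578
```

Both results agree with the Gene Length and Protein Length fields of the record.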


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.37673
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAC TTGCCGTATT TTTGGTTGTA CTTGCAGTGG TATTATCTTT TGCTGCTCAA 
CTTCCATATA TTGGGGCAGA TGCTAACGGT AAACCAGGTG GTCAGTTTGT GTTGGGAACA
TTAAGTGGTC CAAAAACTGT AAATGATGTG GCTGCTAAAG AAACTAGTTC TACCGATGTT
ATTGATATGT TTATGGGATA TGGAGGTACA TTGATTGAAA GACATGGTGT AGATATGAAG
TTCTATCCTG CTATTGCTGA AAACTGGGAA GGTCCAAGGC TTACAGCAGA TGGTGGTATG
GAAATTATTT GGCACATTAG AAAAGGTGTA AAGTTCAGTG ATGGTAACCC TTTAACAGCA
GATGATGTAG TATTTACTTT AAATGATATT TATACAAATC CAGATATTCC AAGTTCTATG
CAAGATATTT TAAAGAGTAC AAATGGATAT TTACCAAAGG CCGAAAAAAT TGACGATTAC
ACAGTAAGGA TGTACTACCC AGAACCATTT AGACTTGCGT TTAGATATTT AGGTGGGATG
TATATATTCC CAAAACATAT TGCTGAAGAA TACGTTAAAA ACGGTACATT CCAGGAATTT
TGGACAGTTG AAGCTATTAA CGAAGGGAAA ATTGTGGGGC TTGGTCCATA TATTCCTGTA
GAGTATGTTC CAGACCAATA CGTAAGGTTT GTAAAAAATC CATACTATTG GAAAAAAGAT
GCAAATGGCA ATCAACTACC ATATTTTGAT GAAGTAATTT ATAAAATTAT TTCCAATCAA
GATGCAATGA GACTTGCATT TGAAAATGGA GAAATTGATG TATATGTACC AAGAGGAACG
GAATTTGCAG AATTAAAAGA AAAAGAGAAG GAATTAAATA TCGTAGTTAC AACAGCAGGA
CCAGCTTATG GTACACTATT TATCACATTT AATTGGAATA CACCAGATCC GGTAAAAAGA
AAATGGTTTA GAAACGAATA TTTTAGAAAA GCTGTTGCAC ATGCTATAGA CAAACAATCA
ATTATCGATA CTCTTTACAA CGGTCTTGCA ATTCCACAAT GGTCACCAGT ATCAATGAGT
TCCCCATATT ACAATGAAGA TGTAGTAACA AAATATGAAT TCGATCTTGA CCTTGCAAGA
GCAATGCTTG AAATGGGCGG CTTTAGTTGG GACGAAAATG GACAACTTAT CGACGAAGAT
GGAAATCCAG TTAAATTCTT ACTTACAACA AATGCAGGGA ACAGAGTGAG AGAGGGTGCT
GCAAATATTA TTCAAGATGC TTTGAAGAAA CTTGGAATGG ATGTAACATT TACACCAATC
GATTTCAATA CTTTGGTACA AAAATTGTTG AATACAGGTG ATTGGGAGGC AGTAATTATT
GGTATAACTG GTAGTGATGA ACCACAAGGT GGAGCAAACA CATGGAAAAT TGACGGAGGA
TTACATTTCT GGAACTATTC ACCAGAAGTT GCAGAATATG TTGATGCAAA TGATTACTAC
TTGTCAGATT GGGAGCTTGA AATTGATAAG ATATTTAAAG AAAATGTAGC AATATTGGAT
GAAGATATTG TAAAAGACTA CTTTGCTAGA TTCCAACAAT TAGTATCCGA ACACTTACCA
TTAATTTACA CGGTTAATAC ATTGAGACTA TATGCTTACA AGGCAAACTT GAAAAATGTG
AAGATAGGTC CACTTGGTGG AATGACGTGG AACATTTATG AGGAATGGAA GAAATAA
 
Protein sequence
MKKLAVFLVV LAVVLSFAAQ LPYIGADANG KPGGQFVLGT LSGPKTVNDV AAKETSSTDV 
IDMFMGYGGT LIERHGVDMK FYPAIAENWE GPRLTADGGM EIIWHIRKGV KFSDGNPLTA
DDVVFTLNDI YTNPDIPSSM QDILKSTNGY LPKAEKIDDY TVRMYYPEPF RLAFRYLGGM
YIFPKHIAEE YVKNGTFQEF WTVEAINEGK IVGLGPYIPV EYVPDQYVRF VKNPYYWKKD
ANGNQLPYFD EVIYKIISNQ DAMRLAFENG EIDVYVPRGT EFAELKEKEK ELNIVVTTAG
PAYGTLFITF NWNTPDPVKR KWFRNEYFRK AVAHAIDKQS IIDTLYNGLA IPQWSPVSMS
SPYYNEDVVT KYEFDLDLAR AMLEMGGFSW DENGQLIDED GNPVKFLLTT NAGNRVREGA
ANIIQDALKK LGMDVTFTPI DFNTLVQKLL NTGDWEAVII GITGSDEPQG GANTWKIDGG
LHFWNYSPEV AEYVDANDYY LSDWELEIDK IFKENVAILD EDIVKDYFAR FQQLVSEHLP
LIYTVNTLRL YAYKANLKNV KIGPLGGMTW NIYEEWKK
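
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the sense strand is given 5' to 3' starting at the first codon, and it uses the amino acid assignments of NCBI translation table 11 in the standard TCAG codon ordering. The helper name `translate_cds` is hypothetical, and only the first 60 nt of the gene sequence are used here to keep the example short.

```python
# Amino acids for all 64 codons in NCBI order (bases T, C, A, G at each position).
# Translation table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
TABLE_11_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        aa = TABLE_11_AAS[16 * b1 + 4 * b2 + b3]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 nt, i.e. 20 codons).
first_line = "ATGAAGAAAC TTGCCGTATT TTTGGTTGTA CTTGCAGTGG TATTATCTTT TGCTGCTCAA"
print(translate_cds(first_line))  # MKKLAVFLVVLAVVLSFAAQ
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same function would be one way to compare the translation against the complete protein record.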