Gene Tpet_0390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_0390 
Symbol 
ID5170974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp378579 
End bp380318 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content50% 
IMG OID640562891 
Productextracellular solute-binding protein 
Protein accessionYP_001243992 
Protein GI148269532 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTAGGT TTCTTGGGAT TTTCCTTATG ATTGCAGCCG TTGCACTTTT CGCAGAACTT 
TCACCGTTCG CTCAGTTCGA AACGTACATC GGAGCAGAGT TCACAGGACA GTACGGGGGA
GTGCTCGTAG TACCGACGCT TTCTGGTCCC AGAACGTGTA ACTATGTCGT GGCTCAGGAG
ACGAGCTCCA CAGATGTTAT AGCGAGGTTC ATGGCTTCCA TGATCGAGCT CGACAATCAC
GCGAGGATCC ATCCCGCTCT TGCAGAAAGC TGGGAACTCA TCCAAAACGA GGACGGAAGC
ATGGAAATCG TTTGGCACCT GAGGAAGGGT GTCAAATGGA GTGACGGAAC ACCGTTCACA
GCAGACGATG TGGTCTTCAC AATTAACGAT GTCTATTTCA ACCCTGACAT ACCGAACGAT
ATGCAGGATC TCTTTGCAGA CAACTGGCCA GTAGCCGAAA AGCTCGACGA TTACACGGTG
AAAACCACTC TCAAAGAAAC GTACAGACTC GCGGTAAGGT ACATTGGAGG TATTCCCATC
TTCCCAAAAC ACCTCGCGGA ACCGTACGTG AAGGAAGGAA AATTCAAGGA ATTCTGGACG
GTTGACGCCA TCAACAAGGG AGAAATCGTA GGACTTGGTC CGTTCATTCC GGTTGAGTAC
GTTCCCGATC AGTACGTGAG ATTCGTGAAG AACCCCTACT ACTGGAAGTA CGACAAAGAA
GGAAAGCAGC TTCCTTATCT CGACGGTATC ATCTTCAAGA TCATTCCGAC ACAGGATGCG
CAGAGACTCG CGTTTGAAAA CGGAGAAGTC GATGTGTATG GACCTCGCGG AACAGAGTAC
GCAGAACTCA AAGCCATGGC AAGAGAAAAA GGCTGGGTTG TGGGTATTGG AGGCCCGAAT
TTTGGAACCA CTTTCATCAC TTTCAACTGG AACGCTCCCG ATCCTGTGAA GCGGAAGTGG
TTCAGGAACG ATTTCTTCAG AAGAGCTGTG GCGTACGCCA TCGACAAACA ATCCATGATA
GACACGCTCT ACAACGGCCT CGCGGTTGAA CAGTGGGGTC CCATCAGTCA GGCTGCGACC
GTTTATTACG ACGAATCCGT TCTCAGAAAA TACCCGTACA ACCTTGATCT CGCCAGAACG
ATGCTCAAGC TCGGTGGTTT CAAATGGGAT GAAAACGGCC AGCTCCTCGA CAGCGAAGGA
AACCCGGTGA AGTTCATCAT CATGACTAAC GCCGGACACC AGATCCGCGA AGGAATGGGT
AACATCATAA CAGAAGCGCT CAAAAAACTC GGAATGGATG TCACCTTTGC TCCCATCGAT
TTCAACACGC TCGTCCAGAA GCTCGTTGTG AACGGTGACT GGGAATCTAT CATTATAGGA
CTCACCGGAT CCGATGAACC TCAGGGAGGA GCCAACGTCT GGAGAATCAA GGGATCTCTC
CACTTCTGGA ACTACCATCC GGAGGTCAAA GACTTCGTCG ATCCCAACGA TTACTACCTG
CCAGACTGGG AAAAAGAGAT CGACAGAATC TTCGAAGAGA ACGTGAAGAT ACTCGATCAG
CAGAAAGTCG TAGATATGTT CAGAGAATTT CAGAGGCTCG TGTCTGAACA CATACCGCTC
ATCTACACCA CCCAGCAGCT CTATCTCTAC GCCTACAGCA ACAAACTTCA CAACGTGGAA
CCCACCGCTT TCGGCGGTGT GTGGGGTTGG AACCAGGATT GTGTCTGGAA AGAGCAGTAA
 
Protein sequence
MRRFLGIFLM IAAVALFAEL SPFAQFETYI GAEFTGQYGG VLVVPTLSGP RTCNYVVAQE 
TSSTDVIARF MASMIELDNH ARIHPALAES WELIQNEDGS MEIVWHLRKG VKWSDGTPFT
ADDVVFTIND VYFNPDIPND MQDLFADNWP VAEKLDDYTV KTTLKETYRL AVRYIGGIPI
FPKHLAEPYV KEGKFKEFWT VDAINKGEIV GLGPFIPVEY VPDQYVRFVK NPYYWKYDKE
GKQLPYLDGI IFKIIPTQDA QRLAFENGEV DVYGPRGTEY AELKAMAREK GWVVGIGGPN
FGTTFITFNW NAPDPVKRKW FRNDFFRRAV AYAIDKQSMI DTLYNGLAVE QWGPISQAAT
VYYDESVLRK YPYNLDLART MLKLGGFKWD ENGQLLDSEG NPVKFIIMTN AGHQIREGMG
NIITEALKKL GMDVTFAPID FNTLVQKLVV NGDWESIIIG LTGSDEPQGG ANVWRIKGSL
HFWNYHPEVK DFVDPNDYYL PDWEKEIDRI FEENVKILDQ QKVVDMFREF QRLVSEHIPL
IYTTQQLYLY AYSNKLHNVE PTAFGGVWGW NQDCVWKEQ