Gene Tpet_1542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_1542 
Symbol 
ID5170634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp1532164 
End bp1534167 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content49% 
IMG OID640564069 
Productcarbohydrate binding module 27 
Protein accessionYP_001245126 
Protein GI148270666 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3934] Endo-beta-mannanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000596448 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTAGGT TTATGTTCAT TTTATCGATC GTTGCTCTCT CTTTCGTTCT CTTTGCAGAT 
GAGTTCGTGA GAGTGGAAAA CGGAAAATTC GTCCTTAATG GAAAAGAATT CAGATTCATT
GGGAGTAACA ACTACTACAT GCACTACAAG AGCAACAGAA TGATAGACAG TGTTTTGGAG
AGCGCCAGGG ATATGGGAAT AAAGGTGCTC AGAATCTGGG GTTTCCTCGA CGGGGAGAGT
TACTGCAGAG ACAAGAACAC CTACATGCAT CCTGAGCCCG GTGTTTTCGG AGTGCCGGAA
GGGATCTCAA ACGCCCAGAA TGGTTTCGAA AGACTCGACT ACACGATAGC GAAAGCGAAA
GAACTTGGCA TAAAACTCAT CATCGTTCTT GTGAACAACT GGGACGACTT TGGTGGGATG
AACCAGTACG TGAGGTGGTT CGGAGGAACC CACCACGACG ATTTCTACAG AGATGAAAGA
ATCAAAGAAG AGTACAAAAA GTACGTGTCT TTCCTCATAA ACCATGTCAA CGTCTACACG
GGAGTTCCTT ACAGGGAAGA GCCCACCATC ATGGCCTGGG AGCTTGCAAA CGAACTGCGC
TGTGAGACGG ACAAATCGGG GAACACGCTC GTTGAGTGGG TGAAGGAGAT GAGCTCCTAC
ATAAAGAGTC TGGATCCCAA CCACCTCGTG GCTGTGGGGG ACGAAGGATT CTTCAGCAAC
TACGAAGGAT TCAAACCTTA CGGTGGAGAA GCCGAGTGGG CCTACAACGG CTGGTCCGGT
GTTGACTGGA AGAAGCTCCT TTCGATAGAG ACGGTGGACT TCGGCACGTT CCACCTCTAT
CCGTCCCACT GGGGTGTCAG TCCAGAGAAC TATGCCCAGT GGGGAGCGAA GTGGATAGAA
GACCACATAA AGATCGCAAA AGAGATCGGA AAACCCGTTG TTCTGGAAGA ATATGGAATT
CCAAAGAGTG CGCCAGTTAA CAGAACGGCC ATCTACAGAC TCTGGAACGA TCTGGTCTAC
GATCTCGGTG GAGATGGAGC GATGTTCTGG ATGCTCGCGG GAATCGGGGA AGGTTCGGAC
AGAGACGAGA GAGGGTACTA TCCGGACTAC GACGGTTTCA GAATAGTGAA CGACGACAGT
CCAGAAGCGG AACTGATAAG AGAATACGCG AAGCTGTTCA ACACAGGTGA AGACATAAGA
GAAGACACCT GCTCTTTCAT CCTTCCAAAA GACGGCATGG AGATCAAAAA GACCGTGGAA
GTGAGGGCTG GTGTTTTCGA CTACAGCAAC ACGTTTGAAA AGTTGTCTGT CAAAGTCGAA
GATCTGGTTT TTGAAAATGA GATAGAGCAT CTCGGATACG GAATTTACGG CTTTGATCTC
GACACAACCC GGATCCCGGA TGGAGAACAT GAAATGTTCC TTGAAGGCCA CTTTCAGGGA
AAAACGGTGA AAGACTCTAT CAAAGCGAAA GTGGTGAACG AAGCGCGGTA CGTGCTTGCA
GGAAAGGTGG ATTTCTCTTC CCCGGAGGAG GTGAAAAACT GGTGGAACAG CGGAACCTGG
CAGGCAGAAT TTGAGTCACC TGACATTGAA TGGAACAGTG AGGTGGGAAA TGGTGCGTTG
CAGTTGAACG TGAAGCTGCC TGGAAAGAGC GACTGGGAAG AAGTGAGGGC AGCGAGGAAG
TTCGAAAAGC TCTCCGAATG TGAGATCCTC GAGTATGACA TCTACATTCC AGACGTCGAA
GGGCTCAAAG GAAGGTTGAG ACCGTACGCG GTTCTGAACC CCGGCTGGGT GAAGATAGGC
CTCGATATGA ACAACACAAG CGTGGAAAGT GCGGAGATCG TCACTTTCGG TGGAAAAGAG
TACAGAAAAT TCCACGTAAG GATTGAATTC GACAAGACAG CGGGGGTGAA CGAGCTTCAC
ATAGGAATTG TCGGTGATCA TCTGAAGTAC AATGGACCGA TTTTCATCGA TAATGTAAAA
CTCTACACAA AGGAGGCTGA ATAA
 
Protein sequence
MRRFMFILSI VALSFVLFAD EFVRVENGKF VLNGKEFRFI GSNNYYMHYK SNRMIDSVLE 
SARDMGIKVL RIWGFLDGES YCRDKNTYMH PEPGVFGVPE GISNAQNGFE RLDYTIAKAK
ELGIKLIIVL VNNWDDFGGM NQYVRWFGGT HHDDFYRDER IKEEYKKYVS FLINHVNVYT
GVPYREEPTI MAWELANELR CETDKSGNTL VEWVKEMSSY IKSLDPNHLV AVGDEGFFSN
YEGFKPYGGE AEWAYNGWSG VDWKKLLSIE TVDFGTFHLY PSHWGVSPEN YAQWGAKWIE
DHIKIAKEIG KPVVLEEYGI PKSAPVNRTA IYRLWNDLVY DLGGDGAMFW MLAGIGEGSD
RDERGYYPDY DGFRIVNDDS PEAELIREYA KLFNTGEDIR EDTCSFILPK DGMEIKKTVE
VRAGVFDYSN TFEKLSVKVE DLVFENEIEH LGYGIYGFDL DTTRIPDGEH EMFLEGHFQG
KTVKDSIKAK VVNEARYVLA GKVDFSSPEE VKNWWNSGTW QAEFESPDIE WNSEVGNGAL
QLNVKLPGKS DWEEVRAARK FEKLSECEIL EYDIYIPDVE GLKGRLRPYA VLNPGWVKIG
LDMNNTSVES AEIVTFGGKE YRKFHVRIEF DKTAGVNELH IGIVGDHLKY NGPIFIDNVK
LYTKEAE