Gene Cthe_1297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1297 
SymbolcobT 
ID4809549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1574604 
End bp1575659 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content46% 
IMG OID640106720 
Productnicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase 
Protein accessionYP_001037722 
Protein GI125973812 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2038] NaMN:DMB phosphoribosyltransferase 
TIGRFAM ID[TIGR03160] nicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTTTTC AAACACTGAA ATCCATAGGA GAGCTGTATA AAGAACCCAT GGACATGGTT 
CAAAGGAGGC TCGACAGCCT TTCGAAGCCG TTGGGGAGCC TGGGAAGGCT GGAAGATATC
ATTAAAAAGC TTGCAGGTAT AACCGGAGAA GTTTTTCCGT GTGTTGATAA AAAAGCGGTT
ATTATAATGT GTGCGGACAA CGGAGTTGTG GAAGAGGGGA TAAGTTCCTG CCCGAAAGAT
GTTACTTCCA AAGTGACCAG GAATTTTCTG AAGGGTATAA CAGCCATAAA TGCTTTTGCA
AAGCATACAG GTTCCGATAT TGTAGTGGTT GATATCGGAG TGGATGATGA CATGGACTGT
GAAGGAATTG TAAAGCGCAA AGTAAGAAAA GGTACCTGGA ATATTGCAAA AGGGCCCGCA
ATGACGCGCA AAGAGGCAAT AGAGGCCATA GAGGTCGGAA TTTCCATTGT GGAGGAACTT
GGCAGGAAAG GAGTAAATCT TTTAGGCACG GGAGAAATGG GTATTGGCAA TACCACGACC
AGCAGCGCGG TTTCAACGGT TTTGACAGAT TCAAAAGCTG AGAATATGGT GGGCAGGGGA
GCAGGCCTTT CGGATGAAGC GCTGAAAAGA AAGATTTCGA TTGTCAAAAA GGCTATAGAT
TTAAACAGAC CCGATGCAAA CGACCCTATT GACGTTGTTT CAAAAGTGGG CGGGTTTGAT
ATTGCAGGCC TTGCAGGCTG CTTTATCGGT GCAGCGGCAT GTAGAATTCC GATCCTTATT
GACGGATTTA TATCTGCAAC AGCTGCCCTT GCAGCAGTAA GGATGGAGCC GAAGGTCAAA
AATTTCATTT TTCCTTCCCA TGGTTCAGCA GAACCCGGAA GCAAAAAAGT TATGGAAGCG
TTGGGATTTG AACCTATACT GAATCTGGAG ATGAGAGTCG GAGAGGGCAC CGGTGCGGCA
CTGGCATTTC ATATTTTTGA CTGTGCCGTG TCGGTATACA GGAACATGGG CACATTTGAG
GATGCATGTA TTGAACAATA TCAGCCTCAG GTGTAA
 
Protein sequence
MLFQTLKSIG ELYKEPMDMV QRRLDSLSKP LGSLGRLEDI IKKLAGITGE VFPCVDKKAV 
IIMCADNGVV EEGISSCPKD VTSKVTRNFL KGITAINAFA KHTGSDIVVV DIGVDDDMDC
EGIVKRKVRK GTWNIAKGPA MTRKEAIEAI EVGISIVEEL GRKGVNLLGT GEMGIGNTTT
SSAVSTVLTD SKAENMVGRG AGLSDEALKR KISIVKKAID LNRPDANDPI DVVSKVGGFD
IAGLAGCFIG AAACRIPILI DGFISATAAL AAVRMEPKVK NFIFPSHGSA EPGSKKVMEA
LGFEPILNLE MRVGEGTGAA LAFHIFDCAV SVYRNMGTFE DACIEQYQPQ V