Gene Cthe_2685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2685 
Symbol 
ID4808857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3168929 
End bp3169897 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content44% 
IMG OID640108104 
ProductnifR3 family TIM-barrel protein 
Protein accessionYP_001039077 
Protein GI125975167 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000366446 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAG GGAATGTCAC CCTTGATAAT AATATTTTTC TTGCTCCCAT GGCGGGCATT 
ACCGATATTC CCTTCAGGCT TTTATGTAAA GAACAGGGAT GCGGATTGAC ATATACGGAA
ATGGTAAGCG CAAAAGGAAT TTACTATAAT GACGAGAAGA CCAAAAAGCT TACCGCGGTA
GATGCCGCCG AGGGAAAAGT TGCGCTGCAG ATTTTTGGTT CAGATCCGGT GATTATGGCA
AAAGTGACGG AACAACTTAA TGATTCTGAC GCATGCATTA TCGACATAAA CATGGGCTGC
CCGACACCGA AAATAACAAA AAACGGTGAC GGCTGTGCGT TAATGCGCCA GCCGGAGCTG
GTAGGGAAAA TAGTTCGGGA GGTTTCAAAG GCTTCAGTCA AGCCTGTCAC GGTGAAAATC
CGCAAGGGAT GGGATGAAAA CAGGATCAAT GCGGTGGAAA TAGCCAGGAT AGCCGAGGAG
AACGGCGCAG CGGCAATTAC GGTTCACGGA AGGACAAGGG AACAGTTTTA CAGTGGCAAG
GCGGATTGGA GCATCATAAG AGAGGTTAAG CAATCTGTCA GCATACCTGT AATAGGAAAC
GGGGATGTTT TTACGCCGGA AGATGCCAGG AGAATGTTTG AAGAGACAAA TTGCGATGCA
ATAATGATTG GCAGAGGTGC TCAGGGAAAT CCGTGGATTT TCCGAAAAAT AATAAAGTAT
CTTGAAGGCT CCGAGGATTT TGACCTGGAT ATATCCCTTG AAACTAAGAT AAACATAATC
AAGAGACATA TGCAAATGCT TGTTGAACTT AAAGGTGAGC AATGCGGAGT ACGGGAAATG
AGAAAACACA TAGCATGGTA TATAAAAGGT ATGCGCAACG CTTCACGTAT CAAGGAAAAA
GTATTTAAAG CGACAACTCA GCAAGAAGTT TTCAGCCTGC TTGATGAGCT TTTGGAATTC
AACATGTAA
 
Protein sequence
MKIGNVTLDN NIFLAPMAGI TDIPFRLLCK EQGCGLTYTE MVSAKGIYYN DEKTKKLTAV 
DAAEGKVALQ IFGSDPVIMA KVTEQLNDSD ACIIDINMGC PTPKITKNGD GCALMRQPEL
VGKIVREVSK ASVKPVTVKI RKGWDENRIN AVEIARIAEE NGAAAITVHG RTREQFYSGK
ADWSIIREVK QSVSIPVIGN GDVFTPEDAR RMFEETNCDA IMIGRGAQGN PWIFRKIIKY
LEGSEDFDLD ISLETKINII KRHMQMLVEL KGEQCGVREM RKHIAWYIKG MRNASRIKEK
VFKATTQQEV FSLLDELLEF NM