Gene Information |
Locus tag | Cthe_2685 |
Symbol | |
ID | 4808857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3168929 |
End bp | 3169897 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640108104 |
Product | nifR3 family TIM-barrel protein |
Protein accession | YP_001039077 |
Protein GI | 125975167 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00737] putative TIM-barrel protein, nifR3 family |
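
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 3168929
end_bp = 3169897

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, which is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 969
print(protein_length)  # 322
```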
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000366446 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAG GGAATGTCAC CCTTGATAAT AATATTTTTC TTGCTCCCAT GGCGGGCATT ACCGATATTC CCTTCAGGCT TTTATGTAAA GAACAGGGAT GCGGATTGAC ATATACGGAA ATGGTAAGCG CAAAAGGAAT TTACTATAAT GACGAGAAGA CCAAAAAGCT TACCGCGGTA GATGCCGCCG AGGGAAAAGT TGCGCTGCAG ATTTTTGGTT CAGATCCGGT GATTATGGCA AAAGTGACGG AACAACTTAA TGATTCTGAC GCATGCATTA TCGACATAAA CATGGGCTGC CCGACACCGA AAATAACAAA AAACGGTGAC GGCTGTGCGT TAATGCGCCA GCCGGAGCTG GTAGGGAAAA TAGTTCGGGA GGTTTCAAAG GCTTCAGTCA AGCCTGTCAC GGTGAAAATC CGCAAGGGAT GGGATGAAAA CAGGATCAAT GCGGTGGAAA TAGCCAGGAT AGCCGAGGAG AACGGCGCAG CGGCAATTAC GGTTCACGGA AGGACAAGGG AACAGTTTTA CAGTGGCAAG GCGGATTGGA GCATCATAAG AGAGGTTAAG CAATCTGTCA GCATACCTGT AATAGGAAAC GGGGATGTTT TTACGCCGGA AGATGCCAGG AGAATGTTTG AAGAGACAAA TTGCGATGCA ATAATGATTG GCAGAGGTGC TCAGGGAAAT CCGTGGATTT TCCGAAAAAT AATAAAGTAT CTTGAAGGCT CCGAGGATTT TGACCTGGAT ATATCCCTTG AAACTAAGAT AAACATAATC AAGAGACATA TGCAAATGCT TGTTGAACTT AAAGGTGAGC AATGCGGAGT ACGGGAAATG AGAAAACACA TAGCATGGTA TATAAAAGGT ATGCGCAACG CTTCACGTAT CAAGGAAAAA GTATTTAAAG CGACAACTCA GCAAGAAGTT TTCAGCCTGC TTGATGAGCT TTTGGAATTC AACATGTAA
|
Protein sequence | MKIGNVTLDN NIFLAPMAGI TDIPFRLLCK EQGCGLTYTE MVSAKGIYYN DEKTKKLTAV DAAEGKVALQ IFGSDPVIMA KVTEQLNDSD ACIIDINMGC PTPKITKNGD GCALMRQPEL VGKIVREVSK ASVKPVTVKI RKGWDENRIN AVEIARIAEE NGAAAITVHG RTREQFYSGK ADWSIIREVK QSVSIPVIGN GDVFTPEDAR RMFEETNCDA IMIGRGAQGN PWIFRKIIKY LEGSEDFDLD ISLETKINII KRHMQMLVEL KGEQCGVREM RKHIAWYIKG MRNASRIKEK VFKATTQQEV FSLLDELLEF NM
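
As a rough cross-check, the protein sequence can be re-derived from the gene sequence, and a GC percentage can be computed the same way. The sketch below is a minimal, standard-library illustration rather than the method the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted), and it does not handle alternative start codons or ambiguous bases. The names `translate`, `gc_percent` and `clean` are hypothetical helpers.

```python
# Codon-to-amino-acid mapping shared by NCBI tables 1 and 11, in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGAAAATAG GGAATGTCAC CCTTGATAAT"
print(translate(first_blocks))  # MKIGNVTLDN
```

Passing the full Gene sequence field to `translate` should give a string comparable to the Protein sequence field once the spaces are removed from both.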
|
| |