Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1566 |
Symbol | |
ID | 4810073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1894269 |
End bp | 1895612 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640106984 |
Product | nitrogenase |
Protein accession | YP_001037985 |
Protein GI | 125974075 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAGAT CCGGATTGAT TGAACAAGAA CGTTTTACCT GTGCAATCGG TGCTTTGCAA ACCGTGGTGG CTATTCCGCG GGCCGTGCCG ATTCTTCATT CCGGTCCCGG CTGCGGCGAG ATGATTGCCG GATTTTTTGA ACGGTCAACG GGATACGCCG GCGGTTCCAC ATCTCCCTGC ACAAACTTTA CAGAAAAAGA AGTTGTGTTT GGCGGAATCA ACCGGCTGAG GGATATTATA GAAAACACCT ACAAAGTATT GGATACGGAT TTGCAGGTGG TTCTGACCGG CTGCACCGCC GGTATTGTCG GAGACGATGT GGACAGCCTT GTTTCTGAAT TTGCCCAAAA GGGTAAGCCG ATTGTATCCG TGGAAACTGC AGGATTTAAG GCCACCAACT TTGAAGCTCA CAGCCTTGTG GTTAATGCCA TTATAGATCA ATATGTAAGC CGGTTTGAAG ATGAGAATAA GCCAAAATCG CAAAAAAACA CAGTGAATCT TATAGCCTCC ATTCCGTATC AGGATCCGTT TTGGAAGGGT AATCTGGCCG AATACAAGCG TCTGCTTGCC GGTATTGGGC TTAAAGCCAA TGTTTTATTC GGACCCCAGT CGGGAGGTGT AAAAGAGTGG CAGTCCATAC CGACAGCACT TTTCAATATT TTAGTTTCCC CATGGTACGG AAAACCCATT GCGGATCACC TCAAATCCAA ATACGGGCAG GAATATACAT GGTTTCATCA CATTCCCATA GGTGCCAATC AAACCGAGGC GTTCCTTAAT CAAGTTGTGG AATTTGCCAT CGAACAAGGA GCAGATATTG ACAAAGAATC AGCCCAGGAG TTTATCCGTC ACGAGTCCCA TGCCTACTAT GAGGAGATTG ATAACCTTGC CACCTTCCTT TTGGAGTTTC GCTACGGTCT TCCCAACCAT GCCCATATCC TTCATGACGC GGGATATGTC GTCGCACTGT CTAAGTTTTT GCTGCACGAG GTGGGAATTG TACCAAAGGA ACAATTTATT ACCGATGCTA CACCGGAAAA ATTCCATGAA GCCATTCGCG CCGATTTGAA AAGCACCAGC GATAAAAAGG AAATTCCGCT TTATTTTGAG CCCGATGCGG GAAAGGCGCA GGAGATTCTT AGGGGAATCC ATCATAAAGG AAGGGGTCTT ATCATCGGTT CAGGATGGGA TAAGGAACTG GCAAAAGAAA AAGGCTATGA TTTCCTTTCA GCTGCTTTAC CTTCTCCCTA CCGGTTGGTC TTGACAACCA ATTACGCAGG ATTTACAGGA GGGCTTCGGG TTATAGAGGA CATCTACCAG ACGGTTCTTT CAACCTATGC ATAA
|
Protein sequence | MSRSGLIEQE RFTCAIGALQ TVVAIPRAVP ILHSGPGCGE MIAGFFERST GYAGGSTSPC TNFTEKEVVF GGINRLRDII ENTYKVLDTD LQVVLTGCTA GIVGDDVDSL VSEFAQKGKP IVSVETAGFK ATNFEAHSLV VNAIIDQYVS RFEDENKPKS QKNTVNLIAS IPYQDPFWKG NLAEYKRLLA GIGLKANVLF GPQSGGVKEW QSIPTALFNI LVSPWYGKPI ADHLKSKYGQ EYTWFHHIPI GANQTEAFLN QVVEFAIEQG ADIDKESAQE FIRHESHAYY EEIDNLATFL LEFRYGLPNH AHILHDAGYV VALSKFLLHE VGIVPKEQFI TDATPEKFHE AIRADLKSTS DKKEIPLYFE PDAGKAQEIL RGIHHKGRGL IIGSGWDKEL AKEKGYDFLS AALPSPYRLV LTTNYAGFTG GLRVIEDIYQ TVLSTYA
|
| |