Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1565 |
Symbol | |
ID | 4810072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1892797 |
End bp | 1894272 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640106983 |
Product | nitrogenase |
Protein accession | YP_001037984 |
Protein GI | 125974074 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.016954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAAA TAAATTTATC CCTTCCGGAA GTACAGATAA GGGAAATTCG TATTAATTCA ATAACGGGTT ATCAGGGAGA TGCTAAGGAA CTGGTAGAAG CCCGCGAATT CGGTCTGAAG GATAAAGAAC GTTCCTTTAG CCAATGCCTG GGCTGTGCTA CCTCAAAAGC GGCCTGTATG ACTGTGTTAA TTCAGGACGC TGCAGTCATC AGCCATGGAC CGGTGGGCTG TGCTTCCTGT CTGCATGAAT TTGCCTTTAC CTATCGGGTG AATTATCCTT TGCGCGGTAT TGAACGTCCC ACACCACGCC GTATCTTTTC CACCAATCTA AAGGAAAAGG ATACAGTTTA CGGAGGAAAT ATAAAGCTTG CCAATACCAT TCGAGAGGTA TATGAGAGAA CGCATGCCAA CGCTATTTTT GTATTGACCA CATGCGCTGC CGGAATTATC GGCGATGATG TGGAAAGCGT TTGCAACGAA GCCGAGGAAG AGTTGGGAAT ACCGGTGGTA GCCATCTTTT GCGAAGGTTT TCGTTCCAAA GTATGGACCA CAGGTTTTGA CGCTGCTTAC CACGGCATTG CACGCAAGCT GATTCAAAAA CCCCGGAGGC GGCGGGATGA CATGATCAAT GTAATCAATT TCTGGGGCAG CGATGTGTTT TACGAATGGT TTGCTCCCTT TGGAGCAAAA CCCAATTACA TAATCCCTTT TTCTACAGTG AACGGATTAA AATATGCCAG CGAGGCCGCT GCCACCGTCC AGGCTTGCTC CACGCTGGGA AGCTACCTGG GAGCAGTGCT GGAACAGGAT TTTGGTGTTC CCGAAATTCC TGCCGCCCCA CCCTACGGTA TTGCACAAAC GGATAGATGG TTCAGGGCGT TGGGAAAGAT CCTCGGCAAA GAAGAAATTG CTGAAAAAAT CATTGCGGAA AAAAAGAAAG AGTATCTGCC CAAAATTGAA GCTCTACGGG AAAAATTGGC CGGAAAAACG GCTTATGTAA CAGCAGGTGC TGCCCATGGC CATGCGTTGC TGGATGTGCT GGGAGAGCTT GGCATTAAAG CAGTCGGTGC AGCGATTTTC CATCACGACC CCATCTATGA CAGCGGACGT GAGGAAAACG ACCAACTGGC TCAGCGCGTA GCCGATTATG GAAATGTTTT TAACTACAAT GTTTGCAACA AGCAGGAGTT TGAGCTGGTC AATGCCTTAA ACCGCCTCCG TCCCGATGTA TTGCTGGCCC GGCATGGCGG CATGACTCTC TGGGGAGCAA AACTGGGCAT TCCGTCACTT TTAATTGGCG ATGAACATTA CTCCATGGGT TATGAAGGTC TGGTCAATTA CGGTGAGCGT ATTTTAGAAG TTATTGAAAA CGATGAATTT GTAAAAAACC TCGAAAAGCA TGCCATCAAT CCATACACCA AATGGTGGCT TGAGCAGCCG CCGTATTATT TCCTGAAAGG AGGTACCGGT AAATGA
|
Protein sequence | MSKINLSLPE VQIREIRINS ITGYQGDAKE LVEAREFGLK DKERSFSQCL GCATSKAACM TVLIQDAAVI SHGPVGCASC LHEFAFTYRV NYPLRGIERP TPRRIFSTNL KEKDTVYGGN IKLANTIREV YERTHANAIF VLTTCAAGII GDDVESVCNE AEEELGIPVV AIFCEGFRSK VWTTGFDAAY HGIARKLIQK PRRRRDDMIN VINFWGSDVF YEWFAPFGAK PNYIIPFSTV NGLKYASEAA ATVQACSTLG SYLGAVLEQD FGVPEIPAAP PYGIAQTDRW FRALGKILGK EEIAEKIIAE KKKEYLPKIE ALREKLAGKT AYVTAGAAHG HALLDVLGEL GIKAVGAAIF HHDPIYDSGR EENDQLAQRV ADYGNVFNYN VCNKQEFELV NALNRLRPDV LLARHGGMTL WGAKLGIPSL LIGDEHYSMG YEGLVNYGER ILEVIENDEF VKNLEKHAIN PYTKWWLEQP PYYFLKGGTG K
|
| |