Gene Cthe_1565 details

Gene Information

Locus tag: Cthe_1565
Symbol:
ID: 4810072
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1892797
End bp: 1894272
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 48%
IMG OID: 640106983
Product: nitrogenase
Protein accession: YP_001037984
Protein GI: 125974074
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID:
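
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the source database.

```python
# Coordinates copied from the record above.
START_BP = 1892797
END_BP = 1894272


def gene_length(start, end):
    """Length in bp of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1476 491
```

Under these assumptions the computed values agree with the Gene Length (1476 bp) and Protein Length (491 aa) fields.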


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.016954
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAAAA TAAATTTATC CCTTCCGGAA GTACAGATAA GGGAAATTCG TATTAATTCA 
ATAACGGGTT ATCAGGGAGA TGCTAAGGAA CTGGTAGAAG CCCGCGAATT CGGTCTGAAG
GATAAAGAAC GTTCCTTTAG CCAATGCCTG GGCTGTGCTA CCTCAAAAGC GGCCTGTATG
ACTGTGTTAA TTCAGGACGC TGCAGTCATC AGCCATGGAC CGGTGGGCTG TGCTTCCTGT
CTGCATGAAT TTGCCTTTAC CTATCGGGTG AATTATCCTT TGCGCGGTAT TGAACGTCCC
ACACCACGCC GTATCTTTTC CACCAATCTA AAGGAAAAGG ATACAGTTTA CGGAGGAAAT
ATAAAGCTTG CCAATACCAT TCGAGAGGTA TATGAGAGAA CGCATGCCAA CGCTATTTTT
GTATTGACCA CATGCGCTGC CGGAATTATC GGCGATGATG TGGAAAGCGT TTGCAACGAA
GCCGAGGAAG AGTTGGGAAT ACCGGTGGTA GCCATCTTTT GCGAAGGTTT TCGTTCCAAA
GTATGGACCA CAGGTTTTGA CGCTGCTTAC CACGGCATTG CACGCAAGCT GATTCAAAAA
CCCCGGAGGC GGCGGGATGA CATGATCAAT GTAATCAATT TCTGGGGCAG CGATGTGTTT
TACGAATGGT TTGCTCCCTT TGGAGCAAAA CCCAATTACA TAATCCCTTT TTCTACAGTG
AACGGATTAA AATATGCCAG CGAGGCCGCT GCCACCGTCC AGGCTTGCTC CACGCTGGGA
AGCTACCTGG GAGCAGTGCT GGAACAGGAT TTTGGTGTTC CCGAAATTCC TGCCGCCCCA
CCCTACGGTA TTGCACAAAC GGATAGATGG TTCAGGGCGT TGGGAAAGAT CCTCGGCAAA
GAAGAAATTG CTGAAAAAAT CATTGCGGAA AAAAAGAAAG AGTATCTGCC CAAAATTGAA
GCTCTACGGG AAAAATTGGC CGGAAAAACG GCTTATGTAA CAGCAGGTGC TGCCCATGGC
CATGCGTTGC TGGATGTGCT GGGAGAGCTT GGCATTAAAG CAGTCGGTGC AGCGATTTTC
CATCACGACC CCATCTATGA CAGCGGACGT GAGGAAAACG ACCAACTGGC TCAGCGCGTA
GCCGATTATG GAAATGTTTT TAACTACAAT GTTTGCAACA AGCAGGAGTT TGAGCTGGTC
AATGCCTTAA ACCGCCTCCG TCCCGATGTA TTGCTGGCCC GGCATGGCGG CATGACTCTC
TGGGGAGCAA AACTGGGCAT TCCGTCACTT TTAATTGGCG ATGAACATTA CTCCATGGGT
TATGAAGGTC TGGTCAATTA CGGTGAGCGT ATTTTAGAAG TTATTGAAAA CGATGAATTT
GTAAAAAACC TCGAAAAGCA TGCCATCAAT CCATACACCA AATGGTGGCT TGAGCAGCCG
CCGTATTATT TCCTGAAAGG AGGTACCGGT AAATGA
 
Protein sequence
MSKINLSLPE VQIREIRINS ITGYQGDAKE LVEAREFGLK DKERSFSQCL GCATSKAACM 
TVLIQDAAVI SHGPVGCASC LHEFAFTYRV NYPLRGIERP TPRRIFSTNL KEKDTVYGGN
IKLANTIREV YERTHANAIF VLTTCAAGII GDDVESVCNE AEEELGIPVV AIFCEGFRSK
VWTTGFDAAY HGIARKLIQK PRRRRDDMIN VINFWGSDVF YEWFAPFGAK PNYIIPFSTV
NGLKYASEAA ATVQACSTLG SYLGAVLEQD FGVPEIPAAP PYGIAQTDRW FRALGKILGK
EEIAEKIIAE KKKEYLPKIE ALREKLAGKT AYVTAGAAHG HALLDVLGEL GIKAVGAAIF
HHDPIYDSGR EENDQLAQRV ADYGNVFNYN VCNKQEFELV NALNRLRPDV LLARHGGMTL
WGAKLGIPSL LIGDEHYSMG YEGLVNYGER ILEVIENDEF VKNLEKHAIN PYTKWWLEQP
PYYFLKGGTG K
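
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative example, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which start codons are permitted), and it strips the spaces that separate the 10-base blocks in the listing above.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used for block formatting and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGTCAAAAA TAAATTTATC CCTTCCGGAA"
print(translate(first_blocks))  # MSKINLSLPE
```

The ten residues produced match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein listing and the GC content field.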