Gene Cthe_1566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1566 
Symbol 
ID4810073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1894269 
End bp1895612 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content47% 
IMG OID640106984 
Productnitrogenase 
Protein accessionYP_001037985 
Protein GI125974075 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGAT CCGGATTGAT TGAACAAGAA CGTTTTACCT GTGCAATCGG TGCTTTGCAA 
ACCGTGGTGG CTATTCCGCG GGCCGTGCCG ATTCTTCATT CCGGTCCCGG CTGCGGCGAG
ATGATTGCCG GATTTTTTGA ACGGTCAACG GGATACGCCG GCGGTTCCAC ATCTCCCTGC
ACAAACTTTA CAGAAAAAGA AGTTGTGTTT GGCGGAATCA ACCGGCTGAG GGATATTATA
GAAAACACCT ACAAAGTATT GGATACGGAT TTGCAGGTGG TTCTGACCGG CTGCACCGCC
GGTATTGTCG GAGACGATGT GGACAGCCTT GTTTCTGAAT TTGCCCAAAA GGGTAAGCCG
ATTGTATCCG TGGAAACTGC AGGATTTAAG GCCACCAACT TTGAAGCTCA CAGCCTTGTG
GTTAATGCCA TTATAGATCA ATATGTAAGC CGGTTTGAAG ATGAGAATAA GCCAAAATCG
CAAAAAAACA CAGTGAATCT TATAGCCTCC ATTCCGTATC AGGATCCGTT TTGGAAGGGT
AATCTGGCCG AATACAAGCG TCTGCTTGCC GGTATTGGGC TTAAAGCCAA TGTTTTATTC
GGACCCCAGT CGGGAGGTGT AAAAGAGTGG CAGTCCATAC CGACAGCACT TTTCAATATT
TTAGTTTCCC CATGGTACGG AAAACCCATT GCGGATCACC TCAAATCCAA ATACGGGCAG
GAATATACAT GGTTTCATCA CATTCCCATA GGTGCCAATC AAACCGAGGC GTTCCTTAAT
CAAGTTGTGG AATTTGCCAT CGAACAAGGA GCAGATATTG ACAAAGAATC AGCCCAGGAG
TTTATCCGTC ACGAGTCCCA TGCCTACTAT GAGGAGATTG ATAACCTTGC CACCTTCCTT
TTGGAGTTTC GCTACGGTCT TCCCAACCAT GCCCATATCC TTCATGACGC GGGATATGTC
GTCGCACTGT CTAAGTTTTT GCTGCACGAG GTGGGAATTG TACCAAAGGA ACAATTTATT
ACCGATGCTA CACCGGAAAA ATTCCATGAA GCCATTCGCG CCGATTTGAA AAGCACCAGC
GATAAAAAGG AAATTCCGCT TTATTTTGAG CCCGATGCGG GAAAGGCGCA GGAGATTCTT
AGGGGAATCC ATCATAAAGG AAGGGGTCTT ATCATCGGTT CAGGATGGGA TAAGGAACTG
GCAAAAGAAA AAGGCTATGA TTTCCTTTCA GCTGCTTTAC CTTCTCCCTA CCGGTTGGTC
TTGACAACCA ATTACGCAGG ATTTACAGGA GGGCTTCGGG TTATAGAGGA CATCTACCAG
ACGGTTCTTT CAACCTATGC ATAA
 
Protein sequence
MSRSGLIEQE RFTCAIGALQ TVVAIPRAVP ILHSGPGCGE MIAGFFERST GYAGGSTSPC 
TNFTEKEVVF GGINRLRDII ENTYKVLDTD LQVVLTGCTA GIVGDDVDSL VSEFAQKGKP
IVSVETAGFK ATNFEAHSLV VNAIIDQYVS RFEDENKPKS QKNTVNLIAS IPYQDPFWKG
NLAEYKRLLA GIGLKANVLF GPQSGGVKEW QSIPTALFNI LVSPWYGKPI ADHLKSKYGQ
EYTWFHHIPI GANQTEAFLN QVVEFAIEQG ADIDKESAQE FIRHESHAYY EEIDNLATFL
LEFRYGLPNH AHILHDAGYV VALSKFLLHE VGIVPKEQFI TDATPEKFHE AIRADLKSTS
DKKEIPLYFE PDAGKAQEIL RGIHHKGRGL IIGSGWDKEL AKEKGYDFLS AALPSPYRLV
LTTNYAGFTG GLRVIEDIYQ TVLSTYA