Gene Ccel_2303 details


Gene Information

Locus tag: Ccel_2303
Symbol:
ID: 7312348
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 2685484
End bp: 2687172
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 41%
IMG OID: 643609232
Product: hydrogenase, Fe-only
Protein accession: YP_002506620
Protein GI: 220929711
COG category: [R] General function prediction only
COG ID: [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID: [TIGR02512] hydrogenases, Fe-only
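
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
start_bp, end_bp = 2685484, 2687172
gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1689 562
```

Both results agree with the Gene Length (1689 bp) and Protein Length (562 aa) fields.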


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAACAA TGATAATTGA CGGACAGCAT ATTGAATTTA CCGATGAAAA AAATATATTG 
GCAGTAATCA GAAAAGTGGG TATTGAGCTT CCTACCTTTT GCTATCACTC AGAACTCAGT
GTTTACGGTG CGTGCAGAAT GTGTATGGTT GAGGATAAAT GGGGCAGCAC CATAACCTCA
TGTTCAACAC CTCCAAAGGA CGGTATGGAG GTATGGACAA ATACAGATAA GCTGAAAAAA
CATAGAAAAA TGATATTGGA GCTGCTACTT GCCAATCATG ACAGGGATTG TACAACCTGT
GAAAAAAGTG GAAGATGTAA GCTGCAGGAA ATAGCTTTGA AGGTGGGAGT AAAGAAAGTA
AGGTTCGGAC AGGAAAAAAA GGAGATACCT ATTGATGATA TGGGACCATC CGTAATCAGA
AATCCAAATA AGTGCATACT ATGCGGAGAC TGTGTCAGGG CATGTCAGGA AATTCAGGGT
GTCGGAGTAC TGGACTTTGC TTACAGAGGA TCAAATTTGC AGGTAACAAC AGCCTTCAAT
AAATCATTGC AGGAAGTTGA TTGTGTTAAC TGCGGTCAGT GCAGAGTTGT TTGTCCTACT
GGAGCATTGA TGATAAAGAA AGACATAGAC AGAGCATATA AAGCTCTTCA TGACAAAAAT
AAGCGTGTTA TCGCACAGAT AGCACCTGCT GTACGTGTTG CCATCGGAGA AGATTTTGGT
CTTCAGCCGG GGGAGATATC AATGGGTAAG ATAGTTTCAG CACTTAGGAA GCTTGGCTTT
GATCAGGTAT TTGACACAGC TGTTGGAGCA GACCTGACTG TTATAGAAGA AGCCGAAGAA
CTGATGGACA GGATTCAGAG AAAAGAAAAA CTACCTTTGT TCAGCTCATG CTGTCCTGCT
TGGTTCAAGT ATGCAGAACA GAAGCACCCG GATCTAATGG AAAATGTGTC CTCCTGCCTG
TCTCCACAGC AGATGTTTGG AGCGGTTATA AAGGAACAGT ATAAAAGGGA AAAAGCTTCT
GACGAAAAGG AAAACGTTGT TATTGCAATA ATGCCGTGTA CCGCTAAGAA GTATGAAGCT
GCAAGACCTG AAAACACCAT AAACGGTGAA AGACAGGTAG ATATGGTAAT AACGACACAG
GAACTTGCAA TTATGATACA GGAAAACGGT ATAGTATTCA ATGAGCTTGA AGACGAAGCT
ATTGATATGC CTTTCGGATT TACCAGCGGT GCAGGTGTTA TATTTGGTGT CAGCGGAGGT
GTGTCCGAAG CGGTACTTCG TTATTACTAC AAGGAAAGAA ATGCTTCAAC ACTCAGAGGT
CTTTCATATT GCGGAGTCAG AGGTATGGAA GGAGTTAAGG AGGCATCAGC CGAAATTGAC
GGCAGAACCG TAAGAATCGG AATAGTTCAC GGTCTTAAAA ATGCTGAAAA GCTTATAAGA
AGAATAAAGA GCGGAGAAGA GAAATTTGAC TTTATTGAAG TTATGGCTTG CCCCGGTGGT
TGTATTGGTG GTGCAGGACA GCCTATTCCT CAAAATGAAA ATGTAAGAAA ACTAAGGGCA
AAGGGTATAT ACAAGGTAGA CAAGTCATTA CCGATAAAAC GTTCTGATGA CAATCCTACC
ATAGACGCAT TGTACAACGG TATATTAAAC AGTAATAGAA ATATTCTCCA TAGGAACGGA
AAACATTAA
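
The gene sequence is displayed in blocks of ten bases, so the blocks need to be joined before any computation. A minimal sketch of computing GC content from such a block-formatted string follows; the helper names are hypothetical, and the example input is only the first two blocks of the sequence above.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence.
first_two_blocks = "ATGGGAACAA TGATAATTGA"
seq = clean_sequence(first_two_blocks)
print(len(seq), gc_fraction(seq))  # 20 0.3
```

Applied to the full sequence, the result can be compared against the GC content field of the record.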
 
Protein sequence
MGTMIIDGQH IEFTDEKNIL AVIRKVGIEL PTFCYHSELS VYGACRMCMV EDKWGSTITS 
CSTPPKDGME VWTNTDKLKK HRKMILELLL ANHDRDCTTC EKSGRCKLQE IALKVGVKKV
RFGQEKKEIP IDDMGPSVIR NPNKCILCGD CVRACQEIQG VGVLDFAYRG SNLQVTTAFN
KSLQEVDCVN CGQCRVVCPT GALMIKKDID RAYKALHDKN KRVIAQIAPA VRVAIGEDFG
LQPGEISMGK IVSALRKLGF DQVFDTAVGA DLTVIEEAEE LMDRIQRKEK LPLFSSCCPA
WFKYAEQKHP DLMENVSSCL SPQQMFGAVI KEQYKREKAS DEKENVVIAI MPCTAKKYEA
ARPENTINGE RQVDMVITTQ ELAIMIQENG IVFNELEDEA IDMPFGFTSG AGVIFGVSGG
VSEAVLRYYY KERNASTLRG LSYCGVRGME GVKEASAEID GRTVRIGIVH GLKNAEKLIR
RIKSGEEKFD FIEVMACPGG CIGGAGQPIP QNENVRKLRA KGIYKVDKSL PIKRSDDNPT
IDALYNGILN SNRNILHRNG KH
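
The protein sequence corresponds to translating the gene sequence under translation table 11. The sketch below builds the codon table from the standard NCBI codon ordering (table 11 assigns the same amino acids to codons as the standard table and differs in its permitted start codons). It is an illustration under those assumptions, not the pipeline that produced this record, and `translate` is a hypothetical helper.

```python
from itertools import product

# NCBI ordering: codons enumerated over T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon.

    Alternative start codons are not special-cased here; this gene starts
    with ATG, so that simplification does not matter for this record.
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGGGAACAA TGATAATTGA CGGACAGCAT"))  # MGTMIIDGQH
print(translate("AAACATTAA"))  # KH
```

The two outputs match the first ten and the last two residues of the protein sequence shown above.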