Gene Ccel_1804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1804 
Symbol 
ID7310535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2158134 
End bp2159279 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content39% 
IMG OID643608736 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_002506134 
Protein GI220929225 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00796527 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATGGA GACAGCTAAG AGAACAGCAG GAAAATTTAC TGAGTCCCTA TGCAGCAAGA 
AGTGTTAATT CTTTTGGCAG GATAATCAAT GAAGAAAAAT GTCCTATCAG AACAGATTAC
GAACGTGATG GGAACAGGAT TTTATACTCC ATGGAATTCA GACGATTAAG ACACAAAACA
CAGGTGTTTT TTAATGCAAA GAACGACCAC ATATGTACAC GAATGGAGCA TGTATTAAAT
GTAGGCTCAA TAGCCGTCAC TATAGCCAGA ACGCTGAATT TGAATCAGGA CCTTACATAT
GCAATAGCTT TAGGGCATGA TCTTGGGCAT GCTCCATTCG GCCACAGCGG TGAACGTGTA
TTAGATAAAT GCATGAAGAA AGTAAATTTA GAGTTCGGTT TTCAACATGA GCTCCACAGC
CTTAGGGTTG TAGAAAAGCT TGCAACACGT ATCTCTAAAG AAAAAATACA TGAAAAATGC
GGACTCAATC TAACTTTTGA AGTAAGGGAC GGGATAGTAT CACACTGTGG TGAAAACTAT
AACGAATATT CTTTAAAAAG AGATTTAGGG AAAACACCAC AGTCACTGTG TAATGTAAAA
GACAGAGGAG GTCTTCCGTT TACCCTTGAA GGCTGCATTG TGAGACTGGT AGATAAAATA
GCATACGCCG GAAGGGATAT TGAGGACGCT GTACGTGTCA ACCTGATGAA CGTCCATGAT
ATACCCAAGG ATATAAGAAA TGAGTTGGGT AACACAAATG GGGAAATAAT AAATACTCTG
GTTTGTGATA TGGTTGAGAA CAGCTACGGC AGGGATTGTA TCCGGCTTAG CCCGGATAAA
GGCCAGGCAC TTGAAAAACT CATAAATGAA AATATAAGGT TAATTTATAA GGCTGATAAA
ATAACCCGAT ATGAAAAAAT TGCCGAAAAC ACTCTTGAAG GTTTGTTTGA CAGCCTGCTC
GACAGCCTAT CTGATTTTGA AAAGCTTCAG ACAAATGAAA ATAAGGTTTA CAGAATGTTT
TATAACTTTA TAGCAGACAA GGCATACGAC GAGTCTGAAA GTGATGCTCA AAAAGTAATT
GATTTTATAG CCGGGATGAC AGACCAGTTT GCACAAAGCT GCTTTGAAGA AATATACTGG
ATGTAG
 
Protein sequence
MIWRQLREQQ ENLLSPYAAR SVNSFGRIIN EEKCPIRTDY ERDGNRILYS MEFRRLRHKT 
QVFFNAKNDH ICTRMEHVLN VGSIAVTIAR TLNLNQDLTY AIALGHDLGH APFGHSGERV
LDKCMKKVNL EFGFQHELHS LRVVEKLATR ISKEKIHEKC GLNLTFEVRD GIVSHCGENY
NEYSLKRDLG KTPQSLCNVK DRGGLPFTLE GCIVRLVDKI AYAGRDIEDA VRVNLMNVHD
IPKDIRNELG NTNGEIINTL VCDMVENSYG RDCIRLSPDK GQALEKLINE NIRLIYKADK
ITRYEKIAEN TLEGLFDSLL DSLSDFEKLQ TNENKVYRMF YNFIADKAYD ESESDAQKVI
DFIAGMTDQF AQSCFEEIYW M