Gene Ccel_2167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2167 
Symbol 
ID7310859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2536304 
End bp2537614 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content40% 
IMG OID643609098 
Productamidohydrolase 
Protein accessionYP_002506489 
Protein GI220929580 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0888971 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTTGATA TACTTATAAA GAATACCGAG TTAATAACTA ATGATGAAAG TAAGCCGTTG 
ATTAAAGACG GATATATCGG AATCAAGGAT GGGTGTATTG ATTTCATTTC GGATAGTCTT
CCTGAAAATG TAAAAGCCAG AGAGGTGATT GACGGAAAAA ACAAGATAGC CATGCCCGGC
CTGGTAAATG CCCATAGCCA CAGTGCCATG ACACTTATGA GAAATTATGC TGATGATATA
GCACTCGAAA AATGGTTGTT TGATAATATT TTTCCGGTTG AAGCAAAACT CACTGATAAA
GATGTTTACT GGGGTACTAT GCTTGGTATC TCCGAAATGC TTAAATCAGG AATTACTGCA
TTTGCTGATA TGTATATGTT TATGGATGAG GTTGCACGTG CAGTAACTGA AACTGGTATA
AAGGCAAACC TTTGTAAAAG TCCGGTACAG TTTTTTGAGG ACGGGCAGCT TAAAAGACTT
GACAAAAGTC AGGGAACCAT TGATTATTAC AACAGCTATC ATAATTCGGC TAACGGAAGA
ATAAAGGTCT TCGTAGAAAT ACACTCAGTT TATATGTTTA ATGAAAATAC CCTTAGAAAT
GCGGCTCAAC TGGCTAAGCA GCTGAATACA GGTATACATA TACATTTACT TGAAACTCTC
TCTGAGGTTG AATCCAGTAA AAAGGACTAT GATATGACTT CTATAGAGAT ATGCAGAGAA
ACTGGGGTAC TTGATGTTCC TGTTATGGCG GCACATTGTG TCCATCTCAC TGACGGTGAC
CTTAGAATCA TGAAAGAGAA GAGGGCAAGT GTGGTTCATA ATCCGACCAG TAATCTCAAG
CTGGGAAGTG GCATTGCCAG AGTACCCGAA ATGATGGACA TGGGTATTAA TGTATGTCTT
GGTACTGACG GTGCTGCCAG CAACAATAAT CTTAATATGT TTGAGGAAAT GAATCTTGCT
GCAATACTCC ACAAGGGCGT CGCTATGAAC CCGCAGCTGA TGAAAGCCCA GGATGTTCTT
AAAATGGGAA CAGTTAACGG GGCAAGGGCT ATAGGTTTTG ATGATACAGG TATACTATCA
AAGGGAATGA AAGCAGACAT TATACTGGTT GATACAGATA AACCTCACTT TTATCCCAAA
AATAACCCAA TGTCAATGAT TGTATATTCG GCACAAGCAG CCGATGTGGA CACTGTTATA
GTTGATGGTA ATGTTCTGGT AAAGAAGCGT GAATTTATAC ATATTGATGA AGAGAGAATT
AAGTTTGAGG TAGATACTTT ATCCAAGAGG CTCCTGGGCA GACAACCATA G
 
Protein sequence
MLDILIKNTE LITNDESKPL IKDGYIGIKD GCIDFISDSL PENVKAREVI DGKNKIAMPG 
LVNAHSHSAM TLMRNYADDI ALEKWLFDNI FPVEAKLTDK DVYWGTMLGI SEMLKSGITA
FADMYMFMDE VARAVTETGI KANLCKSPVQ FFEDGQLKRL DKSQGTIDYY NSYHNSANGR
IKVFVEIHSV YMFNENTLRN AAQLAKQLNT GIHIHLLETL SEVESSKKDY DMTSIEICRE
TGVLDVPVMA AHCVHLTDGD LRIMKEKRAS VVHNPTSNLK LGSGIARVPE MMDMGINVCL
GTDGAASNNN LNMFEEMNLA AILHKGVAMN PQLMKAQDVL KMGTVNGARA IGFDDTGILS
KGMKADIILV DTDKPHFYPK NNPMSMIVYS AQAADVDTVI VDGNVLVKKR EFIHIDEERI
KFEVDTLSKR LLGRQP