Gene Ccel_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2021 
Symbol 
ID7310730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2380780 
End bp2382171 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content39% 
IMG OID643608955 
Productprotein of unknown function DUF342 
Protein accessionYP_002506347 
Protein GI220929438 
COG category[L] Replication, recombination and repair 
COG ID[COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAGAAC AAAAAGATTT AAAAGTATTG GTAACAGTTT CGCCAGACGA GCTAAAAGCT 
TTTATAACAC TGTACAATAC GGGGGACAAT TCAACTATTA AAAAAGAAGA TATTATGCTT
GCACTCGAAA GTCAGAGGGT AGTTTTTGGC ATTAAGGAAG ATATTATAAA TTATCTGGTT
GAAAGTCCTA TGTATAACGA ATCGTTTTGT GTTGCGGAAG GTATTGCACC TAAAAACGGG
AAAAATGGTT CAGTTACATA TCATTTTAAC ACTTCTGTAA ACAAAACTCC AACCCTTATG
GAGGATGGTA GAATAAATTA CAGGGAGTTA AACTTGATTC AGTCCGTTAA AAAGGGGCAG
ATACTTTGTT CATTGGTTCC TCCTGTAGTA GGGGTAGAAG GAAAAAACGT TAAAGGGAGA
GTCATTTCTG CTATAAACGG TAAACCTGCG GTATTGCCAA GGGGGAAGAA TGTTGCACTA
TCTGAAGATG GGAAAAGTCT TATTGCTACA ACAGCGGGAG AGGTTGAATA CCTGGATGCT
ACAAAAGTAA GCGTATATAC AAACCATGAA GTTCCTGCAG ATGTGGATAA TTCAACGGGA
AATGTAAGCT TTGTTGGAAG TGTTATTATA AAGGGCAATG TTTTATCCGG CTTTTCGGTA
GAAGCAGGAG GTAATGTTGA GGTTTTCGGG GTTGTTGAAG GTGCAACAAT AAAAGCCGGT
GGGAATATTA TATTACGACG GGGAATGCAG GGTATGGGTA AAGGTAAGCT GATTGCCGGC
GGTGATATAG TAGCGAGATA CATAGAATAC AGCAGCGTAG ACGCAAATAA TAATATTCAG
GCGGAAGCCA TAATGCACAG TAATGTAAAA TGCGGAAACA AGCTGGAGCT GACAGGTAAT
AAAGGACTTT TTGTCGGAGG CTCTTGCAAG GTTGGCAAAA TTGTTGTTGC AAAGGTTATA
GGGTCACATA TGGCAACAAT TACAGATGTG GAAGTGGGTG CTGATCCTTC CGTAAGAGAG
AGATACAAAA ATGCCAAAGA AGAATTAATT TCTATGGAAA GTGATATAAA GAAGGCAGAT
CAGGCAATAA CAATTTTACG TAAAATGGAA AGTGCGGGTG CATTGACCCC TGATAAGCAG
GAAATATTAA CAAAGAGTGT CCGAACAAAG GTATATTTAT CTTCAAAGAT TGAGGAAGTA
AAGCAAGAAG CAGCAATTCT GGACGAAAAG CTACAACAGG AGGGTAATGG TAAGGTTCGT
GCACTAAATT GCATTTATCC CGGAGTAAAG GTTTCAATCG GAACATGTAT GATGTATGTA
AAGGAACCTC TTCAGTATTG TACCTTGTAC AGAGATGGTG CAGATGTACG TGTTGGGCCC
ATTGACAAGT AA
 
Protein sequence
MVEQKDLKVL VTVSPDELKA FITLYNTGDN STIKKEDIML ALESQRVVFG IKEDIINYLV 
ESPMYNESFC VAEGIAPKNG KNGSVTYHFN TSVNKTPTLM EDGRINYREL NLIQSVKKGQ
ILCSLVPPVV GVEGKNVKGR VISAINGKPA VLPRGKNVAL SEDGKSLIAT TAGEVEYLDA
TKVSVYTNHE VPADVDNSTG NVSFVGSVII KGNVLSGFSV EAGGNVEVFG VVEGATIKAG
GNIILRRGMQ GMGKGKLIAG GDIVARYIEY SSVDANNNIQ AEAIMHSNVK CGNKLELTGN
KGLFVGGSCK VGKIVVAKVI GSHMATITDV EVGADPSVRE RYKNAKEELI SMESDIKKAD
QAITILRKME SAGALTPDKQ EILTKSVRTK VYLSSKIEEV KQEAAILDEK LQQEGNGKVR
ALNCIYPGVK VSIGTCMMYV KEPLQYCTLY RDGADVRVGP IDK