Gene Information |
Locus tag | Ccel_2021 |
Symbol | |
ID | 7310730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2380780 |
End bp | 2382171 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643608955 |
Product | protein of unknown function DUF342 |
Protein accession | YP_002506347 |
Protein GI | 220929438 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain |
TIGRFAM ID | |
|
|
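The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the Ccel_2021 record above.
gene_len = gene_length_bp(2380780, 2382171)
print(gene_len)                     # 1392
print(protein_length_aa(gene_len))  # 463
```

Both results match the Gene Length (1392 bp) and Protein Length (463 aa) rows of the record.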
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAGAAC AAAAAGATTT AAAAGTATTG GTAACAGTTT CGCCAGACGA GCTAAAAGCT TTTATAACAC TGTACAATAC GGGGGACAAT TCAACTATTA AAAAAGAAGA TATTATGCTT GCACTCGAAA GTCAGAGGGT AGTTTTTGGC ATTAAGGAAG ATATTATAAA TTATCTGGTT GAAAGTCCTA TGTATAACGA ATCGTTTTGT GTTGCGGAAG GTATTGCACC TAAAAACGGG AAAAATGGTT CAGTTACATA TCATTTTAAC ACTTCTGTAA ACAAAACTCC AACCCTTATG GAGGATGGTA GAATAAATTA CAGGGAGTTA AACTTGATTC AGTCCGTTAA AAAGGGGCAG ATACTTTGTT CATTGGTTCC TCCTGTAGTA GGGGTAGAAG GAAAAAACGT TAAAGGGAGA GTCATTTCTG CTATAAACGG TAAACCTGCG GTATTGCCAA GGGGGAAGAA TGTTGCACTA TCTGAAGATG GGAAAAGTCT TATTGCTACA ACAGCGGGAG AGGTTGAATA CCTGGATGCT ACAAAAGTAA GCGTATATAC AAACCATGAA GTTCCTGCAG ATGTGGATAA TTCAACGGGA AATGTAAGCT TTGTTGGAAG TGTTATTATA AAGGGCAATG TTTTATCCGG CTTTTCGGTA GAAGCAGGAG GTAATGTTGA GGTTTTCGGG GTTGTTGAAG GTGCAACAAT AAAAGCCGGT GGGAATATTA TATTACGACG GGGAATGCAG GGTATGGGTA AAGGTAAGCT GATTGCCGGC GGTGATATAG TAGCGAGATA CATAGAATAC AGCAGCGTAG ACGCAAATAA TAATATTCAG GCGGAAGCCA TAATGCACAG TAATGTAAAA TGCGGAAACA AGCTGGAGCT GACAGGTAAT AAAGGACTTT TTGTCGGAGG CTCTTGCAAG GTTGGCAAAA TTGTTGTTGC AAAGGTTATA GGGTCACATA TGGCAACAAT TACAGATGTG GAAGTGGGTG CTGATCCTTC CGTAAGAGAG AGATACAAAA ATGCCAAAGA AGAATTAATT TCTATGGAAA GTGATATAAA GAAGGCAGAT CAGGCAATAA CAATTTTACG TAAAATGGAA AGTGCGGGTG CATTGACCCC TGATAAGCAG GAAATATTAA CAAAGAGTGT CCGAACAAAG GTATATTTAT CTTCAAAGAT TGAGGAAGTA AAGCAAGAAG CAGCAATTCT GGACGAAAAG CTACAACAGG AGGGTAATGG TAAGGTTCGT GCACTAAATT GCATTTATCC CGGAGTAAAG GTTTCAATCG GAACATGTAT GATGTATGTA AAGGAACCTC TTCAGTATTG TACCTTGTAC AGAGATGGTG CAGATGTACG TGTTGGGCCC ATTGACAAGT AA
|
Protein sequence | MVEQKDLKVL VTVSPDELKA FITLYNTGDN STIKKEDIML ALESQRVVFG IKEDIINYLV ESPMYNESFC VAEGIAPKNG KNGSVTYHFN TSVNKTPTLM EDGRINYREL NLIQSVKKGQ ILCSLVPPVV GVEGKNVKGR VISAINGKPA VLPRGKNVAL SEDGKSLIAT TAGEVEYLDA TKVSVYTNHE VPADVDNSTG NVSFVGSVII KGNVLSGFSV EAGGNVEVFG VVEGATIKAG GNIILRRGMQ GMGKGKLIAG GDIVARYIEY SSVDANNNIQ AEAIMHSNVK CGNKLELTGN KGLFVGGSCK VGKIVVAKVI GSHMATITDV EVGADPSVRE RYKNAKEELI SMESDIKKAD QAITILRKME SAGALTPDKQ EILTKSVRTK VYLSSKIEEV KQEAAILDEK LQQEGNGKVR ALNCIYPGVK VSIGTCMMYV KEPLQYCTLY RDGADVRVGP IDK
|
| |
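The gene sequence above is shown in coding orientation (it begins with ATG and ends with the stop codon TAA), even though the gene lies on the minus strand of the replicon. A minimal sketch of how the protein sequence and GC content could be re-derived from it follows. It assumes the 10-base display blocks are separated by whitespace only, and it relies on the fact that translation table 11 uses the same amino acid assignments as the standard genetic code (the two differ in which start codons are permitted, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character display blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last display blocks of the gene sequence in this record.
print(translate("ATGGTAGAAC AAAAAGATTT"))  # MVEQKD
print(translate("ATTGACAAGT AA"))          # IDK
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field, and `gc_percent` on the same input can be compared against the GC content row.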