Gene Information |
Locus tag | Ccel_2075 |
Symbol | |
ID | 7310776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2432415 |
End bp | 2435105 |
Gene Length | 2691 bp |
Protein Length | 896 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643609008 |
Product | DNA polymerase I |
Protein accession | YP_002506400 |
Protein GI | 220929491 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
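The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the untranslated stop codon. The function name `cds_consistent` is hypothetical.

```python
def cds_consistent(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinates, gene length, and protein length agree."""
    # Assumption: Start bp and End bp are 1-based and inclusive.
    span_matches = (end_bp - start_bp + 1) == gene_length_bp
    # Assumption: Gene Length includes the stop codon, which is not translated.
    whole_codons = gene_length_bp % 3 == 0
    protein_matches = gene_length_bp // 3 - 1 == protein_length_aa
    return span_matches and whole_codons and protein_matches


# Values copied from the Ccel_2075 record above.
print(cds_consistent(2432415, 2435105, 2691, 896))  # True
```

For this record, 2435105 - 2432415 + 1 = 2691 bp, and 2691 / 3 = 897 codons, one of which is the stop codon, leaving 896 aa.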
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000172322 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAATTCGG AAGATAAAAT TTTAGTTGTT GACGGTAACA GCATACTTAA CAGAGCTTTT TACGGACTTA GCAGAGCTGC TATGCTGACA ACGTCCGAGG GATTATATAC AAATGCGGTA TTTGGGTTCA TAAATATACT TTCAAAGCAC CTTCAGGATG AAAACCCAAA GTATGTTTGT GTAGCATTTG ATTTAAAAGC TCCTACCTTC AGGCACAAGG AGTATGATCA ATACAAGGCT CAAAGAAAGG GAATGCCAAA TGAGCTGGCA GTTCAGGTTC CCATCATAAA ACAGGTTTTG GATGCTATGA ACATTGCAAG AGTTGAAGTA GAAGGGTTTG AAGCTGACGA TATACTGGGA ACTGTTTCTT CGTATGCTGA AAAACAGGGA ATGAAAACGA TACTATTGAC CGGAGACAGG GATTCATTAC AGCTGGCATC AAACTATACA AGAATCAAGC TTCCTGTAAC TAGGGCTAAT AAAACAGAGA CAGATGAATA TGACTATGAA AAGGTTATCG AAAAATACGG TGTGACACCA GGACAGCTAA TAGATGTAAA GGGGCTTATG GGTGATACCT CGGACAACAT TCCCGGAGTT CCCGGTATCG GTGAAAAAAC TGCACTTGCT CTGATTAAAA AGTTTAATTC TTTAGAAGAA CTATATGAAA ACATAGATAA AGTGGACAAG AAGGGTGTCC GTGAAAAACT GGAGAATAAC AAAGAGCTCG CTTTCATGTG TAAAAGGCTG GCTACTATAT ACAGAAAAGT ACCCGGTGTT GAGAATCTGA ACGATTTTGC AAGATTAGAA ATTGACAAAG AAAAATTATA CAGTATCTTC AAGCGTCTTG AGTTTAAAAC ATTAATTGAA AAGTTCGGTC TTGAAAATAC TCCTTTCGTA GAGGCGACGG AAGCTTTAAA AATTGAACAT GTGGATGTTG ATTCCATAAG TGAACTACAA TCATATATAA GTGTAATCAA ACTCAGCGGT ATGGTATCGG TGTATTATCA CGTTGACCCT GCAGGCAGCT ATCTGGATGA TCTTTGTATT TTTGCTTGTA ATGAGGAATA CGCTCCTGCA AATATCATTT TTTCTGAAAA ATTAACCTGT GAAACGGTAG TAAATGAACT GCGTGAAATT TTTGAGAGCA AGGATATTGA AAAATACGGA CATGATTTAA AAAACCTTTA CAAGTATCTC AAGTCCCATG GGATAGAGCT TGAAAATGTA ATATTTGATA CTTTTATAGC AGCATATATA CTTGAACCCA CCAGAAGCAC CTACACAATT TCGGAGCTGT CTGAAGATAA ACTAAAACAG AGCATTACGC CTGTTGAAAT TTTGTATGAC AAACACGGCA AAAGACTTGA GCAGGGACAG GATGTAAGTT CATCTGAGGT TTGTGCTGCC GCGGTGAATG CCATATACGG ACTGACTCAG AAGTTACGTC CCATTATTAG GGATAATGGC CAGGATGAGC TTTATTATAA AATAGAACTT CCTCTGGTTG AGGTACTGGC AAATATGGAA TTAAGGGGCT TTAAGGTTGA CGTAGAAAAT CTGAAAGCAT ACTCGAAAGA ATTGGATTCA AGACTTGTAA TTCTTGAAAA TGAAATATAT ATGCAGGCAG GAGAGACCTT TAATATTAAT TCCCCAAAAC AGCTGGGTGT AATTCTTTTT GAAAAACTGG GACTGCCTGT GGGAAAGAAA ACGAAAACAG GATATTCAAC AAGTGCCGAG GTATTAGAAC AGCTATCCTA CAAGCATGAG ATAGTAGAAA GGATTCTTGA ATACAGGCAG 
TTAATGAAGC TCAAGTCAAC ATACGCTGAT GGACTTTTGT CTGTATTGGA ACAGGATGGC AAGATTCATT CAAACTTTAA CCAGACTGTT ACGGCAACCG GTAGAATCAG CAGTACGGAG CCGAATCTAC AAAATATTCC TGTTAAATTA GAAATGGGAA GAAAAATCAG AAAGGTATTT ATTCCTACCA ATAGCGATTA TGTTCTGCTG GATGCTGATT ATTCACAGAT AGAACTAAGA GTGCTTGCAC ATATAACAGG TGATCCGAAT ATGATAGAAG CCTTTATAAA TAACGAGGAT ATTCATACTA CTACTGCTTC AAAGGTATTT GGAATACCGC CTGAAGAGGT ATCATCATTA ATGAGGTCAA GGGCGAAAGC CGTTAATTTC GGAATAGTTT ACGGTATTGG CGATTTCAGT CTTTCAAAAG ACATTGGGGT AACTAAAAAG GAAGCCCGAA AGTATATCGA TGACTATCTT GACAAATATT CCAAGGTTAA GGAGTATATG AGTGATACCG TTGAAAAAGG CAAAGAATTT GGGTTTGTGA CAACTCTCTA CAACAGAAGA AGGTATCTTC CCGAGCTTAA ATCCAGTAAT TTTAATATGC GTTCCTTTGG AGAGCGTGTG GCAATGAATA CACCTATTCA GGGAAGTGCG GCAGATATTA TAAAAATATC AATGGTAAAG GTTTATACTG AACTGAAAAA GAGAAAGCTG AAGTCAAAAC TTATTCTTCA GGTTCATGAT GAATTGATTG TGGAAACAGA AAAATCTGAA TTGGAAGAGG TATCAAAATT GTTAAAGGAT TGTATGGAAA ATGCCGTACA ATTAAAAGTA CCTTTAACAG TTGATGTAAA ACATGGGGAT AGTTGGTATG ATACAAAATA G
Protein sequence | MNSEDKILVV DGNSILNRAF YGLSRAAMLT TSEGLYTNAV FGFINILSKH LQDENPKYVC VAFDLKAPTF RHKEYDQYKA QRKGMPNELA VQVPIIKQVL DAMNIARVEV EGFEADDILG TVSSYAEKQG MKTILLTGDR DSLQLASNYT RIKLPVTRAN KTETDEYDYE KVIEKYGVTP GQLIDVKGLM GDTSDNIPGV PGIGEKTALA LIKKFNSLEE LYENIDKVDK KGVREKLENN KELAFMCKRL ATIYRKVPGV ENLNDFARLE IDKEKLYSIF KRLEFKTLIE KFGLENTPFV EATEALKIEH VDVDSISELQ SYISVIKLSG MVSVYYHVDP AGSYLDDLCI FACNEEYAPA NIIFSEKLTC ETVVNELREI FESKDIEKYG HDLKNLYKYL KSHGIELENV IFDTFIAAYI LEPTRSTYTI SELSEDKLKQ SITPVEILYD KHGKRLEQGQ DVSSSEVCAA AVNAIYGLTQ KLRPIIRDNG QDELYYKIEL PLVEVLANME LRGFKVDVEN LKAYSKELDS RLVILENEIY MQAGETFNIN SPKQLGVILF EKLGLPVGKK TKTGYSTSAE VLEQLSYKHE IVERILEYRQ LMKLKSTYAD GLLSVLEQDG KIHSNFNQTV TATGRISSTE PNLQNIPVKL EMGRKIRKVF IPTNSDYVLL DADYSQIELR VLAHITGDPN MIEAFINNED IHTTTASKVF GIPPEEVSSL MRSRAKAVNF GIVYGIGDFS LSKDIGVTKK EARKYIDDYL DKYSKVKEYM SDTVEKGKEF GFVTTLYNRR RYLPELKSSN FNMRSFGERV AMNTPIQGSA ADIIKISMVK VYTELKKRKL KSKLILQVHD ELIVETEKSE LEEVSKLLKD CMENAVQLKV PLTVDVKHGD SWYDTK
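The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method used to produce the record: it assumes the gene sequence is shown on the coding strand (it begins with ATG even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for all non-start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code; it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAATTCGG AAGATAAAAT TTTAGTTGTT"))  # MNSEDKILVV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over the whole protein.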