Gene Ccel_2075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2075 
Symbol 
ID7310776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2432415 
End bp2435105 
Gene Length2691 bp 
Protein Length896 aa 
Translation table11 
GC content38% 
IMG OID643609008 
ProductDNA polymerase I 
Protein accessionYP_002506400 
Protein GI220929491 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000172322 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCGG AAGATAAAAT TTTAGTTGTT GACGGTAACA GCATACTTAA CAGAGCTTTT 
TACGGACTTA GCAGAGCTGC TATGCTGACA ACGTCCGAGG GATTATATAC AAATGCGGTA
TTTGGGTTCA TAAATATACT TTCAAAGCAC CTTCAGGATG AAAACCCAAA GTATGTTTGT
GTAGCATTTG ATTTAAAAGC TCCTACCTTC AGGCACAAGG AGTATGATCA ATACAAGGCT
CAAAGAAAGG GAATGCCAAA TGAGCTGGCA GTTCAGGTTC CCATCATAAA ACAGGTTTTG
GATGCTATGA ACATTGCAAG AGTTGAAGTA GAAGGGTTTG AAGCTGACGA TATACTGGGA
ACTGTTTCTT CGTATGCTGA AAAACAGGGA ATGAAAACGA TACTATTGAC CGGAGACAGG
GATTCATTAC AGCTGGCATC AAACTATACA AGAATCAAGC TTCCTGTAAC TAGGGCTAAT
AAAACAGAGA CAGATGAATA TGACTATGAA AAGGTTATCG AAAAATACGG TGTGACACCA
GGACAGCTAA TAGATGTAAA GGGGCTTATG GGTGATACCT CGGACAACAT TCCCGGAGTT
CCCGGTATCG GTGAAAAAAC TGCACTTGCT CTGATTAAAA AGTTTAATTC TTTAGAAGAA
CTATATGAAA ACATAGATAA AGTGGACAAG AAGGGTGTCC GTGAAAAACT GGAGAATAAC
AAAGAGCTCG CTTTCATGTG TAAAAGGCTG GCTACTATAT ACAGAAAAGT ACCCGGTGTT
GAGAATCTGA ACGATTTTGC AAGATTAGAA ATTGACAAAG AAAAATTATA CAGTATCTTC
AAGCGTCTTG AGTTTAAAAC ATTAATTGAA AAGTTCGGTC TTGAAAATAC TCCTTTCGTA
GAGGCGACGG AAGCTTTAAA AATTGAACAT GTGGATGTTG ATTCCATAAG TGAACTACAA
TCATATATAA GTGTAATCAA ACTCAGCGGT ATGGTATCGG TGTATTATCA CGTTGACCCT
GCAGGCAGCT ATCTGGATGA TCTTTGTATT TTTGCTTGTA ATGAGGAATA CGCTCCTGCA
AATATCATTT TTTCTGAAAA ATTAACCTGT GAAACGGTAG TAAATGAACT GCGTGAAATT
TTTGAGAGCA AGGATATTGA AAAATACGGA CATGATTTAA AAAACCTTTA CAAGTATCTC
AAGTCCCATG GGATAGAGCT TGAAAATGTA ATATTTGATA CTTTTATAGC AGCATATATA
CTTGAACCCA CCAGAAGCAC CTACACAATT TCGGAGCTGT CTGAAGATAA ACTAAAACAG
AGCATTACGC CTGTTGAAAT TTTGTATGAC AAACACGGCA AAAGACTTGA GCAGGGACAG
GATGTAAGTT CATCTGAGGT TTGTGCTGCC GCGGTGAATG CCATATACGG ACTGACTCAG
AAGTTACGTC CCATTATTAG GGATAATGGC CAGGATGAGC TTTATTATAA AATAGAACTT
CCTCTGGTTG AGGTACTGGC AAATATGGAA TTAAGGGGCT TTAAGGTTGA CGTAGAAAAT
CTGAAAGCAT ACTCGAAAGA ATTGGATTCA AGACTTGTAA TTCTTGAAAA TGAAATATAT
ATGCAGGCAG GAGAGACCTT TAATATTAAT TCCCCAAAAC AGCTGGGTGT AATTCTTTTT
GAAAAACTGG GACTGCCTGT GGGAAAGAAA ACGAAAACAG GATATTCAAC AAGTGCCGAG
GTATTAGAAC AGCTATCCTA CAAGCATGAG ATAGTAGAAA GGATTCTTGA ATACAGGCAG
TTAATGAAGC TCAAGTCAAC ATACGCTGAT GGACTTTTGT CTGTATTGGA ACAGGATGGC
AAGATTCATT CAAACTTTAA CCAGACTGTT ACGGCAACCG GTAGAATCAG CAGTACGGAG
CCGAATCTAC AAAATATTCC TGTTAAATTA GAAATGGGAA GAAAAATCAG AAAGGTATTT
ATTCCTACCA ATAGCGATTA TGTTCTGCTG GATGCTGATT ATTCACAGAT AGAACTAAGA
GTGCTTGCAC ATATAACAGG TGATCCGAAT ATGATAGAAG CCTTTATAAA TAACGAGGAT
ATTCATACTA CTACTGCTTC AAAGGTATTT GGAATACCGC CTGAAGAGGT ATCATCATTA
ATGAGGTCAA GGGCGAAAGC CGTTAATTTC GGAATAGTTT ACGGTATTGG CGATTTCAGT
CTTTCAAAAG ACATTGGGGT AACTAAAAAG GAAGCCCGAA AGTATATCGA TGACTATCTT
GACAAATATT CCAAGGTTAA GGAGTATATG AGTGATACCG TTGAAAAAGG CAAAGAATTT
GGGTTTGTGA CAACTCTCTA CAACAGAAGA AGGTATCTTC CCGAGCTTAA ATCCAGTAAT
TTTAATATGC GTTCCTTTGG AGAGCGTGTG GCAATGAATA CACCTATTCA GGGAAGTGCG
GCAGATATTA TAAAAATATC AATGGTAAAG GTTTATACTG AACTGAAAAA GAGAAAGCTG
AAGTCAAAAC TTATTCTTCA GGTTCATGAT GAATTGATTG TGGAAACAGA AAAATCTGAA
TTGGAAGAGG TATCAAAATT GTTAAAGGAT TGTATGGAAA ATGCCGTACA ATTAAAAGTA
CCTTTAACAG TTGATGTAAA ACATGGGGAT AGTTGGTATG ATACAAAATA G
 
Protein sequence
MNSEDKILVV DGNSILNRAF YGLSRAAMLT TSEGLYTNAV FGFINILSKH LQDENPKYVC 
VAFDLKAPTF RHKEYDQYKA QRKGMPNELA VQVPIIKQVL DAMNIARVEV EGFEADDILG
TVSSYAEKQG MKTILLTGDR DSLQLASNYT RIKLPVTRAN KTETDEYDYE KVIEKYGVTP
GQLIDVKGLM GDTSDNIPGV PGIGEKTALA LIKKFNSLEE LYENIDKVDK KGVREKLENN
KELAFMCKRL ATIYRKVPGV ENLNDFARLE IDKEKLYSIF KRLEFKTLIE KFGLENTPFV
EATEALKIEH VDVDSISELQ SYISVIKLSG MVSVYYHVDP AGSYLDDLCI FACNEEYAPA
NIIFSEKLTC ETVVNELREI FESKDIEKYG HDLKNLYKYL KSHGIELENV IFDTFIAAYI
LEPTRSTYTI SELSEDKLKQ SITPVEILYD KHGKRLEQGQ DVSSSEVCAA AVNAIYGLTQ
KLRPIIRDNG QDELYYKIEL PLVEVLANME LRGFKVDVEN LKAYSKELDS RLVILENEIY
MQAGETFNIN SPKQLGVILF EKLGLPVGKK TKTGYSTSAE VLEQLSYKHE IVERILEYRQ
LMKLKSTYAD GLLSVLEQDG KIHSNFNQTV TATGRISSTE PNLQNIPVKL EMGRKIRKVF
IPTNSDYVLL DADYSQIELR VLAHITGDPN MIEAFINNED IHTTTASKVF GIPPEEVSSL
MRSRAKAVNF GIVYGIGDFS LSKDIGVTKK EARKYIDDYL DKYSKVKEYM SDTVEKGKEF
GFVTTLYNRR RYLPELKSSN FNMRSFGERV AMNTPIQGSA ADIIKISMVK VYTELKKRKL
KSKLILQVHD ELIVETEKSE LEEVSKLLKD CMENAVQLKV PLTVDVKHGD SWYDTK