Gene Ccel_3062 details


Gene Information

Locus tag: Ccel_3062
Symbol:
ID: 7311663
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 3606409
End bp: 3608367
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 42%
IMG OID: 643609964
Product: DNA-directed DNA polymerase
Protein accession: YP_002507334
Protein GI: 220930425
COG category: [L] Replication, recombination and repair
COG ID: [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID:
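
The coordinates and lengths listed above agree with each other, which can be checked with a short Python sketch. It assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3606409
END_BP = 3608367


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp):
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1959 652
```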


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGA TATTATCGAT AGATATCGAG ACCTACAGCA GTGTAGACCT TATAAAATGT 
GGGGTATACC GGTACGTTGA AGCTCCTGAC TTTGAAATAC TTCTGTTTGC ATATGCGTAC
GATGATGAAC CTATAACTGT TATAGACTTG GCAGACTTCG AAGAGTTACC GGAACAAGTA
ACGAAAGACC TTACAGATCC AGACGTAATA AAAGCTGCCT ATAATGCTAA CTTTGAAAGA
ACCTGTATAG CTAAATTCTA TAATAAGCCA ATGCCTCCAG AGCAATGGAG ATGTACGTCG
GTTCACGCTT TAACTTTAGG GCTCCCGGGC AACCTTGACG GAGTGGCTAA GGTTCTCGGG
CTTGATGCAC AGAAGGACAC TGCAGGAAAG AACCTTATAA AATACTTCTC AGTTCCCTGC
AAGCCTACAA AAGTAAATGG TGGTAGGACC CGTAACTTTC CACACCATGC ACCTGATAAA
TGGAAGCAGT TCAAGGACTA TTGTAAACAG GATGTTGTTG TTGAAAGGGC GATAAGAAAG
AAAATAGAAA AATACCCTGT TCCAGAACAT GAGTGGAAGC TCTGGACTTT GGATCAGAAA
ATTAATGATG CAGGTGTAAG ACTTGATCCA GTATTAGTCC AGCAAGCTTT AAAGTGTGAT
TTACAGTATG CAACAAGACT TGATGCAGAG GCAAAGGAAC TGACAGGCCT TGATAACCCG
AATAGTACTA CTCAATTAAC TGTATGGCTT AAAAAACAAG GATTGACTGT AGATAACGGG
CTTGGCAAAG ACTATATACC CGGGTTATTA GATCAGGCTA AGGGCGATGA GACAGTAACC
AGAATGTTGG AGCTCCGGAA AGAAATGTCT AAAACTAGCA CTAAGAAGTA TGAGGCTATG
GAACGGGCAA TGTGTTCGGA TGATAGGGCA AGAGGATTAT TGCAATTCTG TGGGGCTAAT
CGTACATGGC GCTGGGCGGG AAGATTAATT CAAGTTCAGA ACCTGCCACA GAACAAAATT
CCTGATTTGG GAGTAGCTCG AGAGTTGTTA CATAGTGGTA AATTTGAGGC AATAGAACTT
CTTTTTAATA GCCCGCCGTT TGTGTTATCA CAGCTAATTA GAACAGCTTT CATTCCTTCT
GACGGTTGCA GGTTTATTGT GTCGGACTTC TCGGCCATCG AGGCCAGAGT AATAGCTTGG
TTAGCAGGAG AGTCGTGGCG AATGAAGGTA TTTAAAAGTC ATGGGAAGAT ATATGAGGCT
TCTGCATCCC ATATGTTTCA TGTACCCATA GAGGAGATTA CTAAAGGAAA TCCACTTAGG
CAAAAAGGAA AAATCGCAGA ACTTGCTCTC GGATATGGTG GTAGTATTGG AGCCCTTGAA
GCAATGGGTG CACTTAAAAT GGGGCTTGAT GCAGATGAGC TTCCAGACCT TGTCACAGCC
TGGCGTAACT CAAACCCTAA AATCGTTAAG CTTTGGTGGG ACGTCGATAA AGCTGCTATG
ACAGCTGTGC GAGAACACAT GCCTGTTAGA ATTCAGTACG GTATTACCTT CTCTTATAAA
GATGGGTTTT TATTTATTAA ACTACCATCG GGGCGAAAAC TGGCTTATGT AAAACCCAAA
ATAAAGGATG GTAAATTCGG ACGTCCTGCA CTTACATATG AGGGAATGGA CCAGATAAAA
AAGACTTGGG AACGTATAGA CACCTATGGC CCCAAGTTGG TTGAGAATAT TGTGCAGGCT
GTAGCGAGGG ATTGTTTAGC AGTTAATATG CTAAGACTCG ACGATGCAGG ATATGACATA
AGAATGCATG TACACGACGA AGTAATATTG GACGTACCTA ATACCGATAC AGATGCTCTT
GCACAAGTAA ACGAAATAAT GAGTCAGCCT ATAGATTGGG CTCCTGGGTT ACCTCTCAGG
GCAGATGGAT ATGAAACTGA ATTTTATAAA AAAGATTAG
 
Protein sequence
MKKILSIDIE TYSSVDLIKC GVYRYVEAPD FEILLFAYAY DDEPITVIDL ADFEELPEQV 
TKDLTDPDVI KAAYNANFER TCIAKFYNKP MPPEQWRCTS VHALTLGLPG NLDGVAKVLG
LDAQKDTAGK NLIKYFSVPC KPTKVNGGRT RNFPHHAPDK WKQFKDYCKQ DVVVERAIRK
KIEKYPVPEH EWKLWTLDQK INDAGVRLDP VLVQQALKCD LQYATRLDAE AKELTGLDNP
NSTTQLTVWL KKQGLTVDNG LGKDYIPGLL DQAKGDETVT RMLELRKEMS KTSTKKYEAM
ERAMCSDDRA RGLLQFCGAN RTWRWAGRLI QVQNLPQNKI PDLGVARELL HSGKFEAIEL
LFNSPPFVLS QLIRTAFIPS DGCRFIVSDF SAIEARVIAW LAGESWRMKV FKSHGKIYEA
SASHMFHVPI EEITKGNPLR QKGKIAELAL GYGGSIGALE AMGALKMGLD ADELPDLVTA
WRNSNPKIVK LWWDVDKAAM TAVREHMPVR IQYGITFSYK DGFLFIKLPS GRKLAYVKPK
IKDGKFGRPA LTYEGMDQIK KTWERIDTYG PKLVENIVQA VARDCLAVNM LRLDDAGYDI
RMHVHDEVIL DVPNTDTDAL AQVNEIMSQP IDWAPGLPLR ADGYETEFYK KD
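
The gene and protein sequences above can be related with a minimal Python sketch that translates the space-separated 10-base blocks and computes a GC fraction. It uses the amino acid assignments of NCBI translation table 11, which match the standard code for elongation codons (table 11 differs in which start codons are allowed, and this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tool.

```python
from itertools import product

# NCBI codon order: first, second, and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAAAAAGA TATTATCGAT AGATATCGAG"
print(translate(first_blocks))  # MKKILSIDIE
```

Passing the full gene sequence to `translate` should give the listed protein sequence once its spaces are removed, and `gc_fraction` on the full sequence can be compared with the GC content field.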