Gene Ccel_1904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1904 
Symbol 
ID7310626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2260730 
End bp2261944 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content40% 
IMG OID643608838 
Productexodeoxyribonuclease VII, large subunit 
Protein accessionYP_002506232 
Protein GI220929323 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.950618 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGAG TTTATTCGGT TTCGGACATA AATAATTATA TAAAGCAGCT TGTATCAAAT 
GACATAATAC TATCGGATGT GTCCATTCGG GGTGAAATAT CCAACTTCAA GCACCATTAT
ACAGGTCATA TGTATTTTAC AATAAAGGAC AAAAACAGCC TGCTCAAATG CGTTATGTTC
AGATCACAGG CGGTTTCATT GAGGTTCTCT CCAGAAAATG GCATGAAGGT GATAGTTTCT
GGTTATATTT CAGTTTTTGA AAGAGACGGG CAGTATCAGC TGTATGCAAG CAGTATGCAG
CCCGATGGAG TGGGGGCTCT CCATATTGCA TTTGAACAGT TGAAAGAGAA ACTCCAGCGG
GAGGGCCTTT TTGACCCGGA AAACAAGAAA AAGATACCTG TACTGCCCGG AAGTATCGGC
GTTGTTACCT CATCCACAGG TGCGGTTATA AGGGATATCA TAAATGTTAC CTACAGGAGA
AACAGCAAAA TGAAGCTGGT ATTGTATCCT GTAGCTGTTC AGGGTCAACA GGCTGCAGGA
CAGATAGCGG AAGCAATAAA GTGTCTTAAT GAGCAGAACA AGGTTGATGT AATTATTGTA
GCAAGAGGTG GCGGCTCTTT GGAAGAGCTG TGGGCATTCA ATGAGGAAAT TGTTGCCAGG
AGCATATATG CATCAAATAT TCCTGTTATA TCTGCCGTTG GACACGAAAC AGACTTTACT
ATATGTGACT TTGTATCGGA TATGCGGGCA CCGACTCCTT CAGCAGCAGC TGAACTTGCA
GTTCCTGACA TGGAGGTTCT TTTGTATAAA TTGGAAAGCT ACAACATGAG AATGAAAAGT
TCTCTTGCAA AAAAAGTGAC AACGTTGAAG AACCAGCTGC AAAAATTAAA TGCAAGGCCA
TTTTTTGCAC AGCCCTATGA CAGGGTAAAC CAGCAAAGAC AGACACTTGA CAATTTAACT
AAAAGTATGG TCAGAGAGAA CCAAACAATT ATAAAAGACA AGAAATCTCA ATTTGGTATG
CTTGCAGGTA AGTTAGATGC CTTGAGTCCT TTGAAAATAT TAGAACGTGG GTATAGTCTT
GTAAAAAATC CTCAAGGCTA TGTAGTAAAT AACGTAAAAC AGATTAACAT AGGAGATAAG
TTGGAAATAT TGATGAATGA CGGATTAGCA GAATGTGACG TAATATCTGT AAGAGAGGGA
AAAATTTATG AGTAA
 
Protein sequence
MNRVYSVSDI NNYIKQLVSN DIILSDVSIR GEISNFKHHY TGHMYFTIKD KNSLLKCVMF 
RSQAVSLRFS PENGMKVIVS GYISVFERDG QYQLYASSMQ PDGVGALHIA FEQLKEKLQR
EGLFDPENKK KIPVLPGSIG VVTSSTGAVI RDIINVTYRR NSKMKLVLYP VAVQGQQAAG
QIAEAIKCLN EQNKVDVIIV ARGGGSLEEL WAFNEEIVAR SIYASNIPVI SAVGHETDFT
ICDFVSDMRA PTPSAAAELA VPDMEVLLYK LESYNMRMKS SLAKKVTTLK NQLQKLNARP
FFAQPYDRVN QQRQTLDNLT KSMVRENQTI IKDKKSQFGM LAGKLDALSP LKILERGYSL
VKNPQGYVVN NVKQINIGDK LEILMNDGLA ECDVISVREG KIYE