Gene Ccel_2007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2007 
Symbol 
ID7310716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2369950 
End bp2371191 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content38% 
IMG OID643608941 
ProductUMUC domain protein DNA-repair protein 
Protein accessionYP_002506334 
Protein GI220929425 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.470929 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAGGG TAATTTTACA TTGTGACCTG AATAATTTTT ACGCTTCTGT TGAATGTCTG 
TATAATCCAC AGTTTAGGGA TTATCCTTTG GCAGTATGCG GAAGTCAGGA TTTGCGTCAT
GGAATTGTTT TGGCTAAAAA TTATATTGCA AAAAAGTTTG GTATAAAAAC AGGCGAGGCT
ATCTGGCAGG CAAAACAAAA GTGTCCAAAC CTCGTTGTTG TTAATCCAAA TTATGCTTTA
TACCTGAGGT TTTCAAAGGA TGCTAGGGAA ATTTATTCCA GGTATTCCAA CCTTGTTGAA
AGTTTTGGCA TAGACGAGTG CTGGATTGAC GTTTCTGAAA GCACCAAGCT GTTCGGAGAC
GGGGAAAAAA TTGCAAATGA AATACGTGCA CTTATTAAAA CAGAACTTGG TGTTACTGCT
TCAGTTGGAG TGAGCTTTAA TAAGATATTT GCAAAGCTTG GGTCTGATCT ACAAAAGTCT
AATGCTACTA CTGTTATTAA CCAAAATAAT TTTAAGGAAA TGGTTTGGAA TTTAAACGTT
GGGGAGTTAC TTTATGTGGG CAGATCAACC CGGAAGAAAC TAAACCAGAT TGGAATAATG
ACTATCGGAG ATCTTGCAGG ACTTCCTCCC TCTTTCATTA GAAGATATCT CGGAAAATGG
GGAGAAATTC TCTGGAATTT TGCTAATGGC ATGGACTATT CCGAAGTAAC TGCAACAGAT
TATCACGAAA CTATAAAGGG AATCGGTAAC AGTATGACGA CCGCAAGGGA TCTTGTAAAC
ACAGAGGATG TCAAGCTTAC CTTTACTGTA CTGGCTGAAA GTGTGGCAGA GAGACTTAGA
AAACATAATT TAAAGGGTTC TACAATACAG ATTTATATTC GTGATAATGA GCTTGCCTCA
ATTGAACGTC AAGCAAAGCT CCCGGTTTCC AGCTATATAT CCGGTGAAAT CACACGTAAA
GCTATGAACA TTTTTAATAC AAATTGGAGT TGGTATAAGC CTATACGCTC TCTTGGTATA
CGTGCAACTG ATTTGGTTAC TGCCGACAGC CATACCCAAC TTTCCTTTTT TGACAATTAT
AATAAACGTC CACAATTGGA AAATTTGGAA TTCAGTATTG ACGCCATTCG AAAAAGGTTT
GGCCATTACT CTGTTCAAAG GGCAATTTTG CTTAAAGACA GTGCTCTTAA TGCTAATCCC
ATTGAAGACA ACATTATTCA TCCTGTTTCA TTTTTTAGGT AA
 
Protein sequence
MDRVILHCDL NNFYASVECL YNPQFRDYPL AVCGSQDLRH GIVLAKNYIA KKFGIKTGEA 
IWQAKQKCPN LVVVNPNYAL YLRFSKDARE IYSRYSNLVE SFGIDECWID VSESTKLFGD
GEKIANEIRA LIKTELGVTA SVGVSFNKIF AKLGSDLQKS NATTVINQNN FKEMVWNLNV
GELLYVGRST RKKLNQIGIM TIGDLAGLPP SFIRRYLGKW GEILWNFANG MDYSEVTATD
YHETIKGIGN SMTTARDLVN TEDVKLTFTV LAESVAERLR KHNLKGSTIQ IYIRDNELAS
IERQAKLPVS SYISGEITRK AMNIFNTNWS WYKPIRSLGI RATDLVTADS HTQLSFFDNY
NKRPQLENLE FSIDAIRKRF GHYSVQRAIL LKDSALNANP IEDNIIHPVS FFR