Gene Ccel_2859 details


Gene Information

Locus tag: Ccel_2859
Symbol:
ID: 7312416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 3411221
End bp: 3412357
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 44%
IMG OID: 643609754
Product: hypothetical protein
Protein accession: YP_002507133
Protein GI: 220930224
COG category:
COG ID:
TIGRFAM ID:
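
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record does not state explicitly: coordinates are 1-based and inclusive, and the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon is not translated


length_bp = gene_length(3411221, 3412357)
print(length_bp, protein_length(length_bp))  # prints: 1137 378
```

Both values agree with the Gene Length and Protein Length fields listed for Ccel_2859.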


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 7.59798e-09
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGATC ACGCAGTACT TTCCGCATCA GGGTCCCATA GGTGGTTGAA TTGCCTACCA 
TCTGCCAGAT TGGAACTGGA ATTTGAAAAT AGCGAATCCA ATGCAGCCGC TGAAGGCACC
GCCGCCCATG CTCTCTGCGA ACATAAACTC AAAAAAGCAC TTCACATGAG AAGTAAGCGT
CCTGTCTCAG TTTATAACTC CGATGAGATG GAAGAACACA GTGATGCCTA TGTGGAATTT
GTAATGGAAC AGCTTGAGCT GGCAAAGCAG AGCTGCACAG ACCCTTTAAT ACTAATCGAA
CAGCGTCTTG ATTTTTCCTG CTATGTTCCA CAGGGATTTG GAACCGGTGA CTGCATCATT
ATTGCCGATA AGAAACTTCA CATTATTGAT TTCAAGTATG GCATGGGAGT ATTGGTAGAC
GCGGTGGACA ATCCGCAGAT GAAACTGTAT GCACTGGGTG CTTTAGAAAT TTACGATAGT
TTGTACGACA TCGAGGAAGT GTCCATGACC ATTTTCCAGC CACGCAGAGA AAATGTCAGC
ACATGGACAA TCCCGGTAAA GGAATTAAAA GACTGGGCAG AAAATGAACT GAGACCAAAG
GCGAAAAAGG CCTATAAAGG CGAAGGTGAC TATCTTCCAG GTGAATGGTG TACTTTCTGT
CGAGCGGCTG TTAAATGCCG TGCAAGAGCA GAAGAAAAAC TGAAATTAGC ACAGATGGAA
TTCAAGCTAC CCCCACTACT TACGGACTCT GAAATTGAGG AAGTTCTCTC TAAATTGTCC
GATCTTACAA AGTGGGCAAA TGAAATCATT GCTTATGCCA CGGATGCAGC CGTTAATCAC
GGGAAAGAGT GGCACGGTTT TAAGGTAGTA GAGGGCAGAT CGGTCCGTAA ATATAAGGAC
GAAAAAGCTG TTGCTGAAGC AGCCAAAGCA AACGGATATA AGGACATCTA CCGTCAGAAT
CTCATTACCC TTACAGAAAT GCAGAAGCTG ATGGGCAAAA AGAAATTTGA GCAAATTCTC
GGTGGTCTTA TACATAAACC ACCGGGCAAG CCAACGCTGG TTCCAAATTC GGATAAGCGA
CCAGCTATGA ATATATCAAA TGTAAAAAAC GAATTTAATG AAATTACGGA GGGATAG
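
The gene sequence above is printed in blocks of ten bases, so the whitespace has to be removed before the sequence is measured or analysed. The following is a minimal sketch of that clean-up together with a GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs on the first two blocks only rather than the full sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_blocks = clean_sequence("ATGAGTGATC ACGCAGTACT")
print(len(first_blocks), gc_percent(first_blocks))  # prints: 20 45.0
```

Applied to the full sequence, the cleaned length can be compared with the Gene Length field and the percentage with the GC content field.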
 
Protein sequence
MSDHAVLSAS GSHRWLNCLP SARLELEFEN SESNAAAEGT AAHALCEHKL KKALHMRSKR 
PVSVYNSDEM EEHSDAYVEF VMEQLELAKQ SCTDPLILIE QRLDFSCYVP QGFGTGDCII
IADKKLHIID FKYGMGVLVD AVDNPQMKLY ALGALEIYDS LYDIEEVSMT IFQPRRENVS
TWTIPVKELK DWAENELRPK AKKAYKGEGD YLPGEWCTFC RAAVKCRARA EEKLKLAQME
FKLPPLLTDS EIEEVLSKLS DLTKWANEII AYATDAAVNH GKEWHGFKVV EGRSVRKYKD
EKAVAEAAKA NGYKDIYRQN LITLTEMQKL MGKKKFEQIL GGLIHKPPGK PTLVPNSDKR
PAMNISNVKN EFNEITEG
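
The protein sequence above is the translation of the gene sequence. The sketch below shows one way to reproduce it, assuming the codon-to-amino-acid assignments of NCBI translation table 11, which for sense and stop codons match the standard code. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest) with
# the amino-acid assignments of translation table 11 ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the 10-base block spacing
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence for Ccel_2859
print(translate("ATGAGTGATC ACGCAGTACT TTCCGCATCA"))  # prints: MSDHAVLSAS
```

The output matches the first ten residues of the listed protein sequence.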