Gene Ccel_1087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1087 
Symbol 
ID7309902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1340592 
End bp1341767 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content35% 
IMG OID643608011 
Productprotein of unknown function DUF43 
Protein accessionYP_002505426 
Protein GI220928517 
COG category[R] General function prediction only 
COG ID[COG1568] Predicted methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGATG TGCAGGATAT TGTTTTTAAA GTATATGAAA ATGTACACTT GGAAGAAGGA 
ATAGTAGTCC TCAAGAATTT CCTTGTAAAT GCGTATATGT ACAGGGGTAC TTCCGTAAAG
GAAATGTCCC GTATGTTGAA TTTACCAGTT CCTGTTGTGT CTGCGATTAA AAATGAATTT
AAAAAGAATG GGATTGTAGA TTTAAGCAAC GGCATAGGTT TGACAAAGAA TGGAGAAATC
TATGTGAAGG ATGTACTGGG GTATAAAAAT GCCGATACGG ATGTTCTGAA GGATATTTTG
GAAAATACTG GTATTGACCT TTCAAGGTTT GAAAAAGAGA TTGAAGAGTT GGGAGCGATT
TACCAAAACA GGCCTGAAGT CGATGTTGAG GTGGACCAAT CCAAGTGTAC AGCGGAAACA
GGGATGAAAA GGGCTGTACT CATGCTTAAA TCAGGCTGCC TGATAGGCAA AAAAATTGCT
TGTATTGGAG ACGATGATTT AACAAGTATA GCTATTGTTT TATTGTTAAA GCATATAGCA
GTAAGTGACA ACTTAAGCGG TATGGCGGAT ATTACTGTTT TTGATATAGA TAAACGGATA
TTGTCTTATA TAAAAAAAGT TTCCGAAGAA TATAAGATAG ACATAGAATG TATACAGCAT
GATTTATGCA ATCCCATTGA TAATCAGTAC AAAAATAAGT TTGATTGCAT TACTACTGAT
CCACCGTATA CATTAAACGG ACTGAACCTA TTTTTAAGCA GAGGTATTTC GGTTCTAAAA
AAAGAATCAA ATCTAAGTGT GTTTTTGTCC TTTGCACATA AAACTCCTCA GATTAGGTTT
TTAATGCAGC AGTTATTTGT GAACGAAGGT TTGATTTTGT CAAATATATA TCCCAAATTT
AATGTTTACG AAGGAGCACA AATACTTGGC GGTGTGAGCG ACCTTATGAT TCTTACCACT
ACTGCCCAGT ATACAAAAGA ATTAATTTCC GGCATATTCA GCGATGAAAT ATATACCGGG
AAGTTTAAAC AGACAATCAG AACATATGAG TGTAAGCAAT GCAGTGAGAA ATATTTGGTT
GGAATGAATC AGAAAATTAC AACGATTGAA CAGCTTAAAT CACAGGGATG TTTAAAATGC
CCTGCAAATA AGTTTAATTT AATAAGAAAA GGATAG
 
Protein sequence
MQDVQDIVFK VYENVHLEEG IVVLKNFLVN AYMYRGTSVK EMSRMLNLPV PVVSAIKNEF 
KKNGIVDLSN GIGLTKNGEI YVKDVLGYKN ADTDVLKDIL ENTGIDLSRF EKEIEELGAI
YQNRPEVDVE VDQSKCTAET GMKRAVLMLK SGCLIGKKIA CIGDDDLTSI AIVLLLKHIA
VSDNLSGMAD ITVFDIDKRI LSYIKKVSEE YKIDIECIQH DLCNPIDNQY KNKFDCITTD
PPYTLNGLNL FLSRGISVLK KESNLSVFLS FAHKTPQIRF LMQQLFVNEG LILSNIYPKF
NVYEGAQILG GVSDLMILTT TAQYTKELIS GIFSDEIYTG KFKQTIRTYE CKQCSEKYLV
GMNQKITTIE QLKSQGCLKC PANKFNLIRK G