Gene Ccel_3420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3420 
Symbol 
ID7311980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3974126 
End bp3975415 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content36% 
IMG OID643610325 
Productprotein of unknown function DUF815 
Protein accessionYP_002507688 
Protein GI220930779 
COG category[R] General function prediction only 
COG ID[COG2607] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00358176 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATATTA ACACAAATGA ACTAATCCTC TACAAGCATT TCCATCGATC TGATATCTTT 
GAAGATATTT CGTGGGTTAT AAACAATTAC CAAAGCAAAG CTTTTTCTAA AGAGAATGTG
AAGACAAGAC TATATGAGGG GTTGCATAAA TTAATTGAAT TTTCAGTTGA CCATTGTTTT
GATGGCAATT TATTCCATTC ATACCTCACC TATGTTCTTA TAACCAATGA AAATGCCTTT
GCAAGGGCTT GTGAGATATC GGGAAAACCA AGCGGTACAA TTAATGACCT TGCTTTAAGT
GATTTTAGGA TTTTTAGAAA GCTATATGAT TTTGATTTAT CCTTGATTGA CGTAGTTCTC
CAAGTACAGT GTTTTAGCAT CGTTACAAGC TACATGCCCT CGGATATCTG TAAGAACCGG
AACAAAGGAA TACCTTATTT TAAAGAACTT GTTAACAACC TATCAACATC TGAGAATGAA
GAATCATTCT ATGCAATACT GACTGAATTT TACAGGAAGT TTGGAGTTGG GTTACTTGGT
CTTAACAAGG CATTCAGTGT TATTCATTAT GAAAATGACA TTAAACTCAA GCCTATTACA
AATATTGAAT CTGTACTTCT AAATGATCTT GTCGGCTATG AGGCTCAAAA AAAAGAGCTT
GTTATGAACA CAGAAGCTTT TGTGCAGGGC CGCAAAGCCA ACAATGTCCT TTTGTACGGT
GACAGTGGTA CAGGTAAATC TTCAAGCGTT AAAGCAATCT TAAATGAATA CTATAAACAT
GGCCTCAGAA TGATAGAGGT TTACAAGTAT CAAATGAAGG ATTTACCCCT TATAATAGGG
CATATAAAAT ATAGGAAGTA TAAGTTTATA ATTTACATGG ACGATTTATC TTTTGAGGAC
CATGAAACTG ATTATAAGTA CCTGAAGGCA GTCATTGAAG GCGGCTTGGA GGCGAAACCT
GAAAACGTTC TCATATATGC AACTTCTAAC AGAAGACATA TTATCCGTGA GAAATGGAGT
GACAAAAATG ACAGGGATGA TGACCTCCAT ACTAACGATA CAGTTCAAGA AAAACTTTCA
CTATCAGCCA GGTTCGGATT ACCGATTCTA TATATAGCTC CCAGTAGAAA GGAGTTTCTT
CATATTGTTA AGGTACTTGC AGATAAATAC CGTATTAATA TACCAGAAGA AGAACTATAC
CTGGAGGCAA ACCGGTGGGA ACTACGTAAC GGTGGCCTAT CAGGAAGAAC CGCTCAGCAT
TTTATCACTT ATCTGCTTGG AAAAAAATAA
 
Protein sequence
MYINTNELIL YKHFHRSDIF EDISWVINNY QSKAFSKENV KTRLYEGLHK LIEFSVDHCF 
DGNLFHSYLT YVLITNENAF ARACEISGKP SGTINDLALS DFRIFRKLYD FDLSLIDVVL
QVQCFSIVTS YMPSDICKNR NKGIPYFKEL VNNLSTSENE ESFYAILTEF YRKFGVGLLG
LNKAFSVIHY ENDIKLKPIT NIESVLLNDL VGYEAQKKEL VMNTEAFVQG RKANNVLLYG
DSGTGKSSSV KAILNEYYKH GLRMIEVYKY QMKDLPLIIG HIKYRKYKFI IYMDDLSFED
HETDYKYLKA VIEGGLEAKP ENVLIYATSN RRHIIREKWS DKNDRDDDLH TNDTVQEKLS
LSARFGLPIL YIAPSRKEFL HIVKVLADKY RINIPEEELY LEANRWELRN GGLSGRTAQH
FITYLLGKK