Gene Ccel_3398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3398 
Symbol 
ID7311960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3942231 
End bp3945155 
Gene Length2925 bp 
Protein Length974 aa 
Translation table11 
GC content38% 
IMG OID643610302 
Producthypothetical protein 
Protein accessionYP_002507666 
Protein GI220930757 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000354514 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAATC TCAGAAAGCT TACTGCAGTT GTTATAGCCG TAGCATTGGT ACTAACATCT 
ATGACTGCTG CATTCGCTGC TTCAGGTTCA TATGAATTTG AAGATCAAGC AACAGTACTT
AAGGATCTTG GCATTTGGCA GGGAGACACT ACCGGTGACT TGATGCTTGG TGAAGATTTA
ACTAGAGCAC AAGGTGCTGT ATTAGTACTA AAGACCGTAT TAGGAAAGAC TGACAAAGAT
ATGGAAGCTG CAGATGTTTC CAAAATCGCA AGCTTTGATG ATGCTGACGA AGTTCCAGCA
TGGGCTGAAG GTTGGGTAGC TCTTGCTGTT CAAGAAGGCG TTATGAAGGG TGGCAACAAC
AAGTTAGCTG CTGGCGATCC TTTAAAGGGA AAAGATCTGG CATCTATGTT CATGAACGCT
CTTGGTTTTG CAGCTGAGAA CGATTATGCT ACATCAGTTG AATTGTTAGC TGCTAAGTCA
GCTGGTAAAA TTCTTGTAGC TATCGCTGAT GATATTACTG ATGCAGATCT TACAAGAGAT
GCTGCTTCCG CAGTAGTATT CGACACTTTA ACTGTTAAGG CTAAAGATGC AACTAAGACA
GTTGTTGAAG TTTTAGTTGG AACTGACGCT ACTAAGAAGG CAGTTGCTGA AAAAGCTGGT
TTGATAGTTG CTCCAGCTGC TCAGACAGTT ACTGACGTTA AGCCTTTAAA CTTGAAGCAG
GTTCAAATTA CATTTGCAAA AGACCTTGTA AAGGCTGATG CAGAAAAGAT AGCAAACTAT
GTTGTAACTG AAGGTACAAC TGACAAAGCT ACAGGTGGAA GTGTTGCTCT TCAGGCTGAT
GGAAAGTCAG TTATAGTAAC ATTGGGCCAG GGTATCACAA ATGGTGCAAC TGCTGTTGTT
GAAGTAAAGA ATTTTGCTAC ATACAAAAGT GATGCTGTGA AGTTTGAAGA CTCTACTGTA
CCTACAGTAC TTGGTGTTAC AGTATCTGGT CCAAACACTC TTACTGTAGA ATATAGTGAA
CCAGTTCAAC TTAAGTCATC TACGACACCT GTAACAGATG CTATCTCTGG TGGAGAATAC
AAGATTGATG GTGGAAACTA TATCCTTACT GATATTGAAA TAAATATTAA TAAGGTTACT
TTAACAGTAG GTGTTCCACT AACAGAGGGT GCACACAAGG TAAGCTTTGA GTCAAAGGGA
TTCATATTTG ACTATGCAGG ATACAATGTA CTGCCTAAGA CTGTTGAATT CACAGTAACA
AATGATAACA CAGCTCCTGT ACTTACATTA AAGTCAGCTG ATCCGAAGCA AATAGTTTTG
ACTTCTAACA AGCCTTTGAA AGAAGATAGT GTTAAGAGCG GTAACGTTAG ATACAGACAT
ACATACAATA CTGACACATA TGTTGTAAAA GGTAACGATA CAAAAACAGT TGATTTAGAG
ACTATTAGTA AAGTTACTCT CACTGATTCA GGAACCACAC TTACAATTGA CTTCTCAGGT
AATGTAATAC CATTGGGAGC AACTAATCTC TATATTGGAT ATGATGATGC AAATGGCACT
CAAATTCAGG ACTTATGGGG AAATAAATTA CCAGCTACAA CTATTCCTTT GAACATAACT
CTGGATACTG TTAAACCAAC AGTTACAGAA GTTAAGTTTG ATAATACATT GCAGTTAACA
GTTGTTTTCT CAGAAAAACT AAATAAGGCA TCAGCAGAAG ATAAAGGAAA CTATGTAATC
AAAGATTCAG CTGGAAAGGT TATAGCTGTT ACGGGTGCAA CACTTGTTAA TGACGATTCA
CTTAATAAGG TACAGCTTGC GTTTACAGAA GAATTGGGTG GTGGATCCTA TACTATCGAA
ATAAAGGGTG TTAAGGATGA CGCTTTTGTA AATAATGCAA TGGACACTTA TACATCAACA
TTGAACTTTA CTGATAAAGT TGCTCCAAAA GTAACTGTTG CTTCAGCAAG AATTGTTATC
AGTAAAGATT CAGCAAATAA TGCAGACAAG AAGGCATCCA TCTACATTCC ATTCAGTGAA
CAAATGGATC CTACTACTCT TGTGAAGGCT AACTTCATGA AGGCAATTGG TGACCCTTTA
TCAATAGATA CTAAGTTCGT AGCTTTGGGT GACAACGATA CAGTTACTCC TGCTGCTGAC
GGAAAGTCAG TTACAATTGT GTTAGATAAG AATGCTGATG CTTTCGTATA TGATCAAGTT
CAGATCAAGG TTGGTCTTGT AAAAGATGTT GCAGGAAATA CTCTGGCAAC TTATGTTCAA
GATGTAAAAC CTGCAAAGGA TGCTATAAAA ATCGAAAAGG TTGAAGCTAT AGCTAAAAAG
CAGATAAAGG TTACATTTGA TGGTAGACTT TCTACAATAA CTGCTAAAGG ATTTAAACTC
GCAAATGAAG CTGGTGAGCA AATCGCGTTG TCAGTTGCAA GTGTGGCATT GAACGACGAT
GGAAAGTCAG TAGTTGTATT CAACCTTGGA GCAGAACTCA AAGAGGATGC AACATATGCT
AATAAAGAAG CTGTAACAGT AGTATTGTTG TCTGTTGATG AAGCTGTTGC TTTGGATACT
AAATCATACT TAGGTGCGGT TATTTCAACT AGTTCTGAAA CCGCTTCAGA CGTAATTGTT
CCTACTGTTG ATACAACAAC GGTTATGGCT GATGGTACTA TTCAAGTTAC ATTCTTTGAG
AAAATTGATG CTTCAACTTT AGCAGCTAAA ACATTAAATG GTTTCTCAGT ATCAGGAGAT
GTTAAGATAA AATCAGTTGG CGCTTCAGGT AAGGTAATAA CTCTTACACC TGAAGATGGT
AAGAAGTTCT CAGACTCAAC TGTTGTTAAA TACAACTCAG TTGCTGGAAT TACTGATGAA
TCTGGAAACA AGGTTGCTGA CTTCGAAAAA ACAGCTAAAA AATAA
 
Protein sequence
MRNLRKLTAV VIAVALVLTS MTAAFAASGS YEFEDQATVL KDLGIWQGDT TGDLMLGEDL 
TRAQGAVLVL KTVLGKTDKD MEAADVSKIA SFDDADEVPA WAEGWVALAV QEGVMKGGNN
KLAAGDPLKG KDLASMFMNA LGFAAENDYA TSVELLAAKS AGKILVAIAD DITDADLTRD
AASAVVFDTL TVKAKDATKT VVEVLVGTDA TKKAVAEKAG LIVAPAAQTV TDVKPLNLKQ
VQITFAKDLV KADAEKIANY VVTEGTTDKA TGGSVALQAD GKSVIVTLGQ GITNGATAVV
EVKNFATYKS DAVKFEDSTV PTVLGVTVSG PNTLTVEYSE PVQLKSSTTP VTDAISGGEY
KIDGGNYILT DIEININKVT LTVGVPLTEG AHKVSFESKG FIFDYAGYNV LPKTVEFTVT
NDNTAPVLTL KSADPKQIVL TSNKPLKEDS VKSGNVRYRH TYNTDTYVVK GNDTKTVDLE
TISKVTLTDS GTTLTIDFSG NVIPLGATNL YIGYDDANGT QIQDLWGNKL PATTIPLNIT
LDTVKPTVTE VKFDNTLQLT VVFSEKLNKA SAEDKGNYVI KDSAGKVIAV TGATLVNDDS
LNKVQLAFTE ELGGGSYTIE IKGVKDDAFV NNAMDTYTST LNFTDKVAPK VTVASARIVI
SKDSANNADK KASIYIPFSE QMDPTTLVKA NFMKAIGDPL SIDTKFVALG DNDTVTPAAD
GKSVTIVLDK NADAFVYDQV QIKVGLVKDV AGNTLATYVQ DVKPAKDAIK IEKVEAIAKK
QIKVTFDGRL STITAKGFKL ANEAGEQIAL SVASVALNDD GKSVVVFNLG AELKEDATYA
NKEAVTVVLL SVDEAVALDT KSYLGAVIST SSETASDVIV PTVDTTTVMA DGTIQVTFFE
KIDASTLAAK TLNGFSVSGD VKIKSVGASG KVITLTPEDG KKFSDSTVVK YNSVAGITDE
SGNKVADFEK TAKK