Gene Ccel_3019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3019 
Symbol 
ID7311627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3569800 
End bp3571629 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content45% 
IMG OID643609921 
ProductS-layer domain protein 
Protein accessionYP_002507291 
Protein GI220930382 
COG category 
COG ID 
TIGRFAM ID[TIGR02543] Listeria/Bacterioides repeat 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGC GAATTTTATC CATAATGCTT GTAGTATTGA TGTGTTTATT ACTGTTTCCT 
CTCTCTGCAG CAGCGGAAGC AGTAAATGTG AATGCAACCT ATGCTAACGG AGTTGTGACG
ATTGCCGGTA CAGGCTTTAC AAGCGGCACC AGCTATTCAG TCAGAGTGGT AGATATAGTT
AATTCAAGCA TTAAGGCAAT GGGACAAACT ACGGCAGATG GAAGTGGAGA CATTTCAGCT
TCTATCACAA CCGGTGCTTT GGGAACACTG GCAAACTATA CAGTATATGT AAATAAGCCA
GACGGTACAT TTGCGGGGTC AGATACAACT ATTGTGGCAG ACATTACAAC CCACACTGCC
ACAATACAGG CAGGCACTGG CGGTACAATA ACCACAGGTG CGAGCGGAAG CTATGCCGCT
GGGGCGACAA TCACACTGGT AGCATCAGCA AACAGTGGGT ATGTTTTCAG TAGCTGGACT
TCAAATGCTG GTGGTACGTT TGTAAATGCA AACAGCGCAT CTACTACATT TACAATGCCT
GCTGCCAGTG TTACTATAAC AGCTAATTTT ACATACTCCA GTGGCAGTGG CGGCGGAGGC
AGTGTAATGC CTACTCCTAC GCCTGAGCCT GTCACTACAA AAGACGGCAA TGCCACAACC
GTATCCACTT CGGTAAAGGC AACGACTGAC TCAACAACAG GTACAGCTAC TGCAAGTGTT
GAAGCAAGTG CCTTTAACTC TTTAACTGAC AAGGCAAAAG AGGCTGAAAC TTCTGGACAA
AAAGCAGTGG TTGAAATAAA GGTTGGGGTT GCGGCAAACA CAACGGCTGT CACGGTGGAA
ATTCCGAGAG ATGCTTTTAA TAAGGTAGCA GAGGAAACCA AGGCAGACGT TAAAGTTGAC
GCAGGTATTG GAACTGTTAC CTTTAACACA AAGGCTGTTG AATCCATCAG TGGTGCTGTA
AATGCCGGAA ATATCTCTAT CAGCATTACC AAGGTGGACG CCTCTACCTT GACATCGGAG
GTTCAGGCAA GGGTTAGTGA AAGACCGGTA TTCGACTTCT CTGTCAAGTC AGGTAGCACT
GATATATCCA ACTTTAGGGG TGGAAACGCT AAAATCAGCA TACCCTATAC CCTAAAGCCT
GGCGAGAAGG AGAATTCCGT TGTTGTTTAC TATATCGATA ATACAGGGAA TCTCAAAACG
GTCAGGGGCA GGTATGACCC AACAACAAGA ACGGTTAATT TTACTACCTC CCATTTTTCA
CAGTATGCAG TAGGATACAA CGAAGTGAAC TTCAAAGATG TTGCAGCTAA GGCATGGTAT
AATGAGGCAG TAGGGTTTAT GTCGGCAAGA GGTATTGTTA ACGGCGTAGG TAGTGGCAAG
TTTGCACCTG CAAATAGTGT GACCCGTGCT GATTTCCTTA TCATGGTAAT GAATTCCTAT
GGTATAGAGA TTGACACAAC AATAACTGAT AATTTTGCTG ATGCAAGCAA CAAATACTAC
ACCAAGTATT TGGGAACGGC AAAACGTCTG GGGTTGGTGT CTGGTGTAGG TGAAAACAAA
TATGCACCAG AAGCTACCAT TAGCCGACAG GACATGTTTG CCATACTGTA CCGTGCATTG
GACAAACTAG GTGAACTGCC AACAGGTACA ATCGGCAAAA GCCTTGGAAG CTTCAGTGAC
GCAGGAGACA TAGCTGGTTA TGCAAATGAT GCCATGAAGC TGTTTGTGGA AACCGGGACT
ATTTCAGGAG ATGGGAGTAA GTTGACTCCT AAGGCGACCT CCACAAGGGC ACAGGCAGCA
CAGGTGTTAT ACAATCTACT TTTAAAATAG
 
Protein sequence
MNKRILSIML VVLMCLLLFP LSAAAEAVNV NATYANGVVT IAGTGFTSGT SYSVRVVDIV 
NSSIKAMGQT TADGSGDISA SITTGALGTL ANYTVYVNKP DGTFAGSDTT IVADITTHTA
TIQAGTGGTI TTGASGSYAA GATITLVASA NSGYVFSSWT SNAGGTFVNA NSASTTFTMP
AASVTITANF TYSSGSGGGG SVMPTPTPEP VTTKDGNATT VSTSVKATTD STTGTATASV
EASAFNSLTD KAKEAETSGQ KAVVEIKVGV AANTTAVTVE IPRDAFNKVA EETKADVKVD
AGIGTVTFNT KAVESISGAV NAGNISISIT KVDASTLTSE VQARVSERPV FDFSVKSGST
DISNFRGGNA KISIPYTLKP GEKENSVVVY YIDNTGNLKT VRGRYDPTTR TVNFTTSHFS
QYAVGYNEVN FKDVAAKAWY NEAVGFMSAR GIVNGVGSGK FAPANSVTRA DFLIMVMNSY
GIEIDTTITD NFADASNKYY TKYLGTAKRL GLVSGVGENK YAPEATISRQ DMFAILYRAL
DKLGELPTGT IGKSLGSFSD AGDIAGYAND AMKLFVETGT ISGDGSKLTP KATSTRAQAA
QVLYNLLLK