Gene Cphy_1163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1163 
Symbol 
ID5742886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp1475031 
End bp1476437 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content36% 
IMG OID641292268 
Productcellulase 
Protein accessionYP_001558280 
Protein GI160879312 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA TCAGAAGAAT AGTTTCATTG TTTTTGATTG TAACAATCTT TATCACGACA 
TGCTTTTTTA ATGTTGGTCA AAAGGTATAT GCTGCTGATA CGAATAATGA TGATTGGCTA
CATTGTGTAG GCAATAAAAT TTATGACATG AATGGCAATG AGGTTTGGCT GACCGGTGCG
AATTGGTTTG GTTTTAACTG TACTGAAAAT GTATTTCATG GTGCATGGTA CGATATTAAG
GGGATGTTAA CTAATATTGC AAACAGAGGA ATAGGATTTT TAAGAGTTCC AATTTCAACG
GAACTTTTGT ATAGTTGGAT GATAGGCAAA CCTAATAAAG TTTCAAGTGT GACCGCTGTC
AATAATCCAC CTTATTATGT ATGCAACCCT GATTTTTATG ATCCTACAAC AAATAGTGTT
AAAAATAGTA TGGAAATATT TGATATCATT ATGGGATACT GCAAACAATT GGGGATCAAA
GTAATGGTAG ATGTTCATAG TCCGGATGCA AATAATTCAG GTCATAACTA TCCGTTATGG
TATGGGTTAA CTACGACTAC TGCAGGTGAA ATAACGACAG ATAAGTGGAT CAATACTCAA
GCATGGCTGG CTGATAAATA CAAAAATGAC GATACTATTC TGGCATTTGA TATAAAAAAT
GAACCTCATG GACAGAGGGG ATATAGTACT ACAACACCTA CTAATATAGC AAAATGGGAT
AATTCCACAG ATGAGAATAA CTGGAAGTAT GCGGCGGAAA GATGTGCGAA AGCTATACTT
GCTAAAAACC CTAAATTATT AATTATGATT GAAGGTGTTG AACAATACCC TAAAACTGAA
AAAGGTTATA ACTATAATAC ACCGGATGTA TGGGGAGCTA CTGGTGATCA GTCTCCATGG
TATAGTGCTT GGTGGGGTGG AAATTTAAGA GGAGTAAAGG ATTATCCAAT TAATATAGGC
ACTCTCAATA GTCAGATCGT CTATTCCCCT CATGACTATG GTCCTTCCGT ATACAACCAA
CCATGGTTTG ATAAGGATTT TACAACTCAG ACCCTATTAG ATGATTATTG GTATAATACT
TGGGCATATA TTAAAGATAA AGGTATTGCA CCACTTTTGA TAGGTGAGTG GGGAGGTTTT
ATGGATGGTG GAAAGAACCA GAAATGGATG ACATTATTAA GAGATTATAT AGTAAATAAT
CGTATCCACC ATACATTCTG GTGTATCAAT CCGAACTCAG GGGATACTGG AGGTTTACTA
GGATATGATT GGCAAACTTG GGATGAAGCA AAATACGCTT TATTAAAACC TGCATTATGG
CAGTCAAATG GTAAATTTAT TGGTCTAGAC CATCAGACAC CTCTTGGTGT AAATGGTATA
TCATTAGGGC AATATTATGG AAAATAA
 
Protein sequence
MSKIRRIVSL FLIVTIFITT CFFNVGQKVY AADTNNDDWL HCVGNKIYDM NGNEVWLTGA 
NWFGFNCTEN VFHGAWYDIK GMLTNIANRG IGFLRVPIST ELLYSWMIGK PNKVSSVTAV
NNPPYYVCNP DFYDPTTNSV KNSMEIFDII MGYCKQLGIK VMVDVHSPDA NNSGHNYPLW
YGLTTTTAGE ITTDKWINTQ AWLADKYKND DTILAFDIKN EPHGQRGYST TTPTNIAKWD
NSTDENNWKY AAERCAKAIL AKNPKLLIMI EGVEQYPKTE KGYNYNTPDV WGATGDQSPW
YSAWWGGNLR GVKDYPINIG TLNSQIVYSP HDYGPSVYNQ PWFDKDFTTQ TLLDDYWYNT
WAYIKDKGIA PLLIGEWGGF MDGGKNQKWM TLLRDYIVNN RIHHTFWCIN PNSGDTGGLL
GYDWQTWDEA KYALLKPALW QSNGKFIGLD HQTPLGVNGI SLGQYYGK