Gene Cphy_3404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3404 
Symbol 
ID5743681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4166683 
End bp4168008 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content38% 
IMG OID641294510 
Productglycoside hydrolase family protein 
Protein accessionYP_001560496 
Protein GI160881528 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5520] O-Glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.316905 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCGT TAATTTATAA AAGTAATGAA ACCATGAGAT ATCAAAAAAG CGAAGTTTCC 
TTTGTTGAGA ATCCTCGTGC AGAGATGAAC CTAATCAAAC TATATCCTAA GGAAACAAGG
CAGACAATCT ACGGCTTTGG TGGTGCATTT ACCGAAGCAG CGGCGGTAAC CGTAGCTTCC
ATGAGTGAGA CTTCAAAAAA GAAGGTGTTG GATGCATATT TTTCTAAGGA TGGACACAAG
TATAACTTTT GTAGAACACA TATCCAAAGC TGCGATTTTT CCCTTGGTAA TTATGCCTAT
GTGGAGGATC CTGAAGATAA AGAGCTAAAA ACCTTTGACT TAAAACGTGA TCATCAATAC
TTAATCCCAT TTATCAAAGA TGCGCTTACT CTAAATCCAT CGCTTATCTT AGTAGCAAGT
CCATGGAGTC CGCCAGGATT TATGAAAAGC AATGGTGAGA TGAATCATGG TGGTGTACTA
AAAAAAGAAT ATTATCAGAT GTGGGCTGAT ATGATAGTGC GTTATCTTAA AGAATACGAA
AAGCTAGGGA TTAACGTACA ATACTTATCA GTGCAGAATG AACCAAAAGC TACACAAACA
TGGGACTCCT GCCTTTATAC AGGGGAAGAA GAGGGAGTAT TTGCAGCGGA ATATCTGAGG
AAAACACTAG ATGCAAATGG GTATCCTCAT GTAAAAATAG CAATCTGGGA CCATAACAAA
GACTGTATCA TAGAAAGAAC AGAGGAAACA TTTGCTGTAC CAATGGCAAG AGAAAGTGTC
GCTGCGATTG CATTTCACTG GTATTCCGGG GATCATTTTG AAGCTTTGCA GACCGTGAAG
GAAAAATATC CTGAAAAAGA GCTTATCTTT ACAGAAGGGT GTGTAGAATA TTCCAGATTT
AAGACAAATA GTCAAGTGAA GAATGCGGAA ATGTATCTTC ATGATATCAT CGGTAACCTT
AATTCTGGTA TGAATGCCTA TATTGACTGG AACCTTGTAT TAAATGTTGA TGGAGGACCA
AATCATGTAG GTAACTTTTG TGATGCTCCT GTTATGTATG ATAAGGAAAC GGATGAGCTT
GATTTTAAGT TATCCTATTA TTATTTGGGA CATTTAAGTC GTTTTGTAAC AGAAGGTGCG
AAAAGGTTCG TGGTATCTCG CTGTACCGAT AAAGTGGAAG CGGTAGGATT TCTAAATCCA
GATAACAGTA AGGTTTTAGT ATTGATGAAT CGGACCGAAG AAGATAAGGT ATTACAAATC
TGTGAGGGAA ATAAGGTAGC AGATATCCAT TTAGAGGCGC ATTCGATTAT GACAATTTGT
TGGTAG
 
Protein sequence
MKALIYKSNE TMRYQKSEVS FVENPRAEMN LIKLYPKETR QTIYGFGGAF TEAAAVTVAS 
MSETSKKKVL DAYFSKDGHK YNFCRTHIQS CDFSLGNYAY VEDPEDKELK TFDLKRDHQY
LIPFIKDALT LNPSLILVAS PWSPPGFMKS NGEMNHGGVL KKEYYQMWAD MIVRYLKEYE
KLGINVQYLS VQNEPKATQT WDSCLYTGEE EGVFAAEYLR KTLDANGYPH VKIAIWDHNK
DCIIERTEET FAVPMARESV AAIAFHWYSG DHFEALQTVK EKYPEKELIF TEGCVEYSRF
KTNSQVKNAE MYLHDIIGNL NSGMNAYIDW NLVLNVDGGP NHVGNFCDAP VMYDKETDEL
DFKLSYYYLG HLSRFVTEGA KRFVVSRCTD KVEAVGFLNP DNSKVLVLMN RTEEDKVLQI
CEGNKVADIH LEAHSIMTIC W