Gene Cphy_1714 details


Gene Information

Locus tag: Cphy_1714
Symbol:
ID: 5741465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 2106148
End bp: 2109003
Gene Length: 2856 bp
Protein Length: 951 aa
Translation table: 11
GC content: 38%
IMG OID: 641292814
Product: glycoside hydrolase family protein
Protein accession: YP_001558825
Protein GI: 160879857
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4724] Endo-beta-N-acetylglucosaminidase D
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
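
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both are assumptions inferred from the values in this record, not statements from the database. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(2106148, 2109003)
print(length)                     # 2856
print(protein_length_aa(length))  # 951
```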


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAC AATATTATAA GTTTGTGTCA GTAGCCCTGA TTGCTACAAT GTTCGTTACC 
GGCTGTAGCA AGAATTCTTC CGACATAAAG GATAATGCCA CAAACTCAGA AGCAGTTGTC
ACAGAGGCAG TACAAGTACC GGAGAAAGAG CAGGCACCGG AGGAAAATCA AGCACCAGGG
AAAATAGTTT ATAGGGTAAC ACCGGAATCA GAATCCGATA TGAAAATGAT GAATATACCA
AATGCTCCTT ATTGGTTCCC GGCAGAGCTG TTGGAATGGA ATCCGAAGGA AGCCAAAGAT
TTAGATTTGT ATGTATCGAA TGTTCCCCTG AAAAGTAGAG TAGATAAGAG TAAGCTAGTG
AAAAGTAATA AGACTCAGAT AGCAGATTTT AAAGTTGCTG CATTGTCCAT CATGAATTCC
AGGACAAGCG GAAATTCTCC GCATGGATTG AATCGCTTCA ATGCAAATAC ATTCTCCTAT
TGGCAATATG TAGACCAATT AGTATATTGG GGAGGTTCTG CTGGAGAAGG CCTCATTGTA
CCACCTACGG CAGATGTAAC GGAAGCAGCT CATACCAATG GTGTTCCTGT ATTAGGTACC
ATATTCTTAG CACCGGAAGC ATGGGGCGGT AAAATAGTAT GGTTAGATGA TTTCTTGAAG
ACCGATGAGA ATGGTCAATT TTTATTAGTG GATAAGTTAA TCGAAGTATG TACGGTACTT
GGCTTTGATG GCTGGTTCAT TAATCAAGAA ATGGGCGGAA CGAAGGAAGC TCCATTAACG
AAAGAACATG CTGATAAAAT GTTATCATTT GTACAGCAAT TTAATGAAAA GTCAGCAGGT
AAGTTCTCCT TCATATGGTA TGATTCAATG ACTGAAGAGG GCGAAATTGA TTATCAAAAT
GCTCTTACTG ATAAAAATGA TGCTTTCCTG GTTGATGGAA ATGGGAAAAA AGGTGCGGAC
AGCATGTTTT TAAATTTCTG GTGGGCAGCA CGTGAATTAG CAGACCAGGA GTTGTTAGTA
AAATCCAATG AAAAAGCGAA AGAAAAAGGA ATTAATCCTT ATGATCTTTA TGCAGGCATT
GATGTTCAGG CAAATGGTGT TATGACCCGA ATTAACTGGG ATTTATTCGA GAAGAATAAT
GTACCGCATA CATCCATAGG GTTATATTGT CCAAGCTGGA CCTATTTTGC AGCAGATAGT
TTAGATGCTT TCGAGAGATA TGAAAATCGT ATGTGGGTTA ATGAACATGG TAATCCAACG
ATTCCGACAG AGGTTTCACA ATATGACTGG CATGGTATTT CTACATATGC AATCGAAAAA
ACAGTTGTAA ATCAATTACC TTTCATCACA AATTTTAATA AAGGAAATGG ATATAACTTC
TTTGTAAATG GTGAGAAAAT CTCTTTACAG GATTGGAATA ACAGAAGTAT TACAGATGTT
ATGCCAACCT ACCGCTGGAT TTTAAATCAG GAAGAAGGTA ATAAATTAAA ACCGGGTATT
GATTATGCAG ATGCATATTA TGGCGGAAAC TCAATACGCT TATATGGTGC AATGGAAACT
GATAAAAAGT CTGAATTAAC ATTATATAGC TCTGATCTGG CAATTGGCGA TAAAACGAAG
GCCTATGCGA TGCTGAAAGC AAACTATAAA GTAAATGCAG AGTTAGTATT AGAATTAGAA
GATGGTTCCA AAGAATATAT CAAAGGGGAT AATACTCTAA ACACTGAATG GGCAAAAATA
GTGTTTGATC TATCTAAAGT AAAAGGAAAG ACTGTTAAAA AGATTGGCTT TAACTTTACC
TCTTCTAAAT CGGATATGAT TCTCATCAAT ATGGGTAATC TTACTATTTT TGAAGAAAAC
ACAGTGGCTC CAACGACTCC TGGTAATTTA AAGATTGCAT ATCAGCAATT TGATGATGAT
GGACTATTTG CAGGGGTTAA GCTAGCATGG GATGGCGCTT CTAATGTTAG ATATTATGAA
GTATATAAAA TTAATGACGA CGGAAGTAAA TCCTTTCAGG GAGCTACTAC AGCAAAAGCA
CATTACATGA ATGCTTTGGT CAGGGAGGAA GGAGAGAAGA AGTCAACCTT TGAAGTAATT
GCAGTAAATA ATATGCAGCA AAGAAGTCAA GGTACTACGG TAGTAATGGA ATGGCCGAAT
ATTAGTTTAC CGAAAGCAAA TATGAAGGCT TCTAAAACCT TAGTTGCTCC TGGGGAAGTA
GTAGAATTTA CAAGCTTATG TTCTCAAAAC ACTACAGGTT ATGAATGGTC CTTTGAGGGA
GCAGATCATG CTGCTTCAAC AGAACAGAAT CCAAAAGTGA CATATTCTAA GGAAGGCGTT
TATAATGTTA GCTTAGTTGC TAAGAATGGG GAAGGAACAG CCGAATATAA AGTGGATGGT
TTGATTGTAG TTCGTAGTGA TGCTTCCGGT GATATTCCTG TAGTTTCAGA AGGTAAGACC
ACAAGTGCTT CCGGTTATGT AAATGAAAAT GAAGCACCTA GGTTTGCAGT CGATGGAAAA
ACAAATACAA AGTGGTGTGC AACAGGTTCA GCACCACATG ATATTACCAT TGACCTTGGT
GAAAATAAAA TGGTTAGTGA AGTTTACATG GCTCATGCAG AAAAAGGAAA CGAAAGCTCT
GATATGAATA CTTCATGGTA TACCATTGAA ACTAGTCTAG ATGGAAAGAG CTTTGAACCT
GCCATTGAAG TTAAAAACAA TGCTTCGGCT GAAACATTAG ACGCATTTAA GCCAAGAGAG
GCACGTTATG TAAAAGTTAC AGTGATGAAA CCAACACAAG GTTCCGATAC CGCTGTTCGT
ATTTATGAAA TTCAAGTTCA TGGGCTAGAT AAATAA
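
The gene sequence is printed in space-separated blocks of 10 bases. A minimal sketch of how such text might be normalized and its GC fraction computed is shown here; the helper names are hypothetical, and only the first row of the sequence is used as input, so the result is not the gene-wide GC content reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


first_row = "ATGAAAAAAC AATATTATAA GTTTGTGTCA GTAGCCCTGA TTGCTACAAT GTTCGTTACC"
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 3))  # 0.333
```

Applying the same two functions to the full listing would give the gene-wide value.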
 
Protein sequence
MKKQYYKFVS VALIATMFVT GCSKNSSDIK DNATNSEAVV TEAVQVPEKE QAPEENQAPG 
KIVYRVTPES ESDMKMMNIP NAPYWFPAEL LEWNPKEAKD LDLYVSNVPL KSRVDKSKLV
KSNKTQIADF KVAALSIMNS RTSGNSPHGL NRFNANTFSY WQYVDQLVYW GGSAGEGLIV
PPTADVTEAA HTNGVPVLGT IFLAPEAWGG KIVWLDDFLK TDENGQFLLV DKLIEVCTVL
GFDGWFINQE MGGTKEAPLT KEHADKMLSF VQQFNEKSAG KFSFIWYDSM TEEGEIDYQN
ALTDKNDAFL VDGNGKKGAD SMFLNFWWAA RELADQELLV KSNEKAKEKG INPYDLYAGI
DVQANGVMTR INWDLFEKNN VPHTSIGLYC PSWTYFAADS LDAFERYENR MWVNEHGNPT
IPTEVSQYDW HGISTYAIEK TVVNQLPFIT NFNKGNGYNF FVNGEKISLQ DWNNRSITDV
MPTYRWILNQ EEGNKLKPGI DYADAYYGGN SIRLYGAMET DKKSELTLYS SDLAIGDKTK
AYAMLKANYK VNAELVLELE DGSKEYIKGD NTLNTEWAKI VFDLSKVKGK TVKKIGFNFT
SSKSDMILIN MGNLTIFEEN TVAPTTPGNL KIAYQQFDDD GLFAGVKLAW DGASNVRYYE
VYKINDDGSK SFQGATTAKA HYMNALVREE GEKKSTFEVI AVNNMQQRSQ GTTVVMEWPN
ISLPKANMKA SKTLVAPGEV VEFTSLCSQN TTGYEWSFEG ADHAASTEQN PKVTYSKEGV
YNVSLVAKNG EGTAEYKVDG LIVVRSDASG DIPVVSEGKT TSASGYVNEN EAPRFAVDGK
TNTKWCATGS APHDITIDLG ENKMVSEVYM AHAEKGNESS DMNTSWYTIE TSLDGKSFEP
AIEVKNNASA ETLDAFKPRE ARYVKVTVMK PTQGSDTAVR IYEIQVHGLD K
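
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is a hedged illustration, not the database's own pipeline: it builds the codon table from the NCBI amino-acid string for table 11 (which assigns the same amino acids as the standard code) and translates up to the first stop codon. It does not handle alternative start codons such as GTG or TTG, which table 11 reads as methionine at initiation; this gene starts with ATG, so that case does not arise here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# NCBI ordering: first base varies slowest, third base fastest, bases in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAAAAAC AATATTATAA GTTTGTGTCA"))  # MKKQYYKFVS
```

The output matches the first block of the protein sequence listed above.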