Gene Information |
Locus tag | Cphy_1714 |
Symbol | |
ID | 5741465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 2106148 |
End bp | 2109003 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641292814 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001558825 |
Protein GI | 160879857 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4724] Endo-beta-N-acetylglucosaminidase D |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
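The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 2106148
end_bp = 2109003

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2856
print(protein_length)  # 951
```

Under these assumptions the result agrees with the listed Gene Length (2856 bp) and Protein Length (951 aa).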
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAC AATATTATAA GTTTGTGTCA GTAGCCCTGA TTGCTACAAT GTTCGTTACC GGCTGTAGCA AGAATTCTTC CGACATAAAG GATAATGCCA CAAACTCAGA AGCAGTTGTC ACAGAGGCAG TACAAGTACC GGAGAAAGAG CAGGCACCGG AGGAAAATCA AGCACCAGGG AAAATAGTTT ATAGGGTAAC ACCGGAATCA GAATCCGATA TGAAAATGAT GAATATACCA AATGCTCCTT ATTGGTTCCC GGCAGAGCTG TTGGAATGGA ATCCGAAGGA AGCCAAAGAT TTAGATTTGT ATGTATCGAA TGTTCCCCTG AAAAGTAGAG TAGATAAGAG TAAGCTAGTG AAAAGTAATA AGACTCAGAT AGCAGATTTT AAAGTTGCTG CATTGTCCAT CATGAATTCC AGGACAAGCG GAAATTCTCC GCATGGATTG AATCGCTTCA ATGCAAATAC ATTCTCCTAT TGGCAATATG TAGACCAATT AGTATATTGG GGAGGTTCTG CTGGAGAAGG CCTCATTGTA CCACCTACGG CAGATGTAAC GGAAGCAGCT CATACCAATG GTGTTCCTGT ATTAGGTACC ATATTCTTAG CACCGGAAGC ATGGGGCGGT AAAATAGTAT GGTTAGATGA TTTCTTGAAG ACCGATGAGA ATGGTCAATT TTTATTAGTG GATAAGTTAA TCGAAGTATG TACGGTACTT GGCTTTGATG GCTGGTTCAT TAATCAAGAA ATGGGCGGAA CGAAGGAAGC TCCATTAACG AAAGAACATG CTGATAAAAT GTTATCATTT GTACAGCAAT TTAATGAAAA GTCAGCAGGT AAGTTCTCCT TCATATGGTA TGATTCAATG ACTGAAGAGG GCGAAATTGA TTATCAAAAT GCTCTTACTG ATAAAAATGA TGCTTTCCTG GTTGATGGAA ATGGGAAAAA AGGTGCGGAC AGCATGTTTT TAAATTTCTG GTGGGCAGCA CGTGAATTAG CAGACCAGGA GTTGTTAGTA AAATCCAATG AAAAAGCGAA AGAAAAAGGA ATTAATCCTT ATGATCTTTA TGCAGGCATT GATGTTCAGG CAAATGGTGT TATGACCCGA ATTAACTGGG ATTTATTCGA GAAGAATAAT GTACCGCATA CATCCATAGG GTTATATTGT CCAAGCTGGA CCTATTTTGC AGCAGATAGT TTAGATGCTT TCGAGAGATA TGAAAATCGT ATGTGGGTTA ATGAACATGG TAATCCAACG ATTCCGACAG AGGTTTCACA ATATGACTGG CATGGTATTT CTACATATGC AATCGAAAAA ACAGTTGTAA ATCAATTACC TTTCATCACA AATTTTAATA AAGGAAATGG ATATAACTTC TTTGTAAATG GTGAGAAAAT CTCTTTACAG GATTGGAATA ACAGAAGTAT TACAGATGTT ATGCCAACCT ACCGCTGGAT TTTAAATCAG GAAGAAGGTA ATAAATTAAA ACCGGGTATT GATTATGCAG ATGCATATTA TGGCGGAAAC TCAATACGCT TATATGGTGC AATGGAAACT GATAAAAAGT CTGAATTAAC ATTATATAGC TCTGATCTGG CAATTGGCGA TAAAACGAAG GCCTATGCGA TGCTGAAAGC AAACTATAAA GTAAATGCAG AGTTAGTATT AGAATTAGAA GATGGTTCCA AAGAATATAT CAAAGGGGAT AATACTCTAA ACACTGAATG GGCAAAAATA GTGTTTGATC TATCTAAAGT AAAAGGAAAG ACTGTTAAAA AGATTGGCTT TAACTTTACC 
TCTTCTAAAT CGGATATGAT TCTCATCAAT ATGGGTAATC TTACTATTTT TGAAGAAAAC ACAGTGGCTC CAACGACTCC TGGTAATTTA AAGATTGCAT ATCAGCAATT TGATGATGAT GGACTATTTG CAGGGGTTAA GCTAGCATGG GATGGCGCTT CTAATGTTAG ATATTATGAA GTATATAAAA TTAATGACGA CGGAAGTAAA TCCTTTCAGG GAGCTACTAC AGCAAAAGCA CATTACATGA ATGCTTTGGT CAGGGAGGAA GGAGAGAAGA AGTCAACCTT TGAAGTAATT GCAGTAAATA ATATGCAGCA AAGAAGTCAA GGTACTACGG TAGTAATGGA ATGGCCGAAT ATTAGTTTAC CGAAAGCAAA TATGAAGGCT TCTAAAACCT TAGTTGCTCC TGGGGAAGTA GTAGAATTTA CAAGCTTATG TTCTCAAAAC ACTACAGGTT ATGAATGGTC CTTTGAGGGA GCAGATCATG CTGCTTCAAC AGAACAGAAT CCAAAAGTGA CATATTCTAA GGAAGGCGTT TATAATGTTA GCTTAGTTGC TAAGAATGGG GAAGGAACAG CCGAATATAA AGTGGATGGT TTGATTGTAG TTCGTAGTGA TGCTTCCGGT GATATTCCTG TAGTTTCAGA AGGTAAGACC ACAAGTGCTT CCGGTTATGT AAATGAAAAT GAAGCACCTA GGTTTGCAGT CGATGGAAAA ACAAATACAA AGTGGTGTGC AACAGGTTCA GCACCACATG ATATTACCAT TGACCTTGGT GAAAATAAAA TGGTTAGTGA AGTTTACATG GCTCATGCAG AAAAAGGAAA CGAAAGCTCT GATATGAATA CTTCATGGTA TACCATTGAA ACTAGTCTAG ATGGAAAGAG CTTTGAACCT GCCATTGAAG TTAAAAACAA TGCTTCGGCT GAAACATTAG ACGCATTTAA GCCAAGAGAG GCACGTTATG TAAAAGTTAC AGTGATGAAA CCAACACAAG GTTCCGATAC CGCTGTTCGT ATTTATGAAA TTCAAGTTCA TGGGCTAGAT AAATAA
|
Protein sequence | MKKQYYKFVS VALIATMFVT GCSKNSSDIK DNATNSEAVV TEAVQVPEKE QAPEENQAPG KIVYRVTPES ESDMKMMNIP NAPYWFPAEL LEWNPKEAKD LDLYVSNVPL KSRVDKSKLV KSNKTQIADF KVAALSIMNS RTSGNSPHGL NRFNANTFSY WQYVDQLVYW GGSAGEGLIV PPTADVTEAA HTNGVPVLGT IFLAPEAWGG KIVWLDDFLK TDENGQFLLV DKLIEVCTVL GFDGWFINQE MGGTKEAPLT KEHADKMLSF VQQFNEKSAG KFSFIWYDSM TEEGEIDYQN ALTDKNDAFL VDGNGKKGAD SMFLNFWWAA RELADQELLV KSNEKAKEKG INPYDLYAGI DVQANGVMTR INWDLFEKNN VPHTSIGLYC PSWTYFAADS LDAFERYENR MWVNEHGNPT IPTEVSQYDW HGISTYAIEK TVVNQLPFIT NFNKGNGYNF FVNGEKISLQ DWNNRSITDV MPTYRWILNQ EEGNKLKPGI DYADAYYGGN SIRLYGAMET DKKSELTLYS SDLAIGDKTK AYAMLKANYK VNAELVLELE DGSKEYIKGD NTLNTEWAKI VFDLSKVKGK TVKKIGFNFT SSKSDMILIN MGNLTIFEEN TVAPTTPGNL KIAYQQFDDD GLFAGVKLAW DGASNVRYYE VYKINDDGSK SFQGATTAKA HYMNALVREE GEKKSTFEVI AVNNMQQRSQ GTTVVMEWPN ISLPKANMKA SKTLVAPGEV VEFTSLCSQN TTGYEWSFEG ADHAASTEQN PKVTYSKEGV YNVSLVAKNG EGTAEYKVDG LIVVRSDASG DIPVVSEGKT TSASGYVNEN EAPRFAVDGK TNTKWCATGS APHDITIDLG ENKMVSEVYM AHAEKGNESS DMNTSWYTIE TSLDGKSFEP AIEVKNNASA ETLDAFKPRE ARYVKVTVMK PTQGSDTAVR IYEIQVHGLD K
|
| |
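The sequences above are printed in blocks of ten characters separated by spaces. A minimal sketch of how such a gene sequence could be cleaned, checked for GC content, and translated is given below. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and not part of any database tool. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may act as start codons; this sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
# Codon table in NCBI order: first, second and third base each cycle
# through T, C, A, G. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGAAAAAAC AATATTATAA GTTTGTGTCA"
print(translate(prefix))  # MKKQYYKFVS
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` would give a figure to compare with the GC content field in the Gene Information table.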