Gene Information |
Locus tag | Cphy_3368 |
Symbol | |
ID | 5741650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 4107033 |
End bp | 4109792 |
Gene Length | 2760 bp |
Protein Length | 919 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641294471 |
Product | cellulose 1,4-beta-cellobiosidase |
Protein accession | YP_001560460 |
Protein GI | 160881492 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
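The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    (both are assumptions, not stated in the record itself).
    """
    gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3             # includes the stop codon
    return gene_length, codons - 1        # drop the stop codon for the protein


# Values taken from the Start bp / End bp fields of this record.
gene_len, protein_len = cds_lengths(4107033, 4109792)
print(gene_len, protein_len)
```

Under these assumptions the result agrees with the Gene Length (2760 bp) and Protein Length (919 aa) fields.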
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00223117 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAAC GTATCTACAA GAGAGTAGCA GCAGCCATTA TGACCGCTGC AATGGTAGTT ACTCTAGTTC CTCAGGGAGC AAAGACATCC GCTTTGGCTG GTGAAACTGA GCAAGCTTTA GCGGCGGCAG CAACAAGGGG AACCTATGAG CAACGTTTTA TGGACTTATG GTCGGATATT AAAAACCCAA AGAACGGTTA TTTTAGTCCT CAGGGAATTC CGTATCATTC TATTGAAACA ATGATTGTAG AAGCTCCTGA TTATGGTCAT GTAACTACTA GTGAGGCAAT GAGTTACTAT ATGTGGCTTG AAGCTATGTA CGGCAAGTTT ACAGGTGACT TTTCTGGATA TGGGACCGCT TGGAATGTAG CAGAAAAATA TATGATTCCA ACGGATGCAG ATCAACCACC AACCAGTATG AGTAAGTATA CACCGAGTAA ACCTGCAACT TATGCACCTG AGTATCAGGA TCCTAGTCAG TACCCAGCGA AGCTCGATTC GAGTGCTCCT GTTGGTAGTG ACCCAATTTG GTCACAGCTT GTTGCAGCTT ATGGACGGAA TACAATCTAT GGTATGCACT GGTTACTAGA TGTTGATAAC TGGTATGGAT TTGGTTCTAG GGGAGACGGA ACCTCAAAAC CATCCTATAT CAACACATTC CAACGTGGAG AACAGGAATC AACATGGGAA ACAATTCCTC AGCCATGCTG GGATACAATG AAATATGGTG GAACGAATGG TTTCCTCGAC TTATTCACTG GCGATAGTTC CTATGCACAG CAATTTAAGT ATACGGATGC ACCAGATGCT GATGCAAGAG CAATTCAGGC TGCTTATTGG GCAAGTGAAT GGGCGAAAGA TTATGGTGTA AATGTCGATA CTTATTCATC AAAAGCTACG ATGATGGGCG ATTATCTTCG TTATTCCATG TTTGATAAAT ATTTTAGAAA AATAGGTAAT TCTACAGTTG CTGGTACAGG ATATGATGCA TCACATTATC TGTTATCCTG GTATTATGCT TGGGGCGGTG GAATTACAGC TGATTGGGCA TGGGTTATTG GATCTAGCCA TAACCACTTT GGTTATCAAA ATCCTATGGC AGCTTGGGTT TTATCTCAAA ATTCGAAGTT TAAACCAAAA ACTACAAATG GACAGGCTGA CTGGGCAACA AGCCTTACTA GACAGCTTGA ATTCTATCAG TGGTTACAAT CTTCAGAAGG TGGGATTGCC GGTGGTGCTA GTAACTCAAA GAATGGTCGT TATGAAACTT GGCCAGCTGG AACAGCTACA TTCTATGGAA TGGGCTATGA AGCAAACCCA GTTTATAAAG ATCCAGGAAG TAATACCTGG TTTGGTTTCC AGGCATGGTC AATGCAACGT GTTGCAGAAT ATTATTATAA GACAAATGAT GTAAAAGCAA AACAGATCCT AGATAAATGG GTAGCTTGGG TTAAATCTGT GGTTGTATTA AAAGCAGATG GAACCTTTAC GATACCAAGT ACACTTGACT GGAGTGGTCA GCCAGATACA TGGACAGGCT CCTATACTGG AAACTCTAAG CTTCATGTGA CAGTGGTTGA TTCTGGTACC GACCTTGGCG TTACAGGATC TCTTGCAAAT GCTTTGTTAT ATTATAGTAA AGCTGCAAAC GATGTAGCTG CAAAGAACTT AGCAAAAGAA TTATTAGATC GCGTATGGAA ATTATATCGT GATGATAAGG GCGTTGCTGC TCCTGAAGCA CGTGCAGATT ATAAGAGATT CTTTGAGCAG ACCGTTTATG TTCCAAGTAC CTTTAATGGT 
AAGATGCCAA ATGGTGATGT GATTAAGTCT GGTATTAAAT TCTTAGATAT CCGTTCTAAG TATTTACAAG ACCCATCTTA TCCAAAGTTA CTGGCAGCAT ATCAGAGTAA TAAGTCACCA GAGTTTATAT ATCATAGATT CTGGGCTCAG TGTGATGTAG CGCTTGCAAA TGGTGTATAT GCACTCCTTT ATGAGAATGG TTCTGGCACT ACTGATTATG CTAATATTAA TCCTACAAAT GGTTCCTTTG ATAAAGCTGT AGGAAAACAA GCAGATCTTA GTACAACATT ATCAATGCAA GGTTATACCT TTGTTAACTT AAGTAAAGGT ACTACACCTC TGACTTTAAA TACAGACTAT ACTGTTAATG GTACTACAGT AGTCCTTAAG AAAGAATTCC TATCTACATT ACCACTTGGT GATACTACAA TTACTTTTAA TTTTAGCAAT TCTTATACAA AACCTTTTGT AGTAACCGTT GTGGATACAA CAGTAGTTGT GGTAGTAGGA GATGTTAAGG TTCAGATGTT CAATGGAAAT ACTAGTGCAA CAACGAATGG AATTGCACCT CGTTTTTATC TTGTAAATAC AGGCTCTAAT AGTATTAATC TTTCTGATGT AAAGCTTCGT TACTACTATA CAATTGATGG TGAGAAGAGC CAGAGTTTCT GGTGTGATTG GTCATCGATT GGAAGCAGTA ATGTTACCGG AACTTTTGTA AAGATGGCAA CTCCAAAGAC TGGAGCAGAT TATTATCTTG AGATTGGATT TACAAGTGGT GCTGGTTCTT TAAAGGCAGG ACAGGGAATC GAAGTTCAAG GTAGATTCTC AAAAACTGAC TGGTCTAATT ATACCCAGAC TGGGGATTAT TCATTTAATA GCAGCGGTAA CTCCTATGTT GATTGGAACA AGGCAACTGC TTATATAAGT GGAAAACTTA ATTGGGGTAT CGAACCATAA
|
Protein sequence | MQKRIYKRVA AAIMTAAMVV TLVPQGAKTS ALAGETEQAL AAAATRGTYE QRFMDLWSDI KNPKNGYFSP QGIPYHSIET MIVEAPDYGH VTTSEAMSYY MWLEAMYGKF TGDFSGYGTA WNVAEKYMIP TDADQPPTSM SKYTPSKPAT YAPEYQDPSQ YPAKLDSSAP VGSDPIWSQL VAAYGRNTIY GMHWLLDVDN WYGFGSRGDG TSKPSYINTF QRGEQESTWE TIPQPCWDTM KYGGTNGFLD LFTGDSSYAQ QFKYTDAPDA DARAIQAAYW ASEWAKDYGV NVDTYSSKAT MMGDYLRYSM FDKYFRKIGN STVAGTGYDA SHYLLSWYYA WGGGITADWA WVIGSSHNHF GYQNPMAAWV LSQNSKFKPK TTNGQADWAT SLTRQLEFYQ WLQSSEGGIA GGASNSKNGR YETWPAGTAT FYGMGYEANP VYKDPGSNTW FGFQAWSMQR VAEYYYKTND VKAKQILDKW VAWVKSVVVL KADGTFTIPS TLDWSGQPDT WTGSYTGNSK LHVTVVDSGT DLGVTGSLAN ALLYYSKAAN DVAAKNLAKE LLDRVWKLYR DDKGVAAPEA RADYKRFFEQ TVYVPSTFNG KMPNGDVIKS GIKFLDIRSK YLQDPSYPKL LAAYQSNKSP EFIYHRFWAQ CDVALANGVY ALLYENGSGT TDYANINPTN GSFDKAVGKQ ADLSTTLSMQ GYTFVNLSKG TTPLTLNTDY TVNGTTVVLK KEFLSTLPLG DTTITFNFSN SYTKPFVVTV VDTTVVVVVG DVKVQMFNGN TSATTNGIAP RFYLVNTGSN SINLSDVKLR YYYTIDGEKS QSFWCDWSSI GSSNVTGTFV KMATPKTGAD YYLEIGFTSG AGSLKAGQGI EVQGRFSKTD WSNYTQTGDY SFNSSGNSYV DWNKATAYIS GKLNWGIEP
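The gene and protein sequences can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch using only the Python standard library. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the set of permitted start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering used for genetic codes.
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Alternative start codons (e.g. GTG, TTG) are not special-cased here.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGCAAAAAC GTATCTACAA GAGAGTAGCA"))  # MQKRIYKRVA
print(translate("GGTAT CGAACCATAA"))                  # GIEP
```

The two fragments reproduce the first ten and last four residues of the protein sequence shown above. Passing the full gene sequence to the same function allows the whole protein to be compared.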
|
| |