Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_3202 |
Symbol | |
ID | 5741980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 3898410 |
End bp | 3900641 |
Gene Length | 2232 bp |
Protein Length | 743 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641294302 |
Product | cellulase |
Protein accession | YP_001560295 |
Protein GI | 160881327 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000270074 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA AACTGAAACA AAGATGTGCT GTTTTAGTGG CAGTTGCAAC GATGATAGCT TCGTTGCAAT GGGGGAGAGT GCCAGTACAA GCAGTAACAG CAGACGGTCT TACCTCTCAA CAGTATGTTG AGGCAATGGG CGAAGGCTGG AACTTAGGAA ATTCCTTTGA TGGTTTTGAT TCTGATACTT CAAAACCAGA TCAAGGCGAG ACCGCTTGGG GAAATCCTAA GGTTACAAAA GAGCTAATCC ATGCAGTCAA ACAAAAAGGC TATAGTAGTA TCCGCATACC AATGACCCTA TATCGTAGAT ATACGGAGAG CAATGGTGTA TGCACTATCG ATAGCGCATG GATAGCACGT TACAAAGAAG TAGTAGATTA TGCAGTTGCA GAAGGTTTAT ACGTTATGAT AAACATTCAC CATGATTCCT GGATATGGTT ATCTTCATGG GATGGAAATA AGAGTTCTGT GCAATATGTA AGATTTACTC AGATGTGGGA TCAACTTGCG AAGGCATTTA AAGATTATCC GTTACAAGTA TGTTTTGAAA CGATAAATGA GCCGAACTTT CAAAACTCTG GAAACGTTAC TGCACAGAAT AAATTAGATA TGCTTAACCA AGCGGCTTAC AATATAATTC GTGCCTCTGG TGGATCAAAT GCAAAGAGAA TGATTGTTTT ACCATCACTA AATACGAACC ATGATAATAG TGTACCATTA GCTGATTTCA TAACTAAATT GAATGATTCT AATATCATTG CAACCGTTCA TTATTATAGT GAATGGGTAT TTAGTGCTAA CCTTGGTAAG ACAAGCTTTG ATGAAGATTT ATGGGGAAAT GGTGATTACA CTCCTCGTGA TGCGGTAAAT AAGGCGTTTG ATACCATTTC CAATGCATTT ACAGCAAAAA AAATCGGTGT TGTTATCGGA GAATTTGGTC TTTTAGGTTA TGACTCTGAT TTTGAAAATA ATCAACCAGG CGAAGAATTA AAATATTATG AGTATATGAA TTATGTAGCT AGACAAAAGA AAATGTGCCT TATGTTTTGG GATAACGGAT CTGGAATTAA TCGTAACGAC TCTAAGTATA GTTGGAAAAA ACCTATAGTT GGAAAGATGT TAGAAGTATC TATGACAGGA CGTTCCTCTT ATGCAACAGG CCTTGATACC ATTTACCTAA ACGGCAGCTC ATTTAATGAT ATTAATATCC CGCTTACTCT AAACGGTAAC ACCTTTGTTG GAGTTACAGG ATTAACCAGT GGTACCGATT TTACGTATAA CCAATCCAAT GCAACACTAA CATTAAAATC ATCCTACGTG AAGAAGGTTT ATGATGCAAT GGGAAGTAAT TATGGTACGG TAGCTGATTT GGTACTTAAG TTTTCAAGTG GAGCTGATTG GCATGAGTAT TTAGTGAAAT ACAAAGCACC AGTATTTCAA AATGCGAATG GAACTGTTTC CAATGGAATT AATATTCCAG TTCAATTTAA CGGAAGTAAA CTCCGTCGTT CTACAGCTTA TATAGGTTCT AATCGAGTTG GCCCGAATCA AAGCTGGTGG ATGTATTTAG AGTATGGTGC AACTTTTGTG GCGAACTATA CGAACAATAT TTTAACCATT AAGCCTGATT TCTTTAAGGA TGGTTCTGTT TATGATGGAA ATATATCATT TGAGATGGAG TTTTATGATG GACAAAAGTT AAAATATAAT CTTAATAAAT CAAATGGTAA CATAACAGGA ACTGCAGCAG CAGTAACCCC TACACCAACA CCAACGGCGA CACCAACACC AACAGCGACG CCAACACCAA CCGTAACACC AAAACCAACA ATAACCCCAA CAGTAACGCC GACACCAACA GTAACGCCAA AACCAACAAT AACACCGACA GTAACACCAA CTCCTACTCC AATCCCAGGA ACAGGTCCAG TTACATTAAA ATACGAAGTA ACGAATACTT GGGATAAGCA TACACAGGCG AATATTACAT TAACCAATAC CTCTAATACA GCACTAAAGA ATTTTGTTGT ATCATTTACT TATAAAGGGT ATATAGACCA AATGTGGAGT GCAGATTTGG TTAGTCAAAA TTCGGGTACC ATTACAGTGA AGGGACCAGC ATGGGCTACG AATCTAGATC CAGGGCAAAG TATAACATTT GGTTTTATTG CTTCACATGA TACACCGTCT GTTGATCCAC CATCAAATGT TACTTTAGTT AGTTCAAATT AA
|
Protein sequence | MKRKLKQRCA VLVAVATMIA SLQWGRVPVQ AVTADGLTSQ QYVEAMGEGW NLGNSFDGFD SDTSKPDQGE TAWGNPKVTK ELIHAVKQKG YSSIRIPMTL YRRYTESNGV CTIDSAWIAR YKEVVDYAVA EGLYVMINIH HDSWIWLSSW DGNKSSVQYV RFTQMWDQLA KAFKDYPLQV CFETINEPNF QNSGNVTAQN KLDMLNQAAY NIIRASGGSN AKRMIVLPSL NTNHDNSVPL ADFITKLNDS NIIATVHYYS EWVFSANLGK TSFDEDLWGN GDYTPRDAVN KAFDTISNAF TAKKIGVVIG EFGLLGYDSD FENNQPGEEL KYYEYMNYVA RQKKMCLMFW DNGSGINRND SKYSWKKPIV GKMLEVSMTG RSSYATGLDT IYLNGSSFND INIPLTLNGN TFVGVTGLTS GTDFTYNQSN ATLTLKSSYV KKVYDAMGSN YGTVADLVLK FSSGADWHEY LVKYKAPVFQ NANGTVSNGI NIPVQFNGSK LRRSTAYIGS NRVGPNQSWW MYLEYGATFV ANYTNNILTI KPDFFKDGSV YDGNISFEME FYDGQKLKYN LNKSNGNITG TAAAVTPTPT PTATPTPTAT PTPTVTPKPT ITPTVTPTPT VTPKPTITPT VTPTPTPIPG TGPVTLKYEV TNTWDKHTQA NITLTNTSNT ALKNFVVSFT YKGYIDQMWS ADLVSQNSGT ITVKGPAWAT NLDPGQSITF GFIASHDTPS VDPPSNVTLV SSN
|
| |