Gene Cphy_3202 details


Gene Information

Locus tag: Cphy_3202
Symbol:
ID: 5741980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 3898410
End bp: 3900641
Gene Length: 2232 bp
Protein Length: 743 aa
Translation table: 11
GC content: 38%
IMG OID: 641294302
Product: cellulase
Protein accession: YP_001560295
Protein GI: 160881327
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
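
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in one untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3898410
end_bp = 3900641

# Assumption: both coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a single stop codon that is not translated.
codon_count, leftover = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, protein_length, leftover)  # prints: 2232 743 0
```

Under these assumptions the computed values agree with the listed gene length (2232 bp) and protein length (743 aa).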


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000270074
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGAA AACTGAAACA AAGATGTGCT GTTTTAGTGG CAGTTGCAAC GATGATAGCT 
TCGTTGCAAT GGGGGAGAGT GCCAGTACAA GCAGTAACAG CAGACGGTCT TACCTCTCAA
CAGTATGTTG AGGCAATGGG CGAAGGCTGG AACTTAGGAA ATTCCTTTGA TGGTTTTGAT
TCTGATACTT CAAAACCAGA TCAAGGCGAG ACCGCTTGGG GAAATCCTAA GGTTACAAAA
GAGCTAATCC ATGCAGTCAA ACAAAAAGGC TATAGTAGTA TCCGCATACC AATGACCCTA
TATCGTAGAT ATACGGAGAG CAATGGTGTA TGCACTATCG ATAGCGCATG GATAGCACGT
TACAAAGAAG TAGTAGATTA TGCAGTTGCA GAAGGTTTAT ACGTTATGAT AAACATTCAC
CATGATTCCT GGATATGGTT ATCTTCATGG GATGGAAATA AGAGTTCTGT GCAATATGTA
AGATTTACTC AGATGTGGGA TCAACTTGCG AAGGCATTTA AAGATTATCC GTTACAAGTA
TGTTTTGAAA CGATAAATGA GCCGAACTTT CAAAACTCTG GAAACGTTAC TGCACAGAAT
AAATTAGATA TGCTTAACCA AGCGGCTTAC AATATAATTC GTGCCTCTGG TGGATCAAAT
GCAAAGAGAA TGATTGTTTT ACCATCACTA AATACGAACC ATGATAATAG TGTACCATTA
GCTGATTTCA TAACTAAATT GAATGATTCT AATATCATTG CAACCGTTCA TTATTATAGT
GAATGGGTAT TTAGTGCTAA CCTTGGTAAG ACAAGCTTTG ATGAAGATTT ATGGGGAAAT
GGTGATTACA CTCCTCGTGA TGCGGTAAAT AAGGCGTTTG ATACCATTTC CAATGCATTT
ACAGCAAAAA AAATCGGTGT TGTTATCGGA GAATTTGGTC TTTTAGGTTA TGACTCTGAT
TTTGAAAATA ATCAACCAGG CGAAGAATTA AAATATTATG AGTATATGAA TTATGTAGCT
AGACAAAAGA AAATGTGCCT TATGTTTTGG GATAACGGAT CTGGAATTAA TCGTAACGAC
TCTAAGTATA GTTGGAAAAA ACCTATAGTT GGAAAGATGT TAGAAGTATC TATGACAGGA
CGTTCCTCTT ATGCAACAGG CCTTGATACC ATTTACCTAA ACGGCAGCTC ATTTAATGAT
ATTAATATCC CGCTTACTCT AAACGGTAAC ACCTTTGTTG GAGTTACAGG ATTAACCAGT
GGTACCGATT TTACGTATAA CCAATCCAAT GCAACACTAA CATTAAAATC ATCCTACGTG
AAGAAGGTTT ATGATGCAAT GGGAAGTAAT TATGGTACGG TAGCTGATTT GGTACTTAAG
TTTTCAAGTG GAGCTGATTG GCATGAGTAT TTAGTGAAAT ACAAAGCACC AGTATTTCAA
AATGCGAATG GAACTGTTTC CAATGGAATT AATATTCCAG TTCAATTTAA CGGAAGTAAA
CTCCGTCGTT CTACAGCTTA TATAGGTTCT AATCGAGTTG GCCCGAATCA AAGCTGGTGG
ATGTATTTAG AGTATGGTGC AACTTTTGTG GCGAACTATA CGAACAATAT TTTAACCATT
AAGCCTGATT TCTTTAAGGA TGGTTCTGTT TATGATGGAA ATATATCATT TGAGATGGAG
TTTTATGATG GACAAAAGTT AAAATATAAT CTTAATAAAT CAAATGGTAA CATAACAGGA
ACTGCAGCAG CAGTAACCCC TACACCAACA CCAACGGCGA CACCAACACC AACAGCGACG
CCAACACCAA CCGTAACACC AAAACCAACA ATAACCCCAA CAGTAACGCC GACACCAACA
GTAACGCCAA AACCAACAAT AACACCGACA GTAACACCAA CTCCTACTCC AATCCCAGGA
ACAGGTCCAG TTACATTAAA ATACGAAGTA ACGAATACTT GGGATAAGCA TACACAGGCG
AATATTACAT TAACCAATAC CTCTAATACA GCACTAAAGA ATTTTGTTGT ATCATTTACT
TATAAAGGGT ATATAGACCA AATGTGGAGT GCAGATTTGG TTAGTCAAAA TTCGGGTACC
ATTACAGTGA AGGGACCAGC ATGGGCTACG AATCTAGATC CAGGGCAAAG TATAACATTT
GGTTTTATTG CTTCACATGA TACACCGTCT GTTGATCCAC CATCAAATGT TACTTTAGTT
AGTTCAAATT AA
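
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be flattened before it can be measured. The sketch below shows one way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not state how its own GC figure was calculated or rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here as a short excerpt.
first_line = "ATGAAAAGAA AACTGAAACA AAGATGTGCT GTTTTAGTGG CAGTTGCAAC GATGATAGCT"
excerpt = clean_sequence(first_line)
print(len(excerpt))                    # six blocks of 10 bases
print(round(gc_percent(excerpt), 1))   # GC percentage of the excerpt only
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the listed GC content.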
 
Protein sequence
MKRKLKQRCA VLVAVATMIA SLQWGRVPVQ AVTADGLTSQ QYVEAMGEGW NLGNSFDGFD 
SDTSKPDQGE TAWGNPKVTK ELIHAVKQKG YSSIRIPMTL YRRYTESNGV CTIDSAWIAR
YKEVVDYAVA EGLYVMINIH HDSWIWLSSW DGNKSSVQYV RFTQMWDQLA KAFKDYPLQV
CFETINEPNF QNSGNVTAQN KLDMLNQAAY NIIRASGGSN AKRMIVLPSL NTNHDNSVPL
ADFITKLNDS NIIATVHYYS EWVFSANLGK TSFDEDLWGN GDYTPRDAVN KAFDTISNAF
TAKKIGVVIG EFGLLGYDSD FENNQPGEEL KYYEYMNYVA RQKKMCLMFW DNGSGINRND
SKYSWKKPIV GKMLEVSMTG RSSYATGLDT IYLNGSSFND INIPLTLNGN TFVGVTGLTS
GTDFTYNQSN ATLTLKSSYV KKVYDAMGSN YGTVADLVLK FSSGADWHEY LVKYKAPVFQ
NANGTVSNGI NIPVQFNGSK LRRSTAYIGS NRVGPNQSWW MYLEYGATFV ANYTNNILTI
KPDFFKDGSV YDGNISFEME FYDGQKLKYN LNKSNGNITG TAAAVTPTPT PTATPTPTAT
PTPTVTPKPT ITPTVTPTPT VTPKPTITPT VTPTPTPIPG TGPVTLKYEV TNTWDKHTQA
NITLTNTSNT ALKNFVVSFT YKGYIDQMWS ADLVSQNSGT ITVKGPAWAT NLDPGQSITF
GFIASHDTPS VDPPSNVTLV SSN