Gene Cphy_3367 details


Gene Information

Locus tag: Cphy_3367
Symbol:
ID: 5741649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 4103996
End bp: 4106953
Gene Length: 2958 bp
Protein Length: 985 aa
Translation table: 11
GC content: 37%
IMG OID: 641294470
Product: cellulose 1,4-beta-cellobiosidase
Protein accession: YP_001560459
Protein GI: 160881491
COG category:
COG ID:
TIGRFAM ID:
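
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(4103996, 4106953)
print(length, protein_length(length))  # prints: 2958 985
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.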


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGA TAATAAGTCT TTTATTAGTG ATAACACTTC TGATATCCAT GGCACCATCG 
AAAGCTGACG CAGCGGAAAC CAATTATAAT TACGGAGAAG CTCTTCAAAA ATCAATCATG
TTTTATGAGT TTCAACGTTC TGGTAAACTG CCAAGTACCA TTCGGAATAA TTGGAGAGGT
GACTCTGGTT TAACCGATGG AGCAGATGTT GGTTTGGATC TAACTGGTGG CTGGTATGAT
GCTGGTGATC ATGTAAAATT TAATCTTCCT TTGGCTTATA CTGTAACAAT GTTAGCATGG
GCAGTATATG AAGAAGAGGC TACTCTTTCA AAGGCAGGCC AATTAAGTTA TTTATTAGAT
GAAATTAAGT GGTCTAGTGA TTACCTAATT AAATGTCATC CACAAGCAAA TGTATTTTAT
TATCAGGTTG GTAATGGAAA TACAGATCAC TCTTGGTGGG GACCTGCTGA AGTTATGCAG
ATGGCTAGAC CGTCCTATAA GGTTGATTTA AATAACCCAG GTTCTACTGT AGTAGGAGAA
GCAGCAGCAG CTCTTGCAGC AACAGCACTT ATATATAAGA CAAAAGACCC TACTTATTCA
GCAACTTGCC TTCGTCATGC AAAAGAGCTT TTTAATTTTG CAGATACAAC AAAAAGCGAT
GCTGGATATA CAGCAGCAAG TGGGTTCTAT ACTTCCTATA GTGGATTTTA TGATGAATTA
TCCTGGGCAG CTACATGGAT TTACCTTGCA AGTGGAGAAG CGACCTATTT GGATAAGGCA
GAATCTTATG TAGCCAAATG GGGAACAGAA CCTCAATCTT CCACATTAAG TTATAAGTGG
GCACAAAACT GGGATGATGT TCACTATGGT GCAGCTTTAT TATTAGCAAG AATTACAAAT
AAAGCAATTT ATAAGAACAA TATTGAAATG CATCTTGACT ATTGGACTAC AGGGTATAAT
GGTAGTCGTA TTACTTATAC ACCAAAAGGA CTTGCTTGGT TAGATTCCTG GGGTGCATTA
AGATATGCGA CGACAACAGC ATTTCTAGCA AGTGTTTATG CTGATTGGAG CGGATGTAGT
GCTGGAAAAG TTAGTACTTA CAATGCATTT GCGAAACAGC AGGTAGATTA TGCATTAGGA
AGTACCGGAA GAAGTTTTGT GGTTGGATAT GGTGTAAATT CTCCAACAAG ACCTCATCAT
AGAACTGCTC ATAGTTCATG GGCAGACAGT CAGACGGAGC CAAATTACCA TAGACACACC
ATTTATGGTG CTTTAGTAGG TGGACCTGGT AATAATGATA GTTATGAGGA TAACATTAAT
AATTATGTAA ACAATGAAAT CGCTTGTGAC TATAATGCAG GTTTTGTTGG CGCATTGGCT
AAAGTTTATA AAACATATGG CGGAACACCA ATTGCAAACT TTAAGGCAAT CGAAACAGTA
ACAAACGATG AGTTATTTAT TCAAGCTGGT ATTAATGCCT CTGGTCCATC TTTTATCGAA
GTAAAGGCAT TGGTTTTCAA TGAGACAGGT TGGCCAGCTC GTGTTACCGA TAAATTATCC
TTTAAGTATT TTATTGATAT CTCGGAATAT GTAGCAAAGG GATATACAAA GAATGATTTT
ACGGTATCGA CAAATTATAA CAATGGAGCA ACCACATCGG CATTGCTTCC TTGGGATGCT
GCGAATAATA TCTATTATGT GAATGTAGAC TTCTCTGGAA CTAAGATTTA TCCTGGTGGA
CAGTCTGCAT ATAAGAAAGA AGTACAATTT AGAATTGCTG GTCCACAAAA CGTTAATATA
TGGGACAATT CCAATGACTA CTCCTTTACA CAAATTGCTA ATGTTAGTTC AGGAAATACC
GTAAAGACCA CATATATACC ATTGTATGAT AATGGTAAAT TAGTATTTGG TAATGAGCCA
AAGACGGGTG TTCCTTCTGC AAGTCTTGAT AAGACTACAG CAAACTTTGA CAAAAACCCA
GCTGTATCCG CAGATATACC AGTAACCATT AACTATAATG GTAATACATT AACAGCGGTT
AAGAATGGAA CAACGGTTTT AACGAAAGGT ACTGATTATA CTGTATCTGG TAATGTAGTA
ACGTTATCTA AGAATTATTT CTTAGCACAG AGCGCTAGTA CGGTTACTTT AACATTTGTA
TTTAGTGGCG GTAACGATGC AACATTAAAA GTGACTTTAG TAGATACTTC TCCAAGTGCA
TCCATTAATC CAAATTCTGC TGTCTTTGAT AAGGCTAGCG GAAAACAGGA AAATATAGTT
ATTACGCTTA CACCAAATGG CAATACCTTA GCTGGACTTA AGAATGGGTC TAAGAGCCTG
GTAACTGGAA CTGATTATAC CGTTTCCGGA ACAACAGTGA CGATTCTATC TTCTTATTTA
AGTCAATTTG CAGTAGGAAG TCAATCTATT GTATTTGAAA TGAATAAAGG GACAAATCCA
GTCTTAGCAG TTACCATTAA GGATTCTTCT GTTGTTACTC CAACAGGAAA TATTAAACTT
CAAATGTTTA ATGGAAATTC TTCTGCAACA ACGAATGGCA TTGCACCAAG AATTAAATTA
ATTAACACCG GAACTACTGC AATCAACTTA TCCGATGTTA AGATTCGCTA TTATTATACA
ATCAATGGCG AAAAGGATCA GGCATTCTGG TGTGATTATT CGACGATTGG TAGTTCCAAT
GTAAATGGTA CTTTCGTAAA GATGAGTACA CCAAAAACAA ATGCAGATTA CTATCTAGAA
TTTTCATTTA AGTCCGCTGC CGGAACTTTA AACGCAGGGC AAAGTATTGA AGTTCAAGGA
AGATTTTCTA AGGTAGACTG GACAAACTAT ACACAAACAG ATGATTATTC GTTTGGTGAT
AGTAACTCAA GTTATGCTGA TTGGAATAAG ACAACAGTAT ATATCTCTGA TGTTTTGGTT
TGGGGAGTCG AACCATAA
 
Protein sequence
MKKIISLLLV ITLLISMAPS KADAAETNYN YGEALQKSIM FYEFQRSGKL PSTIRNNWRG 
DSGLTDGADV GLDLTGGWYD AGDHVKFNLP LAYTVTMLAW AVYEEEATLS KAGQLSYLLD
EIKWSSDYLI KCHPQANVFY YQVGNGNTDH SWWGPAEVMQ MARPSYKVDL NNPGSTVVGE
AAAALAATAL IYKTKDPTYS ATCLRHAKEL FNFADTTKSD AGYTAASGFY TSYSGFYDEL
SWAATWIYLA SGEATYLDKA ESYVAKWGTE PQSSTLSYKW AQNWDDVHYG AALLLARITN
KAIYKNNIEM HLDYWTTGYN GSRITYTPKG LAWLDSWGAL RYATTTAFLA SVYADWSGCS
AGKVSTYNAF AKQQVDYALG STGRSFVVGY GVNSPTRPHH RTAHSSWADS QTEPNYHRHT
IYGALVGGPG NNDSYEDNIN NYVNNEIACD YNAGFVGALA KVYKTYGGTP IANFKAIETV
TNDELFIQAG INASGPSFIE VKALVFNETG WPARVTDKLS FKYFIDISEY VAKGYTKNDF
TVSTNYNNGA TTSALLPWDA ANNIYYVNVD FSGTKIYPGG QSAYKKEVQF RIAGPQNVNI
WDNSNDYSFT QIANVSSGNT VKTTYIPLYD NGKLVFGNEP KTGVPSASLD KTTANFDKNP
AVSADIPVTI NYNGNTLTAV KNGTTVLTKG TDYTVSGNVV TLSKNYFLAQ SASTVTLTFV
FSGGNDATLK VTLVDTSPSA SINPNSAVFD KASGKQENIV ITLTPNGNTL AGLKNGSKSL
VTGTDYTVSG TTVTILSSYL SQFAVGSQSI VFEMNKGTNP VLAVTIKDSS VVTPTGNIKL
QMFNGNSSAT TNGIAPRIKL INTGTTAINL SDVKIRYYYT INGEKDQAFW CDYSTIGSSN
VNGTFVKMST PKTNADYYLE FSFKSAAGTL NAGQSIEVQG RFSKVDWTNY TQTDDYSFGD
SNSSYADWNK TTVYISDVLV WGVEP
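
The protein sequence corresponds to translating the gene sequence under translation table 11. A minimal sketch of that translation follows, applied only to the first and last lines of the gene sequence. It builds the codon table from the NCBI amino acid string, which table 11 shares with the standard code (the two tables differ in their permitted start codons, which this sketch does not model). The helper name `translate` is hypothetical.

```python
from itertools import product

# NCBI codon ordering: first base varies slowest, bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base grouping spaces
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGAAAAAGA TAATAAGTCT TTTATTAGTG ATAACACTTC TGATATCCAT GGCACCATCG"
last_line = "TGGGGAGTCG AACCATAA"
print(translate(first_line))  # prints: MKKIISLLLVITLLISMAPS
print(translate(last_line))   # prints: WGVEP
```

Both results match the start and end of the protein sequence listed above. The last line can be translated on its own because the 2940 bases before it are a multiple of three, so it begins in frame.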