Gene Cphy_3329 details


Gene Information

Locus tag: Cphy_3329
Symbol:
ID: 5741609
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 4051097
End bp: 4054015
Gene Length: 2919 bp
Protein Length: 972 aa
Translation table: 11
GC content: 37%
IMG OID: 641294430
Product: glycoside hydrolase family 3 protein
Protein accession: YP_001560421
Protein GI: 160881453
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
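
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4051097, 4054015)
print(length)                     # 2919
print(protein_length_aa(length))  # 972
```

Both results match the Gene Length and Protein Length fields of this record.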


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTAATG CATCATATCT TGAAACACCA TTATGGGATA ATAGTTTACC TCTTAAGAAA 
AGGTTAGATT ATCTCGTTGA AAACCTGACA TTGGAAGAAA AATTTGAGTT TCTAGGGACA
GGATGTCCGA CGATTGAACG CCTTGGAATA CAAAGTACCT TTCATGGAGG AGAGGCTGCG
CATGGTATAG AAGCAAGACA CGACCAGTCA TTTAATAAAG GAGAGCCGGA ACCAACGACA
ATATTTCCGC AACCGATTGG AATGAGCTCT ACTTGGGATA CGACACTTTT AACTAAGATT
GGCGTAACAG TTGGTAATGA AGCTAGAGTT TTATATCAAC GTCACAAAAA TGGAGGTTTG
TGCCGTTGGG CGCCAACGAT AGACATGGAA CGTGATCCAC GTTGGGGACG AACAGAAGAA
GCTTATGGGG AGGACCCCTA TCTTACAGGG AAGATGGCTT CTGCATATAT TCAAGGTATG
CGTGGAGTTG ATCCGTTTTA TATTCGTTGC GGTGCGACCT TAAAGCATTT CTACGCTAAT
AATACAGAAA AGGATAGAAT TTTTTCATCC TCTTCTATCG ACCCTAGAAA TAAACATGAG
TATTATTTAG AACCTTTCAA ACGTGCAATT ACAGAGGGGA AAGCCGAAGC TATTATGACG
GCATACAACG AAATTAATGG AGTCCCTTGT ATTGTAAATA ATGAAGTAAA AAACATTGTA
AAAGAACAGT GGGGGCTTCG TGGGCATGTT GTATGTGATG GTGGTGACAT GATGCAAACA
GTTAGTGATC ATAAATATTT TGGCTCCCAT GCAGAAACCA TTGCCTATGG GTTAAAAGCA
GGTATTGATT GTTTCACTGA TAATCGTGAA GTGGTTAAAC AAGCAGCCAA AGAAGCTTAT
CAAGCCGGGT TGATTACAGA GGCAGATTTG GATACCTCGA TTTGTAATTC ATTTTCAACC
AGAATCCGTC TTGGTTATTT TGATGCGATA GGACAAAACC CATATGCACA TATAACCGAA
AAATATATTA ATAGTGAAGA GAATAAATCT TTAACGCTAG AAGCTGCAAC AAAAGCCATG
GTATTATTAA AGAATGAAGG ACAGATATTG CCACTTACTA AAGAAAATTC ATCCTTTTGT
GTGATCGGTC CATTATCCGA TGTTTGGTAT AAAGATTGGT ATAGTGGGAT ACCACCATAT
TCCGTAACAC CATTACAAGG TATCAAAGAG TACAACAAAG GGACTCGTAA GAATCTTACG
AATTCTAAGG TGGTGGATGG ATTACCTCGC GTTAGAATTC GATATCAAGA GAAATATCTT
TGTGTAACAG AGGAAGGTTT TGTCACACTA GGGAGTAAAG AATTAGCAGA GACATTTACT
ATCACAGACT GGGGTAACGG AAATCTGACC ATAGCGACTA GTCAAGGGAA ATACCTGTCA
GCAGATGAGG AAGAAGGACT TATTACTGCA ACTAAGACGG AAGTATTTGA ATGGTTTGTA
AAAGAAGCAT TTGATTTCCA TCTTGTGGGT AATTTAGGCG AAAAAAGTTT TCAAACTATA
TTTGAAATTC TAAGTGAACA AAAAAGACAG GAAATATCCG TAACAATAGA TACTTGGAAG
GATGATTATG TAGCGATTAA TAAGGAAGGA AAGCTTTCAT TTTCTAAGGA AGAAAAAGTT
ACATTGGATC TTGAACTTGT ATCCGATGGT ATCGAAGAAG CGAAAGAATA TACAAAGAAT
AGTGATTATG CTATTCTAGT AATGGGTTGT AATCCTGTTA TTAACAGTAA AGAAGAAATT
GATCGGAATG ATTTAGATCT ACCGCCATAT CAAGAAAGAT TAATAAAGCA AGTACATAAA
GTAAATCCTA AGGTAATTCT TGTACTTATA ACAAATTATC CATATGCAAT ACGCTGGGAA
AAGGAACATA TTCCGGCAAT TATAACGACT ACATCTGGAA GTCAAGAACT TGGTAATGCG
ATTGCTGCTG TGTTATTTGG TGATGTATCA CCATCGGGTA GGTTGCCAAT GACTTGGTAT
CTCGATACGA AGGATTTACC ACCGATAGAG GATTATGATA TTATTCGCGG TAACCGAACT
TACCAGTATT TTAACAAAGA GGTGTTATAC CCATTTGGAC ATGGTCTTAC CTATACAACT
ATGCAATATC AAAAGCTTAC AGTGCAATTA GAGGATTTTA CTAATCTATT AATTAAAGTA
ACTATTGCAA ATACAGGAAA TAGAATTAGT GATGAAGTGG TGCAGGTTTA TGTGAGGCAG
GAAGTCTCAA GAACTGTTAG ACCTCGTTTG CAGCTAAAAG CTTTTGAGAG AGTAAAGAAT
ATCTTACCTG GAGAGAAGAG GGAGATAGAA TTTATAATTT CTACCAGTGA TTTAACCTAT
TATGATGTAG TAAATGGTGG TATGATATTA GAAGAATCTG AGTATACAAT CTTAGTTGGG
GCATCTTCTG AGGACATTAG ACTACGCGAA ACAGCTTTCA TTCCTGGAGT GAAAGTGGGA
TCTCGAAATC TTAGGAAAAA GATTTTAGCG GATCATTATG ACGACTGCAG GAATTCGTAC
CTACACCGTG GAAGCTTAGG TGATACTGCG GTTGTTGTTA AGAATAAAAC TGAAGCAGCG
ATACTTCTTT ACCGAGATGT TAGAATAGAA GAAAAGCCAA TTAAATTTCA TTCTACCGTT
CAATGTGTGG GTGAAGGAAG TCTGATAGTT TCATACATTA AGCAAAGTGA TGTAGCGAGT
TCCGAGGTAA AACTAGGTGA TATTACTTTA GAGAATCAAG ATAAGTTTTG TGATGTTATC
ATACCAATTA ACTGGGATAG AATAACATGC AATGAAGTCA TTACATTAAA AATTACATTG
TTAGAAGAGA TGAAATTATC CTCTTTCTAT GCAGAATGA
 
Protein sequence
MGNASYLETP LWDNSLPLKK RLDYLVENLT LEEKFEFLGT GCPTIERLGI QSTFHGGEAA 
HGIEARHDQS FNKGEPEPTT IFPQPIGMSS TWDTTLLTKI GVTVGNEARV LYQRHKNGGL
CRWAPTIDME RDPRWGRTEE AYGEDPYLTG KMASAYIQGM RGVDPFYIRC GATLKHFYAN
NTEKDRIFSS SSIDPRNKHE YYLEPFKRAI TEGKAEAIMT AYNEINGVPC IVNNEVKNIV
KEQWGLRGHV VCDGGDMMQT VSDHKYFGSH AETIAYGLKA GIDCFTDNRE VVKQAAKEAY
QAGLITEADL DTSICNSFST RIRLGYFDAI GQNPYAHITE KYINSEENKS LTLEAATKAM
VLLKNEGQIL PLTKENSSFC VIGPLSDVWY KDWYSGIPPY SVTPLQGIKE YNKGTRKNLT
NSKVVDGLPR VRIRYQEKYL CVTEEGFVTL GSKELAETFT ITDWGNGNLT IATSQGKYLS
ADEEEGLITA TKTEVFEWFV KEAFDFHLVG NLGEKSFQTI FEILSEQKRQ EISVTIDTWK
DDYVAINKEG KLSFSKEEKV TLDLELVSDG IEEAKEYTKN SDYAILVMGC NPVINSKEEI
DRNDLDLPPY QERLIKQVHK VNPKVILVLI TNYPYAIRWE KEHIPAIITT TSGSQELGNA
IAAVLFGDVS PSGRLPMTWY LDTKDLPPIE DYDIIRGNRT YQYFNKEVLY PFGHGLTYTT
MQYQKLTVQL EDFTNLLIKV TIANTGNRIS DEVVQVYVRQ EVSRTVRPRL QLKAFERVKN
ILPGEKREIE FIISTSDLTY YDVVNGGMIL EESEYTILVG ASSEDIRLRE TAFIPGVKVG
SRNLRKKILA DHYDDCRNSY LHRGSLGDTA VVVKNKTEAA ILLYRDVRIE EKPIKFHSTV
QCVGEGSLIV SYIKQSDVAS SEVKLGDITL ENQDKFCDVI IPINWDRITC NEVITLKITL
LEEMKLSSFY AE
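
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch. It is not the database's own pipeline. It assumes the sequence is given 5' to 3' on the coding strand, and it builds the codon table from the standard NCBI ordering, whose amino acid assignments are shared by translation table 11 (the two tables differ only in the permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGGTAATG CATCATATCT TGAAACACCA"))  # MGNASYLETP
# Last nine bases of the gene sequence above.
print(translate("GCAGAATGA"))  # AE*
```

The first output matches the opening block of the protein sequence, and the second shows the final two residues followed by the stop codon. Passing the full gene sequence to `gc_fraction` would give the value that the GC content field reports as a rounded percentage.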