Gene Cphy_3081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3081 
Symbol 
ID5743167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3767612 
End bp3771061 
Gene Length3450 bp 
Protein Length1149 aa 
Translation table11 
GC content40% 
IMG OID641294182 
Productglycoside hydrolase family protein 
Protein accessionYP_001560177 
Protein GI160881209 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTATTT TAGATGGATT ATGGAAGTTT AGAATAGACC CTCAAAATGA AGGGTTTAAA 
AATCATTGGG AAAGAGATAA CTTTAAGGAT TGCATTCGTA TTCCTGGCGT ATTGCAGGAA
CAAGGATATG GAGAAGAGGT ATCCCGTCAG ACAGAATGGG TGGAAGGGTT AAGTGATCCC
TTATGGTATC TAAGAGAAGA GTATCAATAT GAAAATGAAG ACAAGGTCAG AATACCTTTT
TTAAGCCAGC CGCCTAAAGT ATATCGTGGA AAGGCATGGT ATCAGATGAC TATTAAGTCA
GAAGACATCA CGGAGGGGAA ATGGTATTCC TTATACATAG AAAATACCAG ATGGAGAAGC
CACGTATGGT GTGATGATAC GTACCTTGGA GAGGATTTTA GCCTGTGTAC TCCTCATGAA
ATTTCCATTG GTTTTTTAAA AAAGAAAGAG TATAAACTAA CCATTTGCTT AGATAATAGT
ATGCAACTTC CATATCGTCC CGATGGGCAT GGCGTTTCTG ATGCGTTTTC GGCAGGCTGG
AATGGGATGG TTGGAAGGGT TGAATTACAT AGCCATTTGG CAGTTGCGAT CAAAGAGGTG
AGAGTATTTT CGGATATTGA CCGAAAGAGA GCCACCTTTG AAGTAACGAT ACAAAACTAT
ACCAAAGAAA CAGTTAAGAC ATCGCTTGTA ATAGGGAGTG AAAACGAGAG AAATTTTCAA
ACTTCACTAG ACATAAAACA AGAACTTACA GTTATGTGTC AGGAAGGGGT TACGCTTGTA
ACTTTGATGA AGGAATATGA CCAAGATACA AAGCTTTGGG ATGAGTTTAG TGCAAACCTT
TTGAAAGAAT CGATAGCCTT AACGAGTGAT TATGGTACGG AAACAGAAGA AGTAACATTC
GGTTTTATAC AGGTGAAATC GGAAAATGGG CTTTTTCTTG TGAATCATCG CCCAACGTAT
TTTAGAGGAA CACATTTTGG AGGTGATTAT CCTCTGACTG GATATCCGGA TTGCAGTATT
AAATTCTGGC AGAAAATTAT GAAGGTAATT AAAGAATGGG GGCTAAACTT TATGCGCTTT
CATTCTTATT GTCCACCGGA AGCTGCATTT GTGGCAGCTG ATGAGGAAGG AATATATCTT
CAAGTGGAAT GTGGAATGTG GAATAGTTTT GAGCCAGGAG ATCCGATGCT TGAGGTAGCC
AAAATGGAAG CGGATAGAAT CCTTCGAACC TTTGGGAATC ATCCGTCTTT TGTGATGCTT
TCTCCATCCA ATGAACCAAT GGGAGAGTGG TTAGGGCCTT TGACTGAGTT TGTCGCGTAT
TGTAAAGAAA TCGATTCTAG AAAATTATAT ACGATCCAAT CTGGCTGGCC ATTTCCAATC
GAGCCCGATC AGGTGACGGG AACAGATTAT CTTTACTTCC ATCGTTCAGG CTTTGGTATA
GAACCGGGAG GTACCATAAG AAATTCTGCG GGATGGAAAG GGTCGGATTA TAGAGAGTCT
CTTAAGAATG TAACGCTTCC GGCTATCTGT CATGAACTAG GACAATGGTG TTCTTATCCT
GATTTTTCGA TTATTGATAA GTTTACTGGA TACCTATCCC CAAGTAACTT TGAAGTATTT
AGAGGTTCTG CAAAATCTCA TGGCGTGGAA CAGTACTCCA AAGAGTTTCA CTATAACTCT
GGAAAACTAC AAAGTTTGAT GTATAAAGAA GAGATTGAAG CGAACCTTAG AACCCCTCAT
CTTTATGGGT TTGAATTGTT AGATTTACAT GACTATCTTG GTCAGGGAAC TGCATTGGTT
GGTGTACTGG ATGCTTTTTG GGAGGAGAAA GGGTATATCA GTCCGAAGGA ATGGAAAAGG
TTTTGTTCTA AGACAGTACC GCTTGCTAGG ATTCCTAAGT ATGTTTATGA AGCAGGTGAG
ACGGTCACAG CTTCTGTTGA AATCAGCCAT TTTGGAGAAG CAAAACTTTC TCAAGCTACG
GTATCCTATG AACTAAGAGA TGGGACTACG ACTATAAAGA AAAGTATCCT AGGTAGAATG
GATATTCCAA TCGGAAAGAA TATTGAGGCA GGTACTATAA GGCTAGCACT GCCAAATAAT
CAGGAGTCTA CAGTTTATGA GCTTTTTGTT TGCATCGAGA CAAAAGAAGA ATGTTTTATT
AATAGTTGGA AGTTATGGAG TTATCAAAAT AATCACAAGG AATCCGAAGA ATTTATAGAA
GTCAAGAAAT CCAAAGAATT TATAGAAAAC AAGGAAGCCA AGGAGTTCAA GGAATCCAAA
GAATTCAAGG AATCCAAAGA ATCCAAGGAA TTCAAGGAAT TCAAGGAATC CAAAGAATCC
AAAGAATTAA AAGAATATTG CAGAGTTTTT TATACAAAAG ACTGGGAAGC AGCAGAGAAA
GCATTGGAAG AAGGGAAAAA AGTTTTATTT AATCCGAGAA TAGATTCCCT AAGTTATGAT
TGCCCCAAAT TACAGTTCAA ACCTATCTTT TGGAATGCAC AGATGGGCCC AACTTGGGCG
AGAGGCCTAA GCATTATGTG CGATCCTGCT CATCCGGTAT TTCGTCAGTT TCCAACAGAG
TCTTATGCAG AATGGCAGTG GGAGGATATC ATAGATGGGG CACGTGGAAT CAATCTTAGT
TGTTTTGGTG GAGAACTTAC TCCTATCGTT CGTGTAATAG ATGACTGGAA TCGAAACTAT
CCACTGGCTC TCATGCTAGA AGCAAAAGTT AGAAATGGAG GTCTGTTTAT GACTACCGTG
GATTTTAGAA CAGAAGGTAG GGCAGGAAAA TCACCGGCAG CTTCTGCTCT CATGCAAAGT
ATCTTTTGTT ATATGGAGAG TGAGGCGTTT GCCCCGAAAG AAAACGTATC TTTCGAGGAT
ATAAAAACCA TCTATCATGA AAATCATACA ATGGAACTGC TGGAGGTTTC AGTTGTACTG
GAAGAAGAGG AAACGAAGGA TATCGAGGTG ATATGCCATG GTAATGCAGA AGATTTTATG
TTAATTAAAG GTTACCCTGC TCACATACGC TTAGAAATGC CCAAAGCACA TTCATTCTAC
GGAATCTGCT ATGTCCCAAG ACAGAATCAC AGGGAACACG AGGGAGATAT CAAAGAATAC
CAAGTGGAAT ATCGAAAGGA AGGACATTGG GAGTTTTTGT GCGAAGGAGA ATTGAAAAGT
TCATTTTCAC CAAAGACTAT AACCTTTCCT AAAACAGTGA CGACAGATGC GATCCGTTTT
ATTGCCATTT CAGGATTTGG AGGAGAACAT GTAACGAAGT GGGATCTTGA AAAAGATGGT
TGGCATCAAA TGACCGGAGC GTATTCCGAT CCATATGTGT CGATTGCCTG CCTCACTCTT
TTAACGGAGG AAGGACTTGG AATCTGGAGA GGTGAAGTTA ATAATCAAAC AGTAGATTTT
AAGAGTGCAA CTACTGATAT AGAAAACTAA
 
Protein sequence
MIILDGLWKF RIDPQNEGFK NHWERDNFKD CIRIPGVLQE QGYGEEVSRQ TEWVEGLSDP 
LWYLREEYQY ENEDKVRIPF LSQPPKVYRG KAWYQMTIKS EDITEGKWYS LYIENTRWRS
HVWCDDTYLG EDFSLCTPHE ISIGFLKKKE YKLTICLDNS MQLPYRPDGH GVSDAFSAGW
NGMVGRVELH SHLAVAIKEV RVFSDIDRKR ATFEVTIQNY TKETVKTSLV IGSENERNFQ
TSLDIKQELT VMCQEGVTLV TLMKEYDQDT KLWDEFSANL LKESIALTSD YGTETEEVTF
GFIQVKSENG LFLVNHRPTY FRGTHFGGDY PLTGYPDCSI KFWQKIMKVI KEWGLNFMRF
HSYCPPEAAF VAADEEGIYL QVECGMWNSF EPGDPMLEVA KMEADRILRT FGNHPSFVML
SPSNEPMGEW LGPLTEFVAY CKEIDSRKLY TIQSGWPFPI EPDQVTGTDY LYFHRSGFGI
EPGGTIRNSA GWKGSDYRES LKNVTLPAIC HELGQWCSYP DFSIIDKFTG YLSPSNFEVF
RGSAKSHGVE QYSKEFHYNS GKLQSLMYKE EIEANLRTPH LYGFELLDLH DYLGQGTALV
GVLDAFWEEK GYISPKEWKR FCSKTVPLAR IPKYVYEAGE TVTASVEISH FGEAKLSQAT
VSYELRDGTT TIKKSILGRM DIPIGKNIEA GTIRLALPNN QESTVYELFV CIETKEECFI
NSWKLWSYQN NHKESEEFIE VKKSKEFIEN KEAKEFKESK EFKESKESKE FKEFKESKES
KELKEYCRVF YTKDWEAAEK ALEEGKKVLF NPRIDSLSYD CPKLQFKPIF WNAQMGPTWA
RGLSIMCDPA HPVFRQFPTE SYAEWQWEDI IDGARGINLS CFGGELTPIV RVIDDWNRNY
PLALMLEAKV RNGGLFMTTV DFRTEGRAGK SPAASALMQS IFCYMESEAF APKENVSFED
IKTIYHENHT MELLEVSVVL EEEETKDIEV ICHGNAEDFM LIKGYPAHIR LEMPKAHSFY
GICYVPRQNH REHEGDIKEY QVEYRKEGHW EFLCEGELKS SFSPKTITFP KTVTTDAIRF
IAISGFGGEH VTKWDLEKDG WHQMTGAYSD PYVSIACLTL LTEEGLGIWR GEVNNQTVDF
KSATTDIEN