Gene Cphy_0218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_0218 
Symbol 
ID5745086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp266504 
End bp268846 
Gene Length2343 bp 
Protein Length780 aa 
Translation table11 
GC content40% 
IMG OID641291308 
Productalpha-xylosidase YicI 
Protein accessionYP_001557344 
Protein GI160878376 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTCA GTAATGGATG TTGGTTACAA AAGAAAGGTA CTGAATGTTT TTCACCAGTG 
CAGGTTTATG ATTATCAAAT CAAAGAAAAC GAGGTAAGAA TACTTACCAC AACACATCAA
ATCAATCACC GCGGTGATAC CTTAGGTGGC GTGAATCTTA CGATATTTAT TACAGCACCG
GCACCGGAAG TTTTGCGTGT TAAGACTTAT CACTATATGG GAGTTAGAAA GAAGAGTCCT
GAGTTTGAGT TAGACTTATC AGGGGCTTGT ACACTAGAGT GTGAGGACAC AGAGGATATT
TTAATAATTA AAAATGGAAG TCTTCGTTTG GAAGTACAAA AATCAAACGC AGCTTTTGCT
TATTATCGTG GGAATGAGAA ATTAACCTCC AGCGGTTGGC GAGATCTAGC CTATATGAAA
ACAGATTGGC AAGGGCTTGC TTACGATGAT GGTGGTGAAG AGGATACTTA CATGAGAGAG
CAATTAACGC TTTCTGTAGG TGAACTTGTT TATGGACTTG GTGAGAGATT TACACCGTTT
GTAAAAAATG GTCAATCTCT TGATATCTGG AATGAAGATG GCGGTACCTC TACCGAACAA
TCTTATAAGA ACATTCCATT TTACATAACA AACAAAGGCT ATGGTGTCTT TGTTAATCAT
CCGGAGAAGG TTTCATTTGA AATTGGCTCT GAGATGGTGA CCAAAGTAGG CTTTTCAGTA
CCTGGAGAAT GCCTCGATTA CTTTATTATC AATGGGCCAG ATATAAAGCA GGTATTGGCT
CGCTATACTG ACTTAACAGG AAAGCCTGGG CTCCCTGCAC CTTGGACTTT TGGTCTGTGG
TTATCGACCT CCTTTACTAC AAATTACGAT GAAGAAACAG TAAATAGCTT TGTAGATGGC
ATGTTAGAAC GCGGAATCCA CCTAGGAGTA TTTCATTTTG ATTGTTTCTG GATGAAGGAT
TTTTGCTGGT CTGACTTTAC ATGGGATAGC AGAGTATTTC CAGATCCAAA GAATATGTTA
GCTAGATTAA AGGCAAAAGG GTTAAAGATT TGTGTTTGGA TTAATAGCTA TATCGGCCAG
GAATCGGTTC TTTTCGAAGA AGGTGTAAAA GGAGGCTATT TCCTAAAAAG AAAGAATGGG
GACGTATGGC AATGGGATAT GTGGCAGCCA GGTATGGCGG TTGTTGACTT TACAAACCCA
GAAGCGTGTA AGTGGTTTGG TGAGAAACTG AAAGCTTTAC TTGATATGGG AGTAGATTGC
TTTAAAACAG ATTTCGGTGA GAGAATCCCT ACTGACGTTG TTTACTACGA TGGTTCCGAT
CCGATGAAAA TGCATAATTA TTATACATAT TTATATAATA AAACAGTCTA TGATGTTTTG
GCTAGCTGTA AGGGAAGAGA AGAGGCAATT TTATTTGCTA GATCCGCAAC CGTAGGCGGG
CAAAAGTTCC CTGTTCATTG GGGTGGTGAT TGTTGGTCTG ATTATGAATC TATGGAAGAA
AGTCTTCGCG GAGGCTTATC CTTAACGATG TCTGGATTTG GGTATTGGAG TCATGATATC
GGAGGATTTG AGAGTACTTC AACACCGGAT GTATATAAGC GCTGGGCAGC TTTTGGTTTA
TTATCCACCC ATTCCAGACT TCATGGTAGT ACTTCGTATC GTGTTCCTTG GGCGTATGAT
GAAGAAGCAG TCGATGTGGT TCGTTTCTTT ACTGAGTTAA AAGGTTCCCT TATGCCATAC
CTTTATCGTA ATGCTGTGGA GACCTCAGAA TCAGGTATTC CAATGATGAG AAGTATGGTG
ATGGAATATA CCAAAGATCC AAATTGCTCT TACCTAGATA AGCAGTATTT TCTCGGCGAC
AGTTTACTAG TTGCTCCGAT ATTTAATGAG AATAGTATGG CTCATTATTA CTTACCTAAG
GGGAAATGGA CCAATTATCT GACCGGTGAA GTAAAAGAGG GTGGGAGATG GTATGAAGAA
GAACATAGTT ACCTAAGCAT ACCACTCTAT GTAAAACAAG GTAGCATCAT TGCCTCTGGT
CCGAAAGGTC AAGGAGCAGT CTATGAGTAT ACGAAGGATC TTGAGTTAAA AATATATGAG
TTACAAGAGG GGGTAGAGGT AGCAACTACT GTTTATCAAG AAACCGGAGA AAAGGCGGTA
GTAATGAGTG CAGTATTAAA TGGTGGTAAG ATAACAATAA ATCTTACATC AAAAGTACCG
GTAAGTATTG TCTTAAAGAA CTATGTCGTA AATAGCGTAG AAGGTGTTTC ATTTAAAGTG
GATGGAAACG ATACGATTAT TTTGGCACAG GAATCTGTGG TTGCTATAGT TGCAATTTCT
TAA
 
Protein sequence
MKFSNGCWLQ KKGTECFSPV QVYDYQIKEN EVRILTTTHQ INHRGDTLGG VNLTIFITAP 
APEVLRVKTY HYMGVRKKSP EFELDLSGAC TLECEDTEDI LIIKNGSLRL EVQKSNAAFA
YYRGNEKLTS SGWRDLAYMK TDWQGLAYDD GGEEDTYMRE QLTLSVGELV YGLGERFTPF
VKNGQSLDIW NEDGGTSTEQ SYKNIPFYIT NKGYGVFVNH PEKVSFEIGS EMVTKVGFSV
PGECLDYFII NGPDIKQVLA RYTDLTGKPG LPAPWTFGLW LSTSFTTNYD EETVNSFVDG
MLERGIHLGV FHFDCFWMKD FCWSDFTWDS RVFPDPKNML ARLKAKGLKI CVWINSYIGQ
ESVLFEEGVK GGYFLKRKNG DVWQWDMWQP GMAVVDFTNP EACKWFGEKL KALLDMGVDC
FKTDFGERIP TDVVYYDGSD PMKMHNYYTY LYNKTVYDVL ASCKGREEAI LFARSATVGG
QKFPVHWGGD CWSDYESMEE SLRGGLSLTM SGFGYWSHDI GGFESTSTPD VYKRWAAFGL
LSTHSRLHGS TSYRVPWAYD EEAVDVVRFF TELKGSLMPY LYRNAVETSE SGIPMMRSMV
MEYTKDPNCS YLDKQYFLGD SLLVAPIFNE NSMAHYYLPK GKWTNYLTGE VKEGGRWYEE
EHSYLSIPLY VKQGSIIASG PKGQGAVYEY TKDLELKIYE LQEGVEVATT VYQETGEKAV
VMSAVLNGGK ITINLTSKVP VSIVLKNYVV NSVEGVSFKV DGNDTIILAQ ESVVAIVAIS