Gene Cphy_3011 details


Gene Information

Locus tag: Cphy_3011
Symbol:
ID: 5743337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 3676460
End bp: 3678076
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 39%
IMG OID: 641294112
Product: Alpha-N-arabinofuranosidase
Protein accession: YP_001560107
Protein GI: 160881139
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3507] Beta-xylosidase
TIGRFAM ID:
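
The start, end, and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Cphy_3011.
length = gene_length_bp(3676460, 3678076)
print(length)                     # 1617
print(protein_length_aa(length))  # 538
```

Both results agree with the Gene Length and Protein Length fields listed in the record.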


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCAGA TAAAAGTTTC GAATCCTATT CTTTCTGGAT TCTATCCAGA CCCATCCATA 
GTCCGCGTCG GACAGGATTA CTATATGGTA AATTCCACAT TTTCTTACTT CCCAGGGGTA
CCTTTATCTC ATAGTACTGA CTTAATTCAC TGGGAACAGA TAACAAATAT TCTATCTACT
AAAAAACAGT TGAATCTTGC AAATTCTCCT CATAGTGGTG GTATCTATGC TCCAACCATC
CGCTATCATA AAGGTACTTT CTATATGATT ACTACCAATG TTTCTCATGG AGGAAATTTT
ATTGTAACTG CAACCAATCC TCTCGGACCT TGGTCAGAAC CTTACTTTTT AAATGGTGCA
GAGGGAATTG ATCCTTCCCT TTTCTTTGAT GAAGACGGAA CTTGCTATTA TTGTGGAACG
AAAGGCCGCA GAGAAGGATC TGCTTTCTTC GGTGATAATG AGATTTATGT ACAAGAAGTT
GACTTAACTA CCATGCAGTT AACAGGCGAA TCTTATGCCA TATGGCACGG TGCCCTAAAA
GGAGTTGAGT GGCCAGAAGG TCCGCATATT TATAAACGTG ATGGTTGGTA TTATCTCATG
ATTGCAGAGG GTGGTACAGG ACTAAATCAT GCTATTACTA TGGCTAGAAG CAAGAATATA
AAAGAAACGT TTGAAGGATG CAAAAGAAAC CCTATCTTCA CTCATCGTCA TCTTGGAAAA
CAGTATTGGG CAATCAATAC CGGGCATGCT GATATCGTAG AGACAGAACA CGGAGACTGG
TATATGGTAT TGCTTGCAAG TAGGCCATGT GATGGTTACT GCTTACTTGG AAGAGAGACT
TTCCTGGTTC CACTCATCTG GGAAGATGGA TGGCCTATTG TAAATCCAGG TGTTGGACTT
TTAGATAGAA TAGTTACTAT CCAAGTAAAG GATTCTTCTA CTTTAGTGGA AGCCAATGAG
GCACAAGTAG GTGAAAAAGA ATTAGATTCT CTTTTAAAAG ACTACCATCC AACTTGTAGA
GATATTAAAG AAAATTTCCG TCAGAAGGAT TTGCCTCCTT ATTTCTTCTA CTTAAGAAAT
CCTCAGGAAG ATCACTATGA AACAGGCAGA GAAACTGGTC TTCGCTTATA CGCCAGTGAT
GTATCACTCA CAGCAGATGC TTCTCCAACA GCGCTCTTTC TTCGCCAGAC TTCTATTAAT
TACACACTGG GTACCAAACT AGAATCCACT CTAACGAATG AAAATAGTGA AGCTGGTATC
CTCCTAATGC AAAGTAATCA TTTTCATTAC CGTTTTTGTA TCTATAAGAG TAACGTTCCT
ATGGTTGTAT TAATATCTTG TATCGAGGGA AAAGAGCAGT TTTTAATTAA GAGAGAATTG
TCGAAATTCC CTTCCTATCT GCAGGTGAGG GAAGAAGACC TAAACTTAAG CTTTTTCTAC
TCTTTCGATG GAACAGAATA CCAAACTGTT GCTGCTTCGA TTGATGCGAG TATCCTTAGC
ACAGAACGTG CTGGCGGCTT TGTTGGTACC TGCCTTGGTT TATATACCTA TACACCTACC
AAGGAATTTG GAGAGGATTT TGTAGATTTT GATTATCTGC ATTATCAGGT AATGTAG
 
Protein sequence
MKQIKVSNPI LSGFYPDPSI VRVGQDYYMV NSTFSYFPGV PLSHSTDLIH WEQITNILST 
KKQLNLANSP HSGGIYAPTI RYHKGTFYMI TTNVSHGGNF IVTATNPLGP WSEPYFLNGA
EGIDPSLFFD EDGTCYYCGT KGRREGSAFF GDNEIYVQEV DLTTMQLTGE SYAIWHGALK
GVEWPEGPHI YKRDGWYYLM IAEGGTGLNH AITMARSKNI KETFEGCKRN PIFTHRHLGK
QYWAINTGHA DIVETEHGDW YMVLLASRPC DGYCLLGRET FLVPLIWEDG WPIVNPGVGL
LDRIVTIQVK DSSTLVEANE AQVGEKELDS LLKDYHPTCR DIKENFRQKD LPPYFFYLRN
PQEDHYETGR ETGLRLYASD VSLTADASPT ALFLRQTSIN YTLGTKLEST LTNENSEAGI
LLMQSNHFHY RFCIYKSNVP MVVLISCIEG KEQFLIKREL SKFPSYLQVR EEDLNLSFFY
SFDGTEYQTV AASIDASILS TERAGGFVGT CLGLYTYTPT KEFGEDFVDF DYLHYQVM
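
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The Python sketch below is one way to do that with the standard library only. It makes two assumptions: the sequence blocks are pasted as plain strings with spaces and line breaks between the 10-character groups, and the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the sketch does not handle alternative start codons, which does not matter here because the gene begins with ATG). The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments of the standard
# code, which translation table 11 is assumed to share.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGAAGCAGA TAAAAGTTTC GAATCCTATT CTTTCTGGAT TCTATCCAGA CCCATCCATA"
print(translate(first_line))  # MKQIKVSNPILSGFYPDPSI
```

The printed residues match the first 20 residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.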