Gene Cphy_0531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_0531 
Symbol 
ID5743445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp674706 
End bp676121 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content39% 
IMG OID641291643 
ProductAlpha-N-arabinofuranosidase 
Protein accessionYP_001557657 
Protein GI160878689 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAC AAGGATTTAA TCCGTATCTT CCATCATGGG AATATATTCC GGATGGTGAA 
CCTTATATTT TTAATGATAG AGTCTATGTC TATGGCTCTC ATGACCGTTT CAACGGGTAT
GCTTACTGTT TAAATGATTA TGTTTGTTGG TCTACACCGG TGGATGATCT ATCTGACTGG
CGTTGTGAGG GGGTTATTTA CAAAGCTACT GATGTGCAAG ATAATGAAGA CCGCGACAGC
TGTCTTTATG CTCCTGATGT AGCAGTTGGA CCAGACGGAA GATATTATCT ATATTATGTA
GATAGTAAGC GTTCTATAAT ATCTGTTGCT GTTTGTAATA CTCCGGCAGG TAAATACGAA
TTTTATGGTT ATATACATTA TGCAGACGGT GTAAATTTAG GAGAGAGAGA GGGAGATGAA
CCTCAATTCG ATCCTGCGGT ATTGGTAGAG GGAGATAAGG TTTATCTTTA CACCGGCTTT
TGTGCGATTG GTGATAAATC CAGAAGTGGA GCGATGGCGA CGGTTCTTAG TACTGACATG
TTGACCATCA TTGAGGATCC GGTTATTATA GCTCCTAGTG AACCGTATAG TAAGGGAAGC
GGTTTTGAGG GACATGGATT TTTTGAGGCT CCCTCCATTA GAAAACGGGG TAACACTTAT
TATTTTATTT ACTCCTCAAT CCTTATGCAT GAGTTATGCT ATGCTACCAG TAAGCATCCT
ACCATGGGAT TTAAGTATCG CGGAACTATA GTAAGTAATT GTGATATTGG TATTAGCACC
TATAAACCAG CTAACAAACC TATGTATTAC GGAGGAAATA ACCACGGTAG CATTATGGAG
ATAAATGGAA AATGGTATAT TTTCTATCAT AGGCACACCA ATGGTACTAA TTTTAGCAGG
CAGGGTTGCT TAGAAGAAAT TTCTTTTTTA GAAGATGAAT CAATTCCTCA AGTTGAAATA
ACATCATGTG GTCCAAATGG CGGGCCCCTA AAGGGATTGG GCGAGTATCA TGCTTACATA
GCTAGTAATC TGTTCTGCGA TGAGGAATCC ATTTATACAG ATTTTACAGG GGCATGGATG
AATAATCAAT TTCCAAAGAT TACACAGGAT GGAAAAGATG GCGATGAAGA GATTGGTTAT
ATTGCAAATA TGAAAGCATC TGCTACAGCT GGATTTAAAT ATTTTAACTG TATAGGAGTT
AAGAGATTAA AAATAAAAGT GCGCGGCTAT TGTAAAGGTG ATTTTGAAAT AAAGACTTCT
TTAAATGGCC CTGTTCTTGG TAAGATACCG GTTGCATTCT CCAATGTGTG GAAAGAATAC
TCTGCAGATA TAACGATCCC AGATGGAGTA CATGCACTGT ATATTACCTA TACCGGAGAA
GGAAGTGCAA GCCTTGCTTC CTTTACTTTG GAATAA
 
Protein sequence
MRKQGFNPYL PSWEYIPDGE PYIFNDRVYV YGSHDRFNGY AYCLNDYVCW STPVDDLSDW 
RCEGVIYKAT DVQDNEDRDS CLYAPDVAVG PDGRYYLYYV DSKRSIISVA VCNTPAGKYE
FYGYIHYADG VNLGEREGDE PQFDPAVLVE GDKVYLYTGF CAIGDKSRSG AMATVLSTDM
LTIIEDPVII APSEPYSKGS GFEGHGFFEA PSIRKRGNTY YFIYSSILMH ELCYATSKHP
TMGFKYRGTI VSNCDIGIST YKPANKPMYY GGNNHGSIME INGKWYIFYH RHTNGTNFSR
QGCLEEISFL EDESIPQVEI TSCGPNGGPL KGLGEYHAYI ASNLFCDEES IYTDFTGAWM
NNQFPKITQD GKDGDEEIGY IANMKASATA GFKYFNCIGV KRLKIKVRGY CKGDFEIKTS
LNGPVLGKIP VAFSNVWKEY SADITIPDGV HALYITYTGE GSASLASFTL E