Gene Shewmr4_1991 details


Gene Information

Locus tag: Shewmr4_1991
Symbol:
ID: 4252564
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2367618
End bp: 2368619
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 49%
IMG OID: 638118604
Product: Alpha-N-arabinofuranosidase
Protein accession: YP_734121
Protein GI: 113970328
COG category: [R] General function prediction only
COG ID: [COG3940] Predicted beta-xylosidase
TIGRFAM ID:
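
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2367618
END_BP = 2368619


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1002 333
```

Under those assumptions the result agrees with the listed gene length (1002 bp) and protein length (333 aa).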


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.535361
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0025818
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATAAGAG CAACAAACAG CCCCCTAGTC GAGCAGCGCG CCGATCCCTT TGTGTACCTA 
CATAGCGACG GTTATTACTA CTTTACGGGC TCTGTTCCGA CCTACGATCG GATTGAACTG
CGTAAATCCA AGACCTTAGA CGGCTTAAAA GACGCACAAA CCTTCGATAT CTGGTTTAAA
CACCAAAGCG GCCCAATGAG CCGCCACGTA TGGGCGCCCG AGATCCATTA TCTCGACGGC
AAATGGTATA TCTACTTTGC GGCAAGTGAA GAGGAAAATA TTTGGGCCTT ACGCCCCTAT
GTGCTTGAGT GTCTAGGACA AGATCCATTA AATGATGAAT GGATTGAACT TGGCATGATG
CAGGCGGCTG AGGGTGATAA TAAGTCCTTT ATCGACTTTT CCTTAGATGC GACCATTTTC
GAAAACAACG GTAAGCGTTA CTTCTGCTGG GCGGAGAAAA CCGGTGGACA ATTTGCGGCA
TCCAACCTGT ATCTTGCGGA AATGGCATCG CCCATCAAGT TAAAGACGGC GCAATTTATG
CTCACCACCC CAGATTATGA TTGGGAGCGC GTCGATTTTT GGGTTAACGA AGGGCCTGCA
GTACTTAAAC ACCAAGGTAA AATCTTCATT ACTTTCTCAG CCAGCGCCAC AGGTGCTTGT
TATTGCATGG GTTATATGGA GGCCGATGAG CATGCTGATC TGCTCGATCG TAACTCATGG
AAGAAAACCC GCCAGCCAGT GCTGTGCACA GATGTCGACA AGCAAATATT CGGCCCTGGT
CATAACAGTT TCACCGTGGC AGAAGATGGC GTGACGCCCA TCTGTGTTTA CCATGCCCGT
GATTATGAAC ATGCGGTGGG CGATCCTAGC GTGGTGCCAA AGACAGATAC CCGTCCATTA
GCGCAAATCA TTAAAGATCC ACTCTATGAT CCCAATCGCC ATGCGCGGAC GTTAGCGGTG
TCATTTGATG AGAGCGGGCG CCCCTTGTTC AACCTCTACT AA
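
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the Gene Information section. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from above.
first_line = "ATGATAAGAG CAACAAACAG CCCCCTAGTC GAGCAGCGCG CCGATCCCTT TGTGTACCTA"
seq = clean_sequence(first_line)
print(len(seq))  # prints: 60
print(f"{gc_content(seq):.1f}% GC in the first 60 bases")
```

Passing the full 1002 bp sequence through the same two functions would give the whole-gene figure to compare with the reported 49%.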
 
Protein sequence
MIRATNSPLV EQRADPFVYL HSDGYYYFTG SVPTYDRIEL RKSKTLDGLK DAQTFDIWFK 
HQSGPMSRHV WAPEIHYLDG KWYIYFAASE EENIWALRPY VLECLGQDPL NDEWIELGMM
QAAEGDNKSF IDFSLDATIF ENNGKRYFCW AEKTGGQFAA SNLYLAEMAS PIKLKTAQFM
LTTPDYDWER VDFWVNEGPA VLKHQGKIFI TFSASATGAC YCMGYMEADE HADLLDRNSW
KKTRQPVLCT DVDKQIFGPG HNSFTVAEDG VTPICVYHAR DYEHAVGDPS VVPKTDTRPL
AQIIKDPLYD PNRHARTLAV SFDESGRPLF NLY
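
The protein sequence can be related back to the gene sequence by translating it codon by codon. This is a minimal sketch, assuming the codon assignments of translation table 11 match those of the standard genetic code (the two tables differ in which start codons they permit, which this sketch ignores); the `translate` helper is illustrative, not a library function.

```python
from itertools import product

# Codon table built in TCAG order; assumed to match table 11 codon assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from above.
print(translate("ATGATAAGAG CAACAAACAG CCCCCTAGTC"))  # prints: MIRATNSPLV
```

The output matches the first ten residues of the protein sequence listed above.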