Gene Shewmr4_1999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1999 
Symbol 
ID4252572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2381573 
End bp2384938 
Gene Length3366 bp 
Protein Length1121 aa 
Translation table11 
GC content50% 
IMG OID638118612 
Productglycoside hydrolase family protein 
Protein accessionYP_734129 
Protein GI113970336 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000101187 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000170847 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGTGCTTA AACGCTCATT CGTGTCAGCA AGCTTGATCA TGGCACTAGC CTCCTGTGGT 
GGGGACGACA GCAGCTACAG TGCGAACGAC ACAAATACCT TTACCCCTCC CGCCCCTGTG
ACATCGGCAG ACCCAACAAT TGCAGGTGAG GTGAGCTACA AAAACGCAGT AGTTCATGAT
CCTTCCATCA TTAAAGATAC CGATGGTACC TATTATGTTT TCGGATCCCA CTTGGCGGTG
GCCAGTTCTA CCGACTTGAT GAACTGGACA CAAATTGCCT CGGATGGTGC CGCCAAGAGC
AGCCTGTTTA ATACCTATGA GTCAGAAATC GCGGAAGGCA CTGCATGGAC GGGTGGCTTT
GTTGGTTCGT GGGCACCGGA TGTGATAAAG CTTGCCGACG GTAAGTATCA TTTCTATTAC
GACTTCTGTG GCGGCCCCGA TAAGATTGAT TGCGTATCGC GCTCCTATTT GGGGATGGCC
ACCTCGGACA ATATTTTAGG CCCTTATGTT AACCAAGGCT TGATCCTGAA ATCAGGTCAT
GAGGGGGCTG AAAACCCTGG CGCAAACGGT CAAATTTACG ATGGCTTTGT CGACCCTAAC
GCTATCGATC CAGCCGTGTT TTACGATAAA GATGGCGGCC TATGGATGAC CTATGGTTCC
TATTCCGGCG GGATCTGGGT GATGCAGTTA GATCCAGCAA CGGGTAAACC GTTAGCGGAT
CAGGGCTATG GCACTAAGAT CATGGGCGGC AATTACAGTG CGATTGAAGG GTCTTATGTT
ATCTATAGCC CTGAATCCGA ATACTATTAT ATGTTTACCT CCTTTGGTGG CTTCGCCCAA
AAAGACGGTT ACAACATTCG TATCTCTCGC TCTAAAAATC CTAACGGTCC CTATGTGGAT
GCTGCGGGTC TGGACATGAT TGGCGCAACG GCCGCAGGCA ATATCGCCGA CTATGGCGTC
AAACTGATGG GCGGTTTCCA ATTTGTGGCC CACCCGGGCG ATGTGGGTCA TGATCACGGT
TACCTGTCAC CCGGCCACAA CTCCGCTTTT TATGATGCAC AAACGGGCAA ATACTTCTTA
GTATTCCATA CGCGCTTCCC CGATACCGGC GAAGGTCACA GCGTGCGCGT GCATGAGCTA
TTTCTCAACA GCGATGGTTG GCTAGTTGCC TCACCACAGC GTTATGCACC GATCAACGGC
GATAACATTG TCGATGAAAT CGATGTCACT GGTGATTATC AATTTATTAA TCACGCAAAA
AATATCAATA CGGCGGCGCA CACCTCCCTG CACATTAAGT TATCCCGTAC TTGGACCAAT
AAAGGCAGTG TGTCTGGCGA TGTGACTGGT ACTTACCAAC AGGGCGATGA CAACCAAATC
ACCTTAATGC TAGACAATTT AGGCACTTTT GAAGGCGTAC TCGCCTGGCA ATGGGATCCT
GAGCAAAACA AACTCATGCC AACCTTTAGC GCCATTTCTA ATGATGGGGT GAGTATTTGG
GGCGTGAAGC TCACCGATAA CACCACTGAG GAAATTTTAT CGGCATCGGC AAACGGCATT
AGCCTGCCGA CCGAAGCTAC CGAAGGTAAG ATCGCCTTAC CGATACAAGG TACCCGTGGA
TCAACTATCG AGTGGCAATC GAGCGACGAG TCAGTGATCC GTGCCGATGG CACCATTATC
CGACCCAACG TGGGCGAGGG CGATAAGGTC GTGACCTTAA CCGCGACAAT TATGGTCAAT
GGTAAGAAGG TGACTAAAAC CTTCCAAATT ACTGTATTTG CCCATAAAAC CTATAACCGT
ATTGCCCAGT ACAGCTTCGA GAATAACCTC AAGGATTCCT TAGGCTTATT TGGTGATGGT
CAGCCCACTG GCGATAGGAT CTTCAAAGCG GGTGATACCA TTGGCTACGC CACAGGCTTT
GAAGGACAAG CCCTATCATT AGATGGTGCC CACGGGGTGT TACTGCCATC GGGCATTATC
TCTAGCTACG AGTACACAGT GTCTTTTTGG GCAAGTCCTG CAGTGATTAC AGGATTTACC
ACGGCATTCT TCGGCGCAGT TAACGAGCAA ACCGCTGAAG ATGGTAGTAA GTTCTCCAAT
ACTTGGGTCA GCTTGCTGCC ACAGGGCTGG GACGGTAATA CCATGTTCTG GAGCCATAAT
ATCGATACCA GCGGCAGCTC AGTGCTTGAG ACCTGGTTTG ATGGTGTCAC CGGTGAGCGC
ATCGCTGAAA ACACTTGGTC GCATCTGGCG TTCTCAGTCA ATAAAGGGTT GGTAAAAGTC
TTTATCAACG GGGTTGAGCG CTTTAGCAGC GGTAACCTTG CCAACTACTT TACGGGTGCT
CAGGGCGTTT TCGGTCTTGG GGTTAACTAC TGGGATGTTC CATACAACGG CCTTATCGAT
GAGCTAAAAA TATATGAAGC GGCATTAACG GCTGAAGAAG TCAAAGCCCT CGATATCGAC
AAGTTAGCCG ACAGTGAACT CTTGTCCTCA GCAACGGCCA TTCTGGAGCT AGGCGACTTA
TCGGCGGTGC GTGAAAATAT CGAACTGCCA GTAACGGGAC CTTATGCTTC GGCAATCACT
TGGGTCTCTT CCGATCCAAC CATTATCGAT ACCCGCGGCA CGGTCAATCA GCCAGGCCGT
GAGGAAACCG ATAAGGTGGT GACCCTAACG GCGACCTTAA AACTCGGTCA AGCGACACAG
ACGAAAGTCT TCACCGCCAC CGTTAAGTCT AAGGCGCCGC CAACACCGGT CGCAGTCTAT
AGCTTTGAAG ACAATCTTAA CGACAGTACC GCTAACTTCG GCGCGGGCAC TGTAGTGGGT
AACCTGATTG GCGTTGAAGG CGGTAAAATC AGCTATGTGG ATGGCGCGGT AGGCAAGGCG
GCGGTCTTTG ATGGTGCCTC TGGTGTTGTA TTGCCGAATA ACCTTATCAA AGACTACACC
TACTCAGTGT CGATGTGGCT CAACCCTGAG CAGCTGAATA AGTACACTAC AGCCTTGTTT
GGTTATGCGA CCGATTCTAG CTGGACCAGC GTCTTACCAG GCGGTCAAAA TGATTATGAG
CGCATGGTGT TATGGTCAGG CACAGCATGG TACGACGGTA GAACCGGTTT TGTGATGCCA
AAATCCCAAT GGACGCACCT GGCGTACACA GTGAATGGCG GCGATGTGAA GGTCTATATC
AATGGTGAAT TGAAGTTTAC CGGCGCGAAC TTCCCGAACA TCTTCTCAGT ACCAACCACT
AAGTTTGCGG TCGGGGTCAA CTTCTGGGAT ACGCCTTTCA AAGGGGCAAT CGATGAAATC
AAGTTCTATG ACGAAGCGAT TACTGAACAA GATGTAGCTG ATTTATTTGG CGAATCTAAT
CAATAA
 
Protein sequence
MVLKRSFVSA SLIMALASCG GDDSSYSAND TNTFTPPAPV TSADPTIAGE VSYKNAVVHD 
PSIIKDTDGT YYVFGSHLAV ASSTDLMNWT QIASDGAAKS SLFNTYESEI AEGTAWTGGF
VGSWAPDVIK LADGKYHFYY DFCGGPDKID CVSRSYLGMA TSDNILGPYV NQGLILKSGH
EGAENPGANG QIYDGFVDPN AIDPAVFYDK DGGLWMTYGS YSGGIWVMQL DPATGKPLAD
QGYGTKIMGG NYSAIEGSYV IYSPESEYYY MFTSFGGFAQ KDGYNIRISR SKNPNGPYVD
AAGLDMIGAT AAGNIADYGV KLMGGFQFVA HPGDVGHDHG YLSPGHNSAF YDAQTGKYFL
VFHTRFPDTG EGHSVRVHEL FLNSDGWLVA SPQRYAPING DNIVDEIDVT GDYQFINHAK
NINTAAHTSL HIKLSRTWTN KGSVSGDVTG TYQQGDDNQI TLMLDNLGTF EGVLAWQWDP
EQNKLMPTFS AISNDGVSIW GVKLTDNTTE EILSASANGI SLPTEATEGK IALPIQGTRG
STIEWQSSDE SVIRADGTII RPNVGEGDKV VTLTATIMVN GKKVTKTFQI TVFAHKTYNR
IAQYSFENNL KDSLGLFGDG QPTGDRIFKA GDTIGYATGF EGQALSLDGA HGVLLPSGII
SSYEYTVSFW ASPAVITGFT TAFFGAVNEQ TAEDGSKFSN TWVSLLPQGW DGNTMFWSHN
IDTSGSSVLE TWFDGVTGER IAENTWSHLA FSVNKGLVKV FINGVERFSS GNLANYFTGA
QGVFGLGVNY WDVPYNGLID ELKIYEAALT AEEVKALDID KLADSELLSS ATAILELGDL
SAVRENIELP VTGPYASAIT WVSSDPTIID TRGTVNQPGR EETDKVVTLT ATLKLGQATQ
TKVFTATVKS KAPPTPVAVY SFEDNLNDST ANFGAGTVVG NLIGVEGGKI SYVDGAVGKA
AVFDGASGVV LPNNLIKDYT YSVSMWLNPE QLNKYTTALF GYATDSSWTS VLPGGQNDYE
RMVLWSGTAW YDGRTGFVMP KSQWTHLAYT VNGGDVKVYI NGELKFTGAN FPNIFSVPTT
KFAVGVNFWD TPFKGAIDEI KFYDEAITEQ DVADLFGESN Q