Gene Shewmr4_1996 details

Gene Information

Locus tag: Shewmr4_1996
Symbol:
ID: 4252569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2374981
End bp: 2376885
Gene Length: 1905 bp
Protein Length: 634 aa
Translation table: 11
GC content: 51%
IMG OID: 638118609
Product: glycoside hydrolase family protein
Protein accession: YP_734126
Protein GI: 113970333
COG category: [R] General function prediction only
COG ID: [COG3940] Predicted beta-xylosidase
TIGRFAM ID:
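
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both are assumptions about this record's conventions, not statements from the record itself. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 2374981
end_bp = 2376885

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1905
print(protein_length)  # 634
```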


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000718487
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000398389
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACTCACA CAATCAACAA ACGAATGGCT ACGCTTGCGC TGGCCATGGG ACTTAGTGCG 
ACCTGCTTGG GGGCAGGTAA CCTACATGCT GCAGAGAGTG CGAAGGGCGT GGATGGAAAC
CGCATTACCG CTGCGACCTT TGCGAATCCG TTATTTCGAA ATGGAGCCGA TCCTTGGCTC
GAATACCACA ATGGTAACTA TTACCTCACC ACCACCACGT GGACCTCTGA GCTGGTGATG
CGTAAATCGC CCACCATTGC AGGGCTTGCC GATGCGCCGG CCCACAATAT TTGGAGCGGC
ACCGATAAGT CCAACTGCTG TAATTTTTGG GCATTCGAAT TCCACCCTCT GCAAACTGCG
CAGGGATTAC GTTGGTATGT AATTTACACC TCGGGCGTGG CAGAAAACTT CGATGGCCAG
CGTAACCATA TCCTCGAGAG TGAAGGCAGT GACCCTATGG GGCCATACAA GTTTAAGGGC
ACGCCAATGC CCGACCACTG GAATATCGAC GGCAGCTATT TGGAGTATAA AGGCCAGCTA
TATTTCCTCT GGTCCGAATG GCACGGCCAA GATCAAGTCA ACTTGATTGC CAAGATGAGT
AACCCTTGGA CCGTCGAGGG CGAACATAAG GTGATCACAG CGCCCATTCA CGACTGGGAA
AAATCAGGCT TAAACGTCAA CGAAGGCCCT GAAATCATCC AGCATGAGGG CAGAACCTTT
TTAGTTCACT CGGCAAGCTT TTGTAATACT GAGGATTATT CCTTAGCCGT GGTTGAACTC
ACAGGTGACG ATCCTATGGA TCCCGCCGCA TGGACTAAGT ACGACAAGCC TTTCTTTAGC
AAAGCCAATG GCGTCTATGG CCCTGGCCAC CATGGTTTCT TCAAGTCTCC CGATGGAAAA
GAAGATTGGC TCATTTACCA TGGCAACTCC TCGGCCTCAG ACGGCTGTAG TGGTACCCGA
GCGGCACGTG CTCAACCCTT TACTTGGGAT AACAAAGGCT TGCCTAAATT TGGCGAACCA
TTGGCGGATA AAAAGCAATT GCCAGTCCCA AGTGGCGAGT TTGGCCCGAT AACCACTCAA
GTGGAAGGCG TGAAATACCG CATCGTGAGC CGTGAAGTCG GTCAATGCCT AGTGACCAAT
GCCAAGGGCC AGGTCAGTGT CGGTAAGTGC GAGGATGACA ACAGCCAATG GGTAATTGAT
CCGAGTAACG ATGGCCTGTA TCGCTTTGCT AATGTGGGTC AGGGAACCTT TTTAACTCAG
GCTCTGTGCC AAGATGAGTC TTCAACGGCA CTGAATTCGG CGCCTTGGGT CGCCTCCCGT
TGTCAGCGTT GGTCGGTGGA TTCGACCCGT GAAGGCTGGT TCCGTTTCGC AAACGATCGC
TCCATTGGCA ATCTGCAGGT GAAAAACTGC AGTAAAAAGG CCGGCGCAGA GGTGATTGCG
GGGGAAAACC GTGTCAGTGA ATGCACCGAT TGGCGGATTG AGCCAGTCTC CACATTTGCC
ATAGTCAACG CCCATAGCGG CCGAGTGGTT AGCGCCGAAC AATGTCAGCT TAAACCTAAT
GCCAATGTGG CTCAGTTTGA ATACACCGGC GATGCCTGTC AGCAGTGGCA AGCAATGCCG
ACAACCGATG GATTTTACCG TCTGCAATCC ATCCAACTTT CAAACAACAA GGCGCAACAA
TGCCTTGTGA CCAACGAAGG TAATCTGGAG CTAGGGGCGT GTAATGCAAT CGACAGCGAG
TTCCGTAGCG AGTTGATGCC AAATGGATCA TTAAGGCTAG TGTCTCGCAA GGGTGGTTCG
TCCATGAAAG TGGCCAATGG CTCCTATGCC AATGGCGATA ACATAGTGGA AGACGTGTGG
AAAAACACCA TTTCACAACA GTTCTATTTT AGAGAGGTGA AATAA
 
Protein sequence
MTHTINKRMA TLALAMGLSA TCLGAGNLHA AESAKGVDGN RITAATFANP LFRNGADPWL 
EYHNGNYYLT TTTWTSELVM RKSPTIAGLA DAPAHNIWSG TDKSNCCNFW AFEFHPLQTA
QGLRWYVIYT SGVAENFDGQ RNHILESEGS DPMGPYKFKG TPMPDHWNID GSYLEYKGQL
YFLWSEWHGQ DQVNLIAKMS NPWTVEGEHK VITAPIHDWE KSGLNVNEGP EIIQHEGRTF
LVHSASFCNT EDYSLAVVEL TGDDPMDPAA WTKYDKPFFS KANGVYGPGH HGFFKSPDGK
EDWLIYHGNS SASDGCSGTR AARAQPFTWD NKGLPKFGEP LADKKQLPVP SGEFGPITTQ
VEGVKYRIVS REVGQCLVTN AKGQVSVGKC EDDNSQWVID PSNDGLYRFA NVGQGTFLTQ
ALCQDESSTA LNSAPWVASR CQRWSVDSTR EGWFRFANDR SIGNLQVKNC SKKAGAEVIA
GENRVSECTD WRIEPVSTFA IVNAHSGRVV SAEQCQLKPN ANVAQFEYTG DACQQWQAMP
TTDGFYRLQS IQLSNNKAQQ CLVTNEGNLE LGACNAIDSE FRSELMPNGS LRLVSRKGGS
SMKVANGSYA NGDNIVEDVW KNTISQQFYF REVK
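
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the pipeline that produced this record. It assumes that translation table 11 assigns amino acids to codons the same way as the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and exist only for illustration.

```python
# Build the codon table from the conventional TCAG ordering.
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGACTCACA CAATCAACAA ACGAATGGCT ACGCTTGCGC TGGCCATGGG ACTTAGTGCG"
print(translate(first_line))  # MTHTINKRMATLALAMGLSA
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same kind of comparison against the Protein Length and GC content fields.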