Gene Shewana3_2083 details


Gene Information

Locus tag: Shewana3_2083
Symbol:
ID: 4476329
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 2494497
End bp: 2496401
Gene Length: 1905 bp
Protein Length: 634 aa
Translation table: 11
GC content: 52%
IMG OID: 639726668
Product: glycoside hydrolase family protein
Protein accession: YP_869719
Protein GI: 117920527
COG category: [R] General function prediction only
COG ID: [COG3940] Predicted beta-xylosidase
TIGRFAM ID:
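
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the record: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(2494497, 2496401)
print(length, protein_length_aa(length))  # 1905 634
```

Under those assumptions the result agrees with the Gene Length (1905 bp) and Protein Length (634 aa) fields.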


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000179948
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000321687
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACTCACA CAATCAACAA ACGAATGGCG ACGCTTGCGC TGGCCATGGG ACTTAGTGCG 
ACCTGCTTGG GGGCAGGTAA CCTATATGCC GCAGAGAGTG CGAAGGGCGC GGATGAAAAC
CGCATTACCG CAGCGACCTT TGCCAATCCG TTATTTCGAA ATGGAGCCGA TCCTTGGCTC
GAATACCACA ATGGTAACTA TTATCTCACC ACCACCACGT GGACCTCTGA GCTGGTGATG
CGTAAATCGC CCACCATTGC AGGGCTTGCC TATGCGCCAG CCCACAATAT TTGGAGTGGC
ACAGATAAAT CCAACTGCTG TAACTTTTGG GCGTTCGAAT TCCACCCACT GCAAACTGCG
CAGGGATTAC GTTGGTATGT AATTTACACC TCGGGCGTGG CAGAAAACTT CGATGGCCAG
CGTAACCATA TCCTCGAGAG TGAAGGCAGC GACCCTATGG GGCCATACAA GTTTAAGGGC
ACACCAATGC CCGACCACTG GAATATCGAC GGCAGCTATT TGGAGTATAA AGGCCAGTTA
TATTTCCTCT GGTCCGAATG GCATGGCCAA GATCAAGTCA ACTTGATTGC CAAGATGAGC
AACCCTTGGA CCGTCGAGGG CGAACATAAG GTGATCACAG CGCCCATTCA CGACTGGGAA
AAATCAGGCT TAAACGTCAA CGAAGGCCCT GAAATCATCC AGCATGAGGG CAGAACTTTC
TTAGTTCACT CGGCAAGCTT TTGTAACACT GAGGATTATT CCTTGGCCGT GGTTGAACTC
ACCGGTGACG ATCCTATGGA TCCCGCCGCA TGGACTAAGT ACGACAAGCC TTTCTTTAGC
AAGGCCAATG GTGTCTATGG CCCTGGCCAC CATGGTTTCT TCAAGTCTCC CGATGGGAAA
GAAGATTGGC TCATTTACCA TGGCAACTCC TCGGCCTCCG ACGGCTGTAG TGGTACCCGT
GCGGCACGTG CTCAACCCTT TACTTGGGAT AACGAAGGCT TGCCTAAATT TGGCGAACCA
ATGGCGGATA AAAAGCAGTT GCCAGTCCCA AGTGGCGAGT TTGGTCCGAT AACCACCCAG
GTGGAAGGCG TGAAATACCG CATCGTGAGC CGTGAAGTCG GTCAATGCCT AGTGACCAAT
GCCAAGGGGC AGGTCAGTGT CGGTAAGTGC GAGGATGACA ACAGCCAATG GGTAATTGAT
CCGAGTAACG ATGGCCTGTA TCGCTTTGCT AATGTGGGTC AGGGAACCTT TTTAACTCAG
GCTCAGTGCC AAGATGAGTC TTCAACGGCA CTGAATACTG CGCCTTGGGT CGCCTCCCGT
TGTCAGCGTT GGTCGGTGGA TTCGACTCGC GAGGGCTGGT TCCGTTTCGC AAACGATCGC
TCCATCGGCA ATCTGCAGGT GAAAAACTGC AGTAAAAAGG CTGGCGCCGA GGTGATTGCC
GGGGAAAACC GTGTCAGTGA ATGCACCGAT TGGCGGATTG AGCCAGTCTC AACATTTGCC
ATAGTCAACG CCCATAGCGG CCGAGTGGTC AGCGCCGAAC AATGTCAGCT TAAACCTAAT
GCCAATGTGG CTCAGTTTGA ATACACCGGC GATGCCTGTC AGCAGTGGCA AGCCATGCCG
ACAACCGATG GATTTTACCG TCTACAATCC ATCCAACGTT CAAACAACAA GGCGCAACAA
TGCCTTGTGA CCAACGAAGG TAATCTGGAG CTAGGGGCTT GTACTGCAAT CGACAGCGAG
TTCCGTAGCG AGTTGATGCC TAATGGCTCA TTAAGGCTAG TGTCCCGTAA GGGCGGTTCG
TCGATGAAAG TGGCCAATGG CTCCTATGCC AATGGCGATA ACATAGTGGA GGACGTGTGG
AAAAACACCA TTTCACAACA GTTCTATTTT AGAGAGGTGA AATAA
 
Protein sequence
MTHTINKRMA TLALAMGLSA TCLGAGNLYA AESAKGADEN RITAATFANP LFRNGADPWL 
EYHNGNYYLT TTTWTSELVM RKSPTIAGLA YAPAHNIWSG TDKSNCCNFW AFEFHPLQTA
QGLRWYVIYT SGVAENFDGQ RNHILESEGS DPMGPYKFKG TPMPDHWNID GSYLEYKGQL
YFLWSEWHGQ DQVNLIAKMS NPWTVEGEHK VITAPIHDWE KSGLNVNEGP EIIQHEGRTF
LVHSASFCNT EDYSLAVVEL TGDDPMDPAA WTKYDKPFFS KANGVYGPGH HGFFKSPDGK
EDWLIYHGNS SASDGCSGTR AARAQPFTWD NEGLPKFGEP MADKKQLPVP SGEFGPITTQ
VEGVKYRIVS REVGQCLVTN AKGQVSVGKC EDDNSQWVID PSNDGLYRFA NVGQGTFLTQ
AQCQDESSTA LNTAPWVASR CQRWSVDSTR EGWFRFANDR SIGNLQVKNC SKKAGAEVIA
GENRVSECTD WRIEPVSTFA IVNAHSGRVV SAEQCQLKPN ANVAQFEYTG DACQQWQAMP
TTDGFYRLQS IQRSNNKAQQ CLVTNEGNLE LGACTAIDSE FRSELMPNGS LRLVSRKGGS
SMKVANGSYA NGDNIVEDVW KNTISQQFYF REVK
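
The gene and protein sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch, not the database's own tooling, of how the GC content could be recomputed and the gene translated. All function names are hypothetical. It assumes that translation table 11 assigns amino acids to sense codons exactly as the standard genetic code does, and it does not handle alternative start codons or ambiguous bases.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGACTCACA CAATCAACAA ACGAATGGCG ACGCTTGCGC TGGCCATGGG ACTTAGTGCG"
last_line = "AAAAACACCA TTTCACAACA GTTCTATTTT AGAGAGGTGA AATAA"
print(translate(first_line))  # MTHTINKRMATLALAMGLSA
print(translate(last_line))   # KNTISQQFYFREVK
```

The two translated fragments match the beginning and the end of the protein sequence listed above. Passing the full gene sequence to gc_percent gives a value that can be compared with the GC content field.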