Gene Shewmr4_2136 details

Gene Information

Locus tag: Shewmr4_2136
Symbol:
ID: 4252709
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2555134
End bp: 2557248
Gene Length: 2115 bp
Protein Length: 704 aa
Translation table: 11
GC content: 48%
IMG OID: 638118760
Product: alpha amylase, catalytic region
Protein accession: YP_734266
Protein GI: 113970473
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
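
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2555134
end_bp = 2557248

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes the stop codon, which is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2115 704
```

Both results agree with the listed Gene Length (2115 bp) and Protein Length (704 aa).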


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAATTTTC TATTATCTAA CAGCAGCAAA CGTCGCGCTT ATCGCTGGCA ACAGGGATTT 
AGCATACTGG CCCTCAGCAG CATTAGCACC TTTTGCACCA TGGCCGCACC AGCAATCAGC
ACTCAAGCAA CGGCGGCTAA AACCATGGCT GCCAATCCAA ATGTGGCCGA AGGCGAAATC
CTGCCAGCCC GTCACAATGA CGAGCAAGCA AATAAGTTTA AACCCGTTGT TTATCAAATT
TTTACCCGAC TCTATGGCAA TAAAAACACC ACCAATAAAC CTTGGGGCAC GATTAGCGAA
AACGGTGTGG GTAAATTTAA TGATATTGAT GACATAGCAC TCAAAAGTAT CAAAGACTTA
GGCGTTACCC ATGTGTGGTA TACGGGTGTG CCCCACCACG CCTTAATTGG CGATTACAGT
GCAATTGGCG TAAGTCACGA TGATCCCGAT GTGGTTAAGG GCCGCGCCGG TTCGCCCTAT
GCGGTTAAAG ATTATTACAA CGTAAACCCC GATTTAGCAG TCTACCCCGC CAAGCGCTTA
CAGGAGTTTC AGGCTCTTAT CGAGCGCACC CACAAGCAAG GCTTAAAGGT GATTATCGAT
ATAGTCCCTA ACCATGTGGC GCGTAATTAC CATTCCATCA CTAAGCCCGA GGGCGTGCGT
GATTTTGGTG AAGATGATAA TCAAACCCTT GAATATGAAA GGCATAATAA CTTTTACTAT
GTGACTGATA AAAAGCAATC CTCTGGCTTT CAAGTGCCCG ATTTGCCTGA TACCCTCAAA
CCGTTAGGCG GCGAATCGCA TCCCCTAAGT GATGGTCAAT TTGAAGAGAT CCCCGCCAAA
TGGACTGGCA ACGGCTCACG CCTTGCCAAA CCGGATATGA ATGACTGGTA TGAAACCGTT
AAAATCAATT ACGGTGTCCG CCCCGATGGC AGCCATGATT TCCCCGCACT GCCGCCACGC
TATGCCACAC TCGGCGCCGA GCAGCACTAT GCTTTTTGGC AGCAACATAG CCATGAATTA
CCTAACTCTT GGATCAAGTT CAATCAAATT GCCCAATATT GGTTAGCGAT GGGAGTCGAT
GGATTTCGTT ACGATATGGC CGAAATGGTG CCAGTCGAGT TTTGGAGCTA TTTAAATAGT
CATATAAAAC ATAGCCATCC CGAAGCCTTT ATCTTAGCAG AGGTCTATAA CCCTGCGCTG
TATCGCGACT ATATTCATCT CGGCAAAATG GACTACCTCT ACGACAAGGT CGATCTTTAC
GACACCCTCA AAGCCATTAT GGCAGGACAA AAAAGTACCG CGCAGATCGC CGCGGACCAG
GCCAAAGTGC AAGATATCGA CTCGCATATG CTGCATTTTT TAGAAAACCA CGACGAGCAG
CGCATCGCCA ATGCCGCCTT TTTAGGCGTA TTAACTGGTA ACACCTCGAC AGATGCGGTC
GATCCCCGCT ACGCCCTGCC TGCAATGGTG GTGTCGGCGA CCTTAAGTAC CTCACCCACC
TTGCTTTATT TCGGTCAAGA AGTGGGAGAA GCGGCGACGC AAAACCTAGG CTTTGGCCAT
GCGTCACGCA CCAGTATTTT TGATTATGCG GGTGTTCCCG CCCATCAGCG CTGGATGAAT
CAAGGTAAAT TTGATGGTGG CCAATCAACC GCCGCAGAAG TTGCGCTACG TACCTATTAC
CAAAAATTAT TGAACCTGAG CACGGGGAAA AATGCACCCG CGCTCTTAGG GAAATATCAC
TCGCTAGATG CTGCCAACCG CAGCGCGGTA TCGGCTGCAA AGGCTAGCAA TAAGGCTAGC
AATAAGACAA ACAATGGCAG CGCAACGGGT TATGATGACT CAACCTTTGC CTTTGTCCGC
TTTGAGGCCC ATACAGCCAA TAGCAAAGGT CAAAAGCTGA TTATTGTCAG TAACTTTAGT
CAAACCCAAG CCAAGTTATT TTCCCTTAAA CTCCCCAAAT CTTTGATTGC GCAATGGCAG
TTAACCGATG CAAGCTATCC GCTTAAGGAT TTACTGGAAG AACATACGGC GCAGTTAATT
GTCGAGCGAG GTGAAGGACA GGTTCAGTTG CAGCTTGCAC CTCTCTCCTC CGCGATATTT
GAACTCGTCC ACTAG
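
The gene sequence is displayed in 10-base groups across many lines, so it needs to be joined before any analysis. The sketch below shows one way to do that and to check that the result looks like a coding sequence under translation table 11, where TTG (the first codon here) is an accepted start codon. The function names `clean_sequence` and `looks_like_cds` are hypothetical helpers, and the input is a shortened excerpt, not the full gene.

```python
# Start and stop codons of NCBI translation table 11 (bacterial).
START_CODONS = {"ATG", "TTG", "CTG", "ATT", "ATC", "ATA", "GTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(block: str) -> str:
    """Join a display-formatted block (spaces, line breaks) into one uppercase string."""
    return "".join(block.split()).upper()


def looks_like_cds(seq: str) -> bool:
    """True if length is a multiple of 3, with a table-11 start codon and a final stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Hypothetical excerpt: the first 6 and last 6 bases of the gene, joined
# only to keep the example short. Paste the full block for a real check.
excerpt = "TTGAAT\nCACTAG"
seq = clean_sequence(excerpt)
print(seq, looks_like_cds(seq))  # TTGAATCACTAG True
```

The same two calls can be applied to the complete sequence block above once it is pasted in as a string.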
 
Protein sequence
MNFLLSNSSK RRAYRWQQGF SILALSSIST FCTMAAPAIS TQATAAKTMA ANPNVAEGEI 
LPARHNDEQA NKFKPVVYQI FTRLYGNKNT TNKPWGTISE NGVGKFNDID DIALKSIKDL
GVTHVWYTGV PHHALIGDYS AIGVSHDDPD VVKGRAGSPY AVKDYYNVNP DLAVYPAKRL
QEFQALIERT HKQGLKVIID IVPNHVARNY HSITKPEGVR DFGEDDNQTL EYERHNNFYY
VTDKKQSSGF QVPDLPDTLK PLGGESHPLS DGQFEEIPAK WTGNGSRLAK PDMNDWYETV
KINYGVRPDG SHDFPALPPR YATLGAEQHY AFWQQHSHEL PNSWIKFNQI AQYWLAMGVD
GFRYDMAEMV PVEFWSYLNS HIKHSHPEAF ILAEVYNPAL YRDYIHLGKM DYLYDKVDLY
DTLKAIMAGQ KSTAQIAADQ AKVQDIDSHM LHFLENHDEQ RIANAAFLGV LTGNTSTDAV
DPRYALPAMV VSATLSTSPT LLYFGQEVGE AATQNLGFGH ASRTSIFDYA GVPAHQRWMN
QGKFDGGQST AAEVALRTYY QKLLNLSTGK NAPALLGKYH SLDAANRSAV SAAKASNKAS
NKTNNGSATG YDDSTFAFVR FEAHTANSKG QKLIIVSNFS QTQAKLFSLK LPKSLIAQWQ
LTDASYPLKD LLEEHTAQLI VERGEGQVQL QLAPLSSAIF ELVH