Gene Shewmr4_2291 details

Gene Information

Locus tag: Shewmr4_2291
Symbol:
ID: 4252862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2732611
End bp: 2733771
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 47%
IMG OID: 638118916
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_734419
Protein GI: 113970626
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
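
The length fields can be cross-checked against the coordinates. The following is a minimal sketch in Python, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2732611
end_bp = 2733771

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1161 bp
print(protein_length, "aa")  # 386 aa
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.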


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000437513
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATTTT ATGTGAAACA AGGCCAAGTC CCCCATAAGC GCCATATCGC ATTTGAGAAA 
GAAAACGGCG AGCTATACCG TGAGGAGCTG TTTTCAACCC ATGGTTTTTC CAATATTTAC
TCGAATAAAT ATCACCACAA TATGCCGACT AAGGCCTTGG AAGTGGCACC CTACCGCCTC
GGTCACGGTG CCCAATGGGA AGATTCATTA GTTCAAAATT ATAAATTGGA CTCTCGTACG
GCCGATCGCG AAGGCAACTT CTTTAGCGCC CGCAATAAAA TCTTTTATAA CAATGATGTG
GCCATTTATA CCGCAAAAGT GACTCAAGAC ACGCCGGAGT TTTACCGCAA TGCCTACGCC
GATGAAGTGG TGTTTGTACA TGAAGGTGAA GGTACGCTCT ACAGTGAATA TGGCACTCTA
GAGATCAAGA AATGGGACTA CTTAGTGATC CCACGCGGCA CCACACATCA GCTCAAATTC
AACGATTACA GTAATGTGCG CTTATTTGTG ATTGAAGCCT TTTCAATGGT GGAAGTGCCA
AAACATTTCC GTAATGAATA CGGTCAGTTA CTCGAGTCTG CACCCTATTG TGAACGCGAT
ATACGCACGC CCGTATTGCA AGATGCCGTG GTTGAACGTG GCGCCTTCCC GCTGGTGTGT
AAATTTGGTA ATAAGTACCA ACTGACTACC TTAGAGTGGC ATCCCTTTGA CCTTGTGGGT
TGGGACGGCT GTGTTTACCC CTGGGCATTT AACATCACCG AATACGCACC TAAAGTCGGC
AAAATTCACT TACCGCCTTC AGACCACTTA GTGTTTACCG CCCACAACTT TGTGGTGTGT
AACTTTGTGC CGCGTCCTTA TGACTTCCAC GAGCGTGCCA TTCCTGCGCC TTACTATCAC
AACAATATTG ATAGTGATGA AGTGCTGTAC TACGTCGACG GCGACTTTAT GAGTCGCACA
GGGATTGAAG CCGGTTACAT CACCCTACAT CAAAAAGGGG TAGCGCACGG TCCACAACCC
GGCCGCACCG AAGCCTCGAT AGGCAAAAAA GAAACCTATG AATATGCAGT GATGGTGGAC
ACCTTCGCCC CACTGAAATT AACCGAACAT GTGCAAAATT GCATGAGTAA AGACTACAAC
CGCTCTTGGC TAGAAAACTA A
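
The GC content field can be recomputed from the sequence text. The sketch below is one possible way to do it in Python; `gc_content` is a hypothetical helper name, and it assumes the sequence contains only A, C, G and T plus the spaces and line breaks used to group bases into blocks of ten.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks that group bases into blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# The first two blocks of the gene sequence, as a small usage example.
print(gc_content("ATGCCATTTT ATGTGAAACA"))  # 30.0
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field of the record.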
 
Protein sequence
MPFYVKQGQV PHKRHIAFEK ENGELYREEL FSTHGFSNIY SNKYHHNMPT KALEVAPYRL 
GHGAQWEDSL VQNYKLDSRT ADREGNFFSA RNKIFYNNDV AIYTAKVTQD TPEFYRNAYA
DEVVFVHEGE GTLYSEYGTL EIKKWDYLVI PRGTTHQLKF NDYSNVRLFV IEAFSMVEVP
KHFRNEYGQL LESAPYCERD IRTPVLQDAV VERGAFPLVC KFGNKYQLTT LEWHPFDLVG
WDGCVYPWAF NITEYAPKVG KIHLPPSDHL VFTAHNFVVC NFVPRPYDFH ERAIPAPYYH
NNIDSDEVLY YVDGDFMSRT GIEAGYITLH QKGVAHGPQP GRTEASIGKK ETYEYAVMVD
TFAPLKLTEH VQNCMSKDYN RSWLEN
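
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses plain Python rather than a bioinformatics library. `translate` and `CODON_TABLE` are hypothetical names. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits; because this gene starts with ATG, the sketch does not treat alternative start codons specially. It also assumes the input contains only A, C, G and T.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# These assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame nucleotide sequence, stopping at the first stop codon."""
    # Drop the spaces and line breaks that group bases into blocks of ten.
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCCATTTT ATGTGAAACA AGGCCAAGTC"))  # MPFYVKQGQV
# Last 21 bases of the gene sequence, ending in the TAA stop codon.
print(translate("CGCTCTTGGC TAGAAAACTA A"))           # RSWLEN
```

The two fragments match the first ten and last six residues of the protein sequence listed above.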