Gene Shewmr7_2363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr7_2363 
Symbol 
ID4256969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-7 
KingdomBacteria 
Replicon accessionNC_008322 
Strand
Start bp2799621 
End bp2800781 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content47% 
IMG OID638123033 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_738407 
Protein GI114047857 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.445339 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.275941 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATTTT ATGTGAAACA AGGCCAAGTC CCCCATAAGC GCCATATCGC ATTTGAGAAA 
GAAAACGGCG AGCTATACCG TGAGGAGCTG TTTTCAACCC ATGGTTTTTC CAATATTTAT
TCCAATAAAT ATCACCACAA TATGCCGACT AAGGCATTGG AAGTGGCGCC CTACCGCCTC
GGTCACGGTG CCCAATGGGA AGATTCATTA GTTCAAAATT ATAAATTGGA CTCTCGTACG
GCCGATCGTG AAGGCAACTT CTTTAGCGCC CGCAATAAAA TCTTTTATAA CAATGATGTG
GCTATTTATA CCGCAAAAGT GACTCAAGAC ACGTCGGAGT TTTACCGCAA TGCCTACGCC
GATGAAGTGG TGTTTGTGCA CGAAGGTGAA GGCACACTCT ACAGTGAATA TGGCACCCTA
GAGATCAAGA AATGGGACTA CTTAGTGATC CCACGCGGCA CCACACATCA GCTCAAATTC
AACGATTACA GTAATGTGCG CTTATTTGTG ATTGAAGCCT TTTCAATGGT GGAAGTGCCA
AAACATTTCC GTAATGAATA CGGTCAGTTA CTCGAGTCTG CTCCCTATTG TGAACGCGAT
CTACGCACGC CCGTATTGCA AGATGCCGTG GTTGAACGTG GCGCCTTCCC GCTGGTGTGT
AAATTTGGTG ATAAGTACCA ACTGACCACC TTAGAGTGGC ATCCCTTTGA CCTTGTGGGT
TGGGACGGCT GTGTTTACCC CTGGGCATTT AACATCACCG AATACGCACC TAAAGTCGGC
AAAATTCACT TACCGCCTTC AGACCACTTA GTGTTTACCG CCCACAACTT TGTGGTGTGT
AACTTTGTGC CGCGTCCTTA TGACTTCCAC GAGCGTGCCA TTCCTGCGCC TTACTATCAC
AACAATATTG ATAGTGATGA AGTGCTGTAC TACGTCGACG GTGACTTTAT GAGTCGTACA
GGGATTGAAG CCGGTTACAT CACCCTACAT CAAAAAGGGG TAGCGCACGG CCCACAACCC
GGCCGCACCG AAGCCTCGAT TGGCAAAAAA CAAACCTATG AATATGCAGT GATGGTGGAC
ACCTTCGCCC CACTGAAATT AACCGAACAT GTGCAAAATT GCATGAGTAA AGACTACAAC
CGCTCTTGGC TAGAAAACTA A
 
Protein sequence
MPFYVKQGQV PHKRHIAFEK ENGELYREEL FSTHGFSNIY SNKYHHNMPT KALEVAPYRL 
GHGAQWEDSL VQNYKLDSRT ADREGNFFSA RNKIFYNNDV AIYTAKVTQD TSEFYRNAYA
DEVVFVHEGE GTLYSEYGTL EIKKWDYLVI PRGTTHQLKF NDYSNVRLFV IEAFSMVEVP
KHFRNEYGQL LESAPYCERD LRTPVLQDAV VERGAFPLVC KFGDKYQLTT LEWHPFDLVG
WDGCVYPWAF NITEYAPKVG KIHLPPSDHL VFTAHNFVVC NFVPRPYDFH ERAIPAPYYH
NNIDSDEVLY YVDGDFMSRT GIEAGYITLH QKGVAHGPQP GRTEASIGKK QTYEYAVMVD
TFAPLKLTEH VQNCMSKDYN RSWLEN