Gene Information |
Locus tag | Shewmr4_2291 |
Symbol | |
ID | 4252862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 2732611 |
End bp | 2733771 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 638118916 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_734419 |
Protein GI | 113970626 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
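The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2732611
END_BP = 2733771


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
print(GENE_LEN, PROTEIN_LEN)  # prints: 1161 386
```

Under these assumptions the results agree with the Gene Length (1161 bp) and Protein Length (386 aa) rows.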
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000437513 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATTTT ATGTGAAACA AGGCCAAGTC CCCCATAAGC GCCATATCGC ATTTGAGAAA GAAAACGGCG AGCTATACCG TGAGGAGCTG TTTTCAACCC ATGGTTTTTC CAATATTTAC TCGAATAAAT ATCACCACAA TATGCCGACT AAGGCCTTGG AAGTGGCACC CTACCGCCTC GGTCACGGTG CCCAATGGGA AGATTCATTA GTTCAAAATT ATAAATTGGA CTCTCGTACG GCCGATCGCG AAGGCAACTT CTTTAGCGCC CGCAATAAAA TCTTTTATAA CAATGATGTG GCCATTTATA CCGCAAAAGT GACTCAAGAC ACGCCGGAGT TTTACCGCAA TGCCTACGCC GATGAAGTGG TGTTTGTACA TGAAGGTGAA GGTACGCTCT ACAGTGAATA TGGCACTCTA GAGATCAAGA AATGGGACTA CTTAGTGATC CCACGCGGCA CCACACATCA GCTCAAATTC AACGATTACA GTAATGTGCG CTTATTTGTG ATTGAAGCCT TTTCAATGGT GGAAGTGCCA AAACATTTCC GTAATGAATA CGGTCAGTTA CTCGAGTCTG CACCCTATTG TGAACGCGAT ATACGCACGC CCGTATTGCA AGATGCCGTG GTTGAACGTG GCGCCTTCCC GCTGGTGTGT AAATTTGGTA ATAAGTACCA ACTGACTACC TTAGAGTGGC ATCCCTTTGA CCTTGTGGGT TGGGACGGCT GTGTTTACCC CTGGGCATTT AACATCACCG AATACGCACC TAAAGTCGGC AAAATTCACT TACCGCCTTC AGACCACTTA GTGTTTACCG CCCACAACTT TGTGGTGTGT AACTTTGTGC CGCGTCCTTA TGACTTCCAC GAGCGTGCCA TTCCTGCGCC TTACTATCAC AACAATATTG ATAGTGATGA AGTGCTGTAC TACGTCGACG GCGACTTTAT GAGTCGCACA GGGATTGAAG CCGGTTACAT CACCCTACAT CAAAAAGGGG TAGCGCACGG TCCACAACCC GGCCGCACCG AAGCCTCGAT AGGCAAAAAA GAAACCTATG AATATGCAGT GATGGTGGAC ACCTTCGCCC CACTGAAATT AACCGAACAT GTGCAAAATT GCATGAGTAA AGACTACAAC CGCTCTTGGC TAGAAAACTA A
|
Protein sequence | MPFYVKQGQV PHKRHIAFEK ENGELYREEL FSTHGFSNIY SNKYHHNMPT KALEVAPYRL GHGAQWEDSL VQNYKLDSRT ADREGNFFSA RNKIFYNNDV AIYTAKVTQD TPEFYRNAYA DEVVFVHEGE GTLYSEYGTL EIKKWDYLVI PRGTTHQLKF NDYSNVRLFV IEAFSMVEVP KHFRNEYGQL LESAPYCERD IRTPVLQDAV VERGAFPLVC KFGNKYQLTT LEWHPFDLVG WDGCVYPWAF NITEYAPKVG KIHLPPSDHL VFTAHNFVVC NFVPRPYDFH ERAIPAPYYH NNIDSDEVLY YVDGDFMSRT GIEAGYITLH QKGVAHGPQP GRTEASIGKK ETYEYAVMVD TFAPLKLTEH VQNCMSKDYN RSWLEN
|
| |
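The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted gene sequence might be cleaned, summarized as GC content, and translated is shown below. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which match the standard code; it does not handle the table 11 rule that alternative start codons are read as methionine, which does not matter here because this gene starts with ATG. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical.

```python
# Codon table built from the NCBI layout: bases in T, C, A, G order.
# The amino-acid assignments below are shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGCCATTTT ATGTGAAACA AGGCCAAGTC"))  # prints: MPFYVKQGQV
```

The printed residues match the first block of the Protein sequence row. Passing the full Gene sequence value to `gc_percent` and `translate` would allow the GC content and Protein sequence rows to be compared in the same way.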