Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shew185_2653 |
Symbol | |
ID | 5371968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS185 |
Kingdom | Bacteria |
Replicon accession | NC_009665 |
Strand | + |
Start bp | 3160394 |
End bp | 3161554 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640830877 |
Product | homogentisate 12-dioxygenase |
Protein accession | YP_001366851 |
Protein GI | 153001170 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.51516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATTTT ATGTGAAACA AGGCCAGATC CCCCATAAGC GCCATATCGC ATTTGAAAAG GAGAACGGCG AGCTTTACCG TGAAGAGTTG TTCTCAACCC ATGGTTTTTC CAATATTTAT TCCAATAAAT ATCACCACAA CATGCCAACC AAAGCGTTGG AAGTGGCTCC TTACAGCCTA GGCCACGGTG CCAATTGGGA AGACTCGCTG GTCCAAAACT ATAAACTCGA TTCCCGCGAT GCCGACCGTG AAGGTAATTT TTTCAGCGCC CGTAACAAGA TTTTTTATAA CAATGACTTA GCACTTTATA CCGCCAAAGT CACCGCCGAT ACCGACGAGT TTTACCGTAA TGCCTATGCC GATGAGGTGC TTTTTGTCCA CGAAGGCGAA GGCACACTCT ACAGCGAGTA CGGCACGATT AAGGTCCAAA AGTGGGACTA TCTCGTGATC CCACGCGGCA CGACTTATCA ACTTAAATTC AACGATTACA GCAATGTGCG ACTTTTCGTT ATCGAGTCAT TCTCTATGGT GGAAGTGCCT AAGCATTTCC GTAACGAGTA TGGTCAGCTA TTAGAATCGG CACCTTACTG CGAGCGCGAT TTGCGCGTGC CCACATTGCA AGATGCTGTG GTTGAGCGCG GCGCCTTCCC CTTAATATGT AAGTTTGGCG ATAAGTACCA ACTCACCACA CTAGAATGGC ATCCCTTTGA TTTAGTGGGT TGGGATGGCT GTGCTTACCC TTGGGCATTC AACATTACTG AATACGCGCC AAAGGTGGGC AAAATCCATC TGCCGCCTTC TGATCACTTA GTCTTTACCG CCCATAACTT TGTGATTTGT AACTTCGTGC CGCGTCCCTA CGATTTCCAC CCGAAATCGA TTCCGGCGCC TTATTACCAC AACAATATCG ATAGCGACGA AGTCTTGTAT TACGTCGATG GCGACTTTAT GAGCCGCACG GGCATTGAAG CGGGTTATAT GACCTTACAT CAAAAAGGTG TGGCCCACGG CCCACAACCG GGCCGCACTG AAGCCTCGAT TGGCAAGAAA GAAACCTATG AATACGCAGT GATGGTCGAC ACCTTCGCCC CACTTAAATT AACCGAACAT GTACAAAATT GCATGAGCAA AGACTACAAC CGCTCTTGGC TAGAAGACTA G
|
Protein sequence | MPFYVKQGQI PHKRHIAFEK ENGELYREEL FSTHGFSNIY SNKYHHNMPT KALEVAPYSL GHGANWEDSL VQNYKLDSRD ADREGNFFSA RNKIFYNNDL ALYTAKVTAD TDEFYRNAYA DEVLFVHEGE GTLYSEYGTI KVQKWDYLVI PRGTTYQLKF NDYSNVRLFV IESFSMVEVP KHFRNEYGQL LESAPYCERD LRVPTLQDAV VERGAFPLIC KFGDKYQLTT LEWHPFDLVG WDGCAYPWAF NITEYAPKVG KIHLPPSDHL VFTAHNFVIC NFVPRPYDFH PKSIPAPYYH NNIDSDEVLY YVDGDFMSRT GIEAGYMTLH QKGVAHGPQP GRTEASIGKK ETYEYAVMVD TFAPLKLTEH VQNCMSKDYN RSWLED
|
| |