Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_1732 |
Symbol | |
ID | 7088266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 2042587 |
End bp | 2043747 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643460636 |
Product | homogentisate 12-dioxygenase |
Protein accession | YP_002357660 |
Protein GI | 217972909 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00561061 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCATTTT ATGTGAAACA AGGCCAGATC CCCCGTAAGC GCCATATCGC ATTTGAAAAG GAAAACGGCG AGCTTTACCG TGAAGAGTTG TTCTCAACCC ATGGTTTTTC CAATATTTAT TCCAATAAAT ATCACCACAA CATGCCAACC AAAGCGTTGG AAGTGGCTCC TTACAGCCTA GGCCACGGTG CCAATTGGGA AGACTCACTG GTCCAAAACT ATAAACTCGA TTCCCGCGAT GCCGACCGTG AAGGTAATTT TTTCAGCGCC CGTAACAAGA TTTTTTATAA CAATGACTTA GCACTGTATA CCGCCAAAGT CACCGCCGAT ACCAACGAGT TTTACCGTAA CGCCTATGCC GATGAAGTGG TTTTTGTTCA CGAAGGCGAA GGCACACTCT ACAGCGAGTA CGGCACGATT AAGGTCCAAA AGTGGGACTA TCTCGTGATC CCACGTGGCA CGACTTATCA ACTTAAATTC AACGATTACA GCAATGTGCG ACTTTTCGTT ATCGAGTCAT TCTCTATGGT GGAAGTGCCT AAGCATTTCC GTAACGAGTA TGGTCAGCTA TTAGAATCGG CACCTTACTG CGAGCGCGAT TTGCGCGTGC CCACATTGCA AGATGCTGTG GTTGAGCGCG GCGCCTTCCC TTTAGTGTGT AAGTTTGGCG ATAAGTACCA ACTCACCACA CTTGAATGGC ATCCCTTTGA TTTAGTGGGT TGGGATGGCT GTGTTTACCC TTGGGCATTC AACATTACTG AATACGCGCC AAAGGTGGGC AAAATCCATC TGCCGCCTTC TGATCACTTG GTCTTTACTG CCCATAACTT TGTGATTTGT AACTTCGTGC CGCGCCCTTA CGATTTCCAC CCGAAATCGA TTCCTGCGCC TTATTACCAC AACAACATCG ATAGCGACGA AGTTTTGTAT TACGTCGATG GCGACTTTAT GAGCCGCACG GGCATTGAAG CGGGTTACAT GACCTTACAT CAAAAAGGTG TGGCCCACGG CCCACAACCG GGTCGCACTG AAGCCTCGAT TGGCAAGAAA GAAACCTATG AATACGCAGT GATGGTCGAC ACGTTCGCCC CACTTAAATT AACCGAACAT GTACAAAATT GCATGAGCAA AGACTACAAC CGCTCTTGGC TAGAAGACTA G
|
Protein sequence | MPFYVKQGQI PRKRHIAFEK ENGELYREEL FSTHGFSNIY SNKYHHNMPT KALEVAPYSL GHGANWEDSL VQNYKLDSRD ADREGNFFSA RNKIFYNNDL ALYTAKVTAD TNEFYRNAYA DEVVFVHEGE GTLYSEYGTI KVQKWDYLVI PRGTTYQLKF NDYSNVRLFV IESFSMVEVP KHFRNEYGQL LESAPYCERD LRVPTLQDAV VERGAFPLVC KFGDKYQLTT LEWHPFDLVG WDGCVYPWAF NITEYAPKVG KIHLPPSDHL VFTAHNFVIC NFVPRPYDFH PKSIPAPYYH NNIDSDEVLY YVDGDFMSRT GIEAGYMTLH QKGVAHGPQP GRTEASIGKK ETYEYAVMVD TFAPLKLTEH VQNCMSKDYN RSWLED
|
| |