Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal_2615 |
Symbol | |
ID | 4842933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS155 |
Kingdom | Bacteria |
Replicon accession | NC_009052 |
Strand | + |
Start bp | 3058300 |
End bp | 3059460 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640119852 |
Product | homogentisate 12-dioxygenase |
Protein accession | YP_001050977 |
Protein GI | 126174828 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0308121 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATTTT ATGTGAAACA AGGCCAGATC CCCCATAAGC GCCATATCGC ATTTGAAAAG GAAAACGGCG AGCTTTACCG TGAAGAGTTG TTCTCAACCC ATGGTTTTTC CAATATTTAT TCCAATAAAT ATCACCACAA CATGCCAACC AAAGCGTTGG AAGTAGCACC TTACAACCTA GGCCACGGTG CCAATTGGGA AGACTCGCTG GTCCAAAACT ATAAACTCGA TTCCCGCGAT GCCGACCGTG AAGGTAATTT TTTCAGCGCC CGTAACAAGA TTTTTTATAA CAATGACTTA GCACTTTATA CCGCCAAAGT CACCGCCGAT ACCGACGAGT TTTACCGTAA TGCCTATGCC GATGAGGTGG TTTTTGTCCA CGAAGGCGAA GGCACACTCT ACAGCGAGTA CGGCACGATT AAGGTTCAAA AGTGGGACTA TCTCGTGATC CCACGCGGCA CGACTTATCA ACTTAAATTC AACGATTACA GCAATGCGCG ACTTTTCGTT ATCGAGTCAT TCTCTATGGT GGAAGTGCCT AAGCATTTCC GTAACGAGTA TGGTCAGCTA TTAGAATCGG CACCTTATTG CGAGCGCGAT TTGCGCGTGC CCACATTGCA AGATGCTGTG GTTGAGCGCG GCGCCTTCCC CTTAGTGTGT AAGTTTGGCG ATAAATACCA ACTCACCACA CTTGAATGGC ATCCCTTTGA TTTAGTGGGT TGGGATGGCT GTGTTTACCC TTGGGCATTC AACATTACTG AATACGCGCC AAAGGTGGGC AAAATCCATC TGCCGCCTTC TGATCACTTA GTCTTTACTG CCCATAACTT TGTGATTTGT AACTTCGTGC CGCGCCCTTA CGATTTCCAC CCGAAATCGA TTCCAGCGCC TTATTACCAC AACAATATCG ATAGCGACGA AGTTTTGTAT TACGTCGATG GCGACTTTAT GAGCCGCACG GGCATTGAAG CGGGTTACAT GACCTTACAT CAAAAAGGTG TGGCCCACGG CCCACAACCG GGTCGCACTG AAGCCTCGAT TGGCAAGAAA GAAACCTATG AATACGCAGT GATGGTGGAC ACGTTCGCCC CACTTAAATT AACCGAACAT GTACAAAATT GCATGAGCAA AGACTACAAC CGCTCTTGGC TAGAAGACTA G
|
Protein sequence | MPFYVKQGQI PHKRHIAFEK ENGELYREEL FSTHGFSNIY SNKYHHNMPT KALEVAPYNL GHGANWEDSL VQNYKLDSRD ADREGNFFSA RNKIFYNNDL ALYTAKVTAD TDEFYRNAYA DEVVFVHEGE GTLYSEYGTI KVQKWDYLVI PRGTTYQLKF NDYSNARLFV IESFSMVEVP KHFRNEYGQL LESAPYCERD LRVPTLQDAV VERGAFPLVC KFGDKYQLTT LEWHPFDLVG WDGCVYPWAF NITEYAPKVG KIHLPPSDHL VFTAHNFVIC NFVPRPYDFH PKSIPAPYYH NNIDSDEVLY YVDGDFMSRT GIEAGYMTLH QKGVAHGPQP GRTEASIGKK ETYEYAVMVD TFAPLKLTEH VQNCMSKDYN RSWLED
|
| |