Gene Information |
Locus tag | Sbal223_1731 |
Symbol | |
ID | 7088265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 2041449 |
End bp | 2042489 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643460635 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_002357659 |
Protein GI | 217972908 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00206644 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGGCAAGCG AACTAAATCC ACTAGGCTTA TTAGGTATCG AATTCACTGA ATTCGCCAGC CCTGATACTG ATTTTATGCA CAAAGTCTTT ATCGACTTTG GTTTTTCACT GCTGAAAAAA GCCAAGAACA AAAACATTCT GTACTACAAA CAGAATGACA TTAATTTTCT GCTCAACACT GAGCGCGAAG GTTTCTCAGC TAAGTTTGCT AAATCCCACG GTCCAGCCAT TTGTTCTATG GGCTGGCGGG TAGAAGATGC TGGCTTTGCC TATCGCGTTG CCGTTGAACG TGGTGCAAAA CCCGCCGATG ATGCCAATAA AGATCTGCCT TATCCTGCGA TTTACGGCAT TGGCGACAGC TTGATTTATT TCATCGACAC CTTTGGCGCA GAAGACAATA TCTATGCTGC TGATTTTGAA GACTTAGATG AGCAAGTCAT CACCCAAGAG AAAGGCTTTA TCGAAGTCGA TCACTTAACC AACAACGTCT ACAAAGGCAC AATGGAACAT TGGGCTAACT TCTATAAAAA CATTTTTGGT TTTACTGAAG TGCGCTACTT TGACATCAGC GGCGTACAAA CTGCGCTAGT GTCATACGCC CTGCGTTCGC CCGATGGCAG CTTCTGTATT CCGATTAACG AAGGTAAAGG CAGCGATAAG AACCAAATCG ATGAATACCT GAAGGAATAC AATGGCCCAG GTGTGCAACA TTTAGCCTTT AGAAGCCGCG ATATCGTAAA ATCTCTGGAT GCGATGGAAG GCAGTTCAAT CCAATGCTTA GACATTATCC CTGAATACTA CGACACCATA TTTGAGAAGT TGCCTCAAGT GACTGAGAAT CGTGAACGCA TCAAACATCA CCAAATTTTG GTAGATGGCG ATGAATCCGG TTATTTACTG CAGATTTTCA CTAAAAACCT GTTCGGACCG ATTTTTATCG AAATAATCCA ACGCAAAAAT AACTTAGGTT TTGGTGAAGG CAACTTTACT GCCCTGTTTG AATCGATTGA ACGGGATCAA ATGCGCCGCG GCGTACTGTA A
Protein sequence | MASELNPLGL LGIEFTEFAS PDTDFMHKVF IDFGFSLLKK AKNKNILYYK QNDINFLLNT EREGFSAKFA KSHGPAICSM GWRVEDAGFA YRVAVERGAK PADDANKDLP YPAIYGIGDS LIYFIDTFGA EDNIYAADFE DLDEQVITQE KGFIEVDHLT NNVYKGTMEH WANFYKNIFG FTEVRYFDIS GVQTALVSYA LRSPDGSFCI PINEGKGSDK NQIDEYLKEY NGPGVQHLAF RSRDIVKSLD AMEGSSIQCL DIIPEYYDTI FEKLPQVTEN RERIKHHQIL VDGDESGYLL QIFTKNLFGP IFIEIIQRKN NLGFGEGNFT ALFESIERDQ MRRGVL
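The length fields in this record can be cross-checked against the coordinates and the sequence text. The following is a minimal sketch, not part of the database's own tooling: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(seq: str) -> str:
    """Remove the spaces used to group the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = clean_sequence(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length(2041449, 2042489)
    print(length)                  # 1041
    print(protein_length(length))  # 346
```

Passing the Gene sequence field to `gc_percent` gives a value that can be compared with the GC content row, and `len(clean_sequence(...))` can be compared with the Gene Length row.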