Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal195_2729 |
Symbol | |
ID | 5754502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | + |
Start bp | 3235786 |
End bp | 3236826 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641289037 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001555157 |
Protein GI | 160875841 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.238674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.243491 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGCG AACTAAATCC ACTGGGCTTA TTAGGTATCG AATTCACTGA ATTCGCCAGC CCTGATACTG ATTTTATGCA CAAAGTCTTT ATAGACTTTG GTTTTTCACT GCTGAAAAAA GCCAAGAACA AAAACATTCT GTACTACAAA CAGAATGACA TTAATTTTCT GCTCAACACT GAGCGCGAAG GTTTCTCTGC TAAGTTTGCT AAATCCCACG GCCCAGCCAT TTGTTCTATG GGCTGGCGGG TAGAAGATGC CGGCTTTGCC TATCGCGTTG CCGTTGAACG TGGTGCAAAA CCCGCCGATG ATGCCAATAA AGATCTGCCT TATCCTGCGA TTTACGGCAT TGGCGACAGC TTGATTTATT TCATCGACAC CTTTGGCGCA GATAACAATA TCTATGCCGC TGATTTTGAA GACTTAGACG AGCAAGTGAT CACCCAAGAG AAAGGCTTTA TCGAAGTCGA TCACTTAACC AACAACGTTT ACAAAGGCAC AATGGAACAT TGGGCTAACT TCTATAAAAA CATTTTTGGT TTTACTGAAG TGCGCTACTT TGACATCAGC GGCGTACAAA CCGCGCTAGT GTCCTACGCC CTGCGTTCGC CCGATGGCAG CTTCTGTATT CCGATTAACG AAGGTAAAGG CAGCGATAAG AACCAAATCG ATGAATACCT GAAGGAATAC AATGGCCCAG GTGTGCAGCA TTTAGCCTTT AGAAGTCGCG ATATCGTAAA ATCTCTGGAT GCGATGGAAG GCAGTTCAAT CCAATGCTTA GACATTATCC CTGAATACTA CGACACCATA TTTGAGAAGT TACCGCAGGT GACTGAGAAT CGTGAGCGCA TCAAACATCA CCAAATTTTG GTGGATGGCG ATGAATCCGG TTATTTACTG CAAATTTTCA CTAAAAACCT GTTCGGACCG ATTTTTATCG AAATCATCCA ACGCAAGAAT AACTTAGGTT TTGGTGAAGG CAACTTTACT GCCCTGTTTG AATCGATTGA GCGGGATCAA ATGCGTCGCG GCGTACTGTA A
|
Protein sequence | MASELNPLGL LGIEFTEFAS PDTDFMHKVF IDFGFSLLKK AKNKNILYYK QNDINFLLNT EREGFSAKFA KSHGPAICSM GWRVEDAGFA YRVAVERGAK PADDANKDLP YPAIYGIGDS LIYFIDTFGA DNNIYAADFE DLDEQVITQE KGFIEVDHLT NNVYKGTMEH WANFYKNIFG FTEVRYFDIS GVQTALVSYA LRSPDGSFCI PINEGKGSDK NQIDEYLKEY NGPGVQHLAF RSRDIVKSLD AMEGSSIQCL DIIPEYYDTI FEKLPQVTEN RERIKHHQIL VDGDESGYLL QIFTKNLFGP IFIEIIQRKN NLGFGEGNFT ALFESIERDQ MRRGVL
|
| |