Gene Sbal223_1731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_1731 
Symbol 
ID7088265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp2041449 
End bp2042489 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content43% 
IMG OID643460635 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_002357659 
Protein GI217972908 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00206644 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAAGCG AACTAAATCC ACTAGGCTTA TTAGGTATCG AATTCACTGA ATTCGCCAGC 
CCTGATACTG ATTTTATGCA CAAAGTCTTT ATCGACTTTG GTTTTTCACT GCTGAAAAAA
GCCAAGAACA AAAACATTCT GTACTACAAA CAGAATGACA TTAATTTTCT GCTCAACACT
GAGCGCGAAG GTTTCTCAGC TAAGTTTGCT AAATCCCACG GTCCAGCCAT TTGTTCTATG
GGCTGGCGGG TAGAAGATGC TGGCTTTGCC TATCGCGTTG CCGTTGAACG TGGTGCAAAA
CCCGCCGATG ATGCCAATAA AGATCTGCCT TATCCTGCGA TTTACGGCAT TGGCGACAGC
TTGATTTATT TCATCGACAC CTTTGGCGCA GAAGACAATA TCTATGCTGC TGATTTTGAA
GACTTAGATG AGCAAGTCAT CACCCAAGAG AAAGGCTTTA TCGAAGTCGA TCACTTAACC
AACAACGTCT ACAAAGGCAC AATGGAACAT TGGGCTAACT TCTATAAAAA CATTTTTGGT
TTTACTGAAG TGCGCTACTT TGACATCAGC GGCGTACAAA CTGCGCTAGT GTCATACGCC
CTGCGTTCGC CCGATGGCAG CTTCTGTATT CCGATTAACG AAGGTAAAGG CAGCGATAAG
AACCAAATCG ATGAATACCT GAAGGAATAC AATGGCCCAG GTGTGCAACA TTTAGCCTTT
AGAAGCCGCG ATATCGTAAA ATCTCTGGAT GCGATGGAAG GCAGTTCAAT CCAATGCTTA
GACATTATCC CTGAATACTA CGACACCATA TTTGAGAAGT TGCCTCAAGT GACTGAGAAT
CGTGAACGCA TCAAACATCA CCAAATTTTG GTAGATGGCG ATGAATCCGG TTATTTACTG
CAGATTTTCA CTAAAAACCT GTTCGGACCG ATTTTTATCG AAATAATCCA ACGCAAAAAT
AACTTAGGTT TTGGTGAAGG CAACTTTACT GCCCTGTTTG AATCGATTGA ACGGGATCAA
ATGCGCCGCG GCGTACTGTA A
 
Protein sequence
MASELNPLGL LGIEFTEFAS PDTDFMHKVF IDFGFSLLKK AKNKNILYYK QNDINFLLNT 
EREGFSAKFA KSHGPAICSM GWRVEDAGFA YRVAVERGAK PADDANKDLP YPAIYGIGDS
LIYFIDTFGA EDNIYAADFE DLDEQVITQE KGFIEVDHLT NNVYKGTMEH WANFYKNIFG
FTEVRYFDIS GVQTALVSYA LRSPDGSFCI PINEGKGSDK NQIDEYLKEY NGPGVQHLAF
RSRDIVKSLD AMEGSSIQCL DIIPEYYDTI FEKLPQVTEN RERIKHHQIL VDGDESGYLL
QIFTKNLFGP IFIEIIQRKN NLGFGEGNFT ALFESIERDQ MRRGVL