Gene Shew_2155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShew_2155 
Symbol 
ID4923327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella loihica PV-4 
KingdomBacteria 
Replicon accessionNC_009092 
Strand
Start bp2499904 
End bp2500944 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content51% 
IMG OID640163740 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001094280 
Protein GI127513083 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.759615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.638305 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGCG AACAAAATCC ACTGGGATTA CTGGGCATCG AATTTACCGA GTTTTGCACC 
CCAGATCTCG ACTTTATGCA CAAGGTCTTT ATCGACTTTG GTTTCTCTAA GCTGAAGAAG
CATAAGCAAA AAGATATCGT TTACTACAAG CAAAACGACA TCAACTTCCT GCTAAACAAC
GAGAAGAGCG GCTTCTCTGC CGAGTTTGCC AAGAAGCACG GCGCTGCCAT CAGCTCTATG
GGCTGGCGCG TAGAAGATGC CAAATTCGCC TTCGAGGGTG CGGTTGCCCG TGGCGCCAAG
CCTGCAGGTG ATGAGGTGAA AGACCTGCCC TATCCGGCCA TCTACGGCAT CGGCGATAGC
CTGATCTACT TTATCGACAC CTTCGGCGCG GACAACAACA TCTACGCCAC CGACTTTGTC
GATCTTGAAA ACCCAGAAAT CGTACAAGAG AAAGGCTTTA TCGAAGTCGA CCACCTGACC
AACAACGTCT ACAAAGGCAC CATGGAGCAT TGGTCGAACT TCTACAAAGA TATCTTCGGC
TTTACCGAGG TACGCTACTT CGACATCAAG GGTTCGCAGA CGGCACTGAT CTCTTATGCC
CTGCGTTCAC CCGATGGCAG CTTCTGCATC CCAATCAACG AAGGTAAAGG CGACGACAGA
AACCAGATCG ACGAGTATCT GAGAGAATAC GACGGCCCAG GCGTGCAGCA TCTGGCGTTC
CGCAGCCGCG ACATCGTTGC CTCATTGGAT GCGATGGAAG GTAGCTCGAT TCAGACCCTG
GATATCATCC CTGAATACTA CGACACCATC TTCGATAAGC TGCCACAGGT AACCGAAGAC
AGAGAGCGCA TCAAGCATCA CCAGATCCTG GTGGACGGTG ACGAGGACGG CTATCTGCTG
CAGATCTTCA CCAAGAACCT GTTTGGTCCT ATCTTCATCG AGATCATTCA GCGTAAGAAC
AACCTGGGCT TCGGCGAAGG TAACTTCAAG GCACTGTTCG AATCAATCGA GCGTGACCAG
GTGCGTCGCG GCGTACTCTA A
 
Protein sequence
MASEQNPLGL LGIEFTEFCT PDLDFMHKVF IDFGFSKLKK HKQKDIVYYK QNDINFLLNN 
EKSGFSAEFA KKHGAAISSM GWRVEDAKFA FEGAVARGAK PAGDEVKDLP YPAIYGIGDS
LIYFIDTFGA DNNIYATDFV DLENPEIVQE KGFIEVDHLT NNVYKGTMEH WSNFYKDIFG
FTEVRYFDIK GSQTALISYA LRSPDGSFCI PINEGKGDDR NQIDEYLREY DGPGVQHLAF
RSRDIVASLD AMEGSSIQTL DIIPEYYDTI FDKLPQVTED RERIKHHQIL VDGDEDGYLL
QIFTKNLFGP IFIEIIQRKN NLGFGEGNFK ALFESIERDQ VRRGVL