Gene Shewmr4_2292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2292 
Symbol 
ID4252863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2733885 
End bp2734925 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content45% 
IMG OID638118917 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_734420 
Protein GI113970627 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000761177 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGCG AACTCAATCC ACTGGGCTTA CTCGGTATCG AATTTACCGA ATTTGCAAGC 
CCCGATAGCG ATTTTATGCA CAAAGTGTTT ATCGACTTTG GTTTTTCACT GCTGAAAAAA
GCCAAAAATA AAGACATTTT GTACTACAAA CAAAATGACA TTAACTTCCT GCTCAATAAT
GAGCGCGAAG GGTTTTCAGC AGAATTTGCC AAATCCCACG GCCCTGCTAT TAGCTCTATG
GGCTGGCGCG TAGAAGATGC CAGCTTTGCT CACCGTGTGG CGGTAGAGCG CGGCGCCAAA
GCCGTGGCTG ATTCGGCCAA GGATCTGCCT TATCCAGCCA TTTATGGTAT TGGTGACAGC
TTAATTTATT TTATCGACAC CTTCGGTGCC GACAACAATA TCTATGCAAC TGACTTTGAA
GACTTAAGTG AGCCTGTGAT CACTCAAGAG AAAGGCTTTG TCGAAGTAGA CCACTTAACC
AATAACGTCT ACAAAGGCAC CATGGAACAT TGGGCGAACT TCTACAAAAA CATCTTTGGT
TTTACCGAAG TGCGCTATTT CGACATCAGC GGCGTACAAA CCGCATTGGT GTCTTACGCC
CTGCGCTCAC CCGATGGCAG CTTCTGTATC CCGATTAACG AAGGTAAAGG CAACGATAAG
AACCAAATCG ATGAATACCT GAAGGAATAC AATGGTCCAG GTGTACAACA CTTAGCCTTT
AGAAGCCGTG ATATCGTTAA ATCCTTGGAT GCGATGGAAG GTAGCTCGAT TCAATGCTTG
GATATTATTC CCGAATATTA CGACACCATT TTCGATAAAG TCCCACAAGT GACCGAAAAC
CGTGAGCGTA TCAAGCATCA CCAAATTTTG GTGGACGGCG ACGAGTCAGG CTATTTATTA
CAGATCTTCA CTAAAAACCT GTTTGGCCCG ATCTTTATCG AAATCATTCA ACGTAAGAAC
AACTTAGGTT TTGGCGAAGG TAACTTCACC GCCCTGTTCC AATCGATTGA ACGTGACCAA
ATGCGCCGCG GCGTGCTGTA A
 
Protein sequence
MASELNPLGL LGIEFTEFAS PDSDFMHKVF IDFGFSLLKK AKNKDILYYK QNDINFLLNN 
EREGFSAEFA KSHGPAISSM GWRVEDASFA HRVAVERGAK AVADSAKDLP YPAIYGIGDS
LIYFIDTFGA DNNIYATDFE DLSEPVITQE KGFVEVDHLT NNVYKGTMEH WANFYKNIFG
FTEVRYFDIS GVQTALVSYA LRSPDGSFCI PINEGKGNDK NQIDEYLKEY NGPGVQHLAF
RSRDIVKSLD AMEGSSIQCL DIIPEYYDTI FDKVPQVTEN RERIKHHQIL VDGDESGYLL
QIFTKNLFGP IFIEIIQRKN NLGFGEGNFT ALFQSIERDQ MRRGVL