Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_2292 |
Symbol | |
ID | 4252863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 2733885 |
End bp | 2734925 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 638118917 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_734420 |
Protein GI | 113970627 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000761177 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGCG AACTCAATCC ACTGGGCTTA CTCGGTATCG AATTTACCGA ATTTGCAAGC CCCGATAGCG ATTTTATGCA CAAAGTGTTT ATCGACTTTG GTTTTTCACT GCTGAAAAAA GCCAAAAATA AAGACATTTT GTACTACAAA CAAAATGACA TTAACTTCCT GCTCAATAAT GAGCGCGAAG GGTTTTCAGC AGAATTTGCC AAATCCCACG GCCCTGCTAT TAGCTCTATG GGCTGGCGCG TAGAAGATGC CAGCTTTGCT CACCGTGTGG CGGTAGAGCG CGGCGCCAAA GCCGTGGCTG ATTCGGCCAA GGATCTGCCT TATCCAGCCA TTTATGGTAT TGGTGACAGC TTAATTTATT TTATCGACAC CTTCGGTGCC GACAACAATA TCTATGCAAC TGACTTTGAA GACTTAAGTG AGCCTGTGAT CACTCAAGAG AAAGGCTTTG TCGAAGTAGA CCACTTAACC AATAACGTCT ACAAAGGCAC CATGGAACAT TGGGCGAACT TCTACAAAAA CATCTTTGGT TTTACCGAAG TGCGCTATTT CGACATCAGC GGCGTACAAA CCGCATTGGT GTCTTACGCC CTGCGCTCAC CCGATGGCAG CTTCTGTATC CCGATTAACG AAGGTAAAGG CAACGATAAG AACCAAATCG ATGAATACCT GAAGGAATAC AATGGTCCAG GTGTACAACA CTTAGCCTTT AGAAGCCGTG ATATCGTTAA ATCCTTGGAT GCGATGGAAG GTAGCTCGAT TCAATGCTTG GATATTATTC CCGAATATTA CGACACCATT TTCGATAAAG TCCCACAAGT GACCGAAAAC CGTGAGCGTA TCAAGCATCA CCAAATTTTG GTGGACGGCG ACGAGTCAGG CTATTTATTA CAGATCTTCA CTAAAAACCT GTTTGGCCCG ATCTTTATCG AAATCATTCA ACGTAAGAAC AACTTAGGTT TTGGCGAAGG TAACTTCACC GCCCTGTTCC AATCGATTGA ACGTGACCAA ATGCGCCGCG GCGTGCTGTA A
|
Protein sequence | MASELNPLGL LGIEFTEFAS PDSDFMHKVF IDFGFSLLKK AKNKDILYYK QNDINFLLNN EREGFSAEFA KSHGPAISSM GWRVEDASFA HRVAVERGAK AVADSAKDLP YPAIYGIGDS LIYFIDTFGA DNNIYATDFE DLSEPVITQE KGFVEVDHLT NNVYKGTMEH WANFYKNIFG FTEVRYFDIS GVQTALVSYA LRSPDGSFCI PINEGKGNDK NQIDEYLKEY NGPGVQHLAF RSRDIVKSLD AMEGSSIQCL DIIPEYYDTI FDKVPQVTEN RERIKHHQIL VDGDESGYLL QIFTKNLFGP IFIEIIQRKN NLGFGEGNFT ALFQSIERDQ MRRGVL
|
| |