Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcer98_0228 |
Symbol | |
ID | 5344609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cytotoxicus NVH 391-98 |
Kingdom | Bacteria |
Replicon accession | NC_009674 |
Strand | + |
Start bp | 238095 |
End bp | 239213 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640837790 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001373589 |
Protein GI | 152974072 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAAC AATCTATGGA TGCGTTGACT GCACAAATGG AGGATTTTTT CCCAGTACGT GATGTAGACC ATGTAGAATT TTATGTAGGA AATGCAAAGC AGGCAAGTTA TTATCTTGCA AGAGCATTTG GCTTCAAAAT CGTAGCTTAT TCTGGATTAG AGACAGGAAA CCGCGAAAAG GTATCTTATG TTCTTGTACA AAAAAATATG CGCTTTGTTG TGTCTGGAGC ATTAAATAGT GATAATCGTA TCGCAGAATT TGTAAAACTT CATGGAGATG GGGTAAAAGA CGTTGCGCTA CTTGTTGACG ATGTCGAGAA AGCATACTCA GAAGCTGTGA AACGAGGTGC TGTTGCAATT TCCCCTCCGG AAGAGTTAAC AGATGAGAAT GGAGCAATAA AAAAAGCAGT AATTGGTACG TATGGGGATA CAATTCATAC GCTTGTAGAG CGTAAAAACT ATAAAGGAAC ATTTATGCCT GGCTATGAAA AAGCTGAGTT CGAAGTCCCA TCAGAAGAAG CAGGTTTAAT TGCTGTAGAC CATGTTGTCG GTAACGTTGA GAAAATGGAA GAATGGGTTA GCTATTACGA AAATGTTATG GGCTTTAAAC AAATGATTCA TTTTGATGAT GAAGATATTA GTACGGAATA TTCTGCGTTG ATGTCAAAAG TTATGACAAA TGGAAGTCGT ATTAAATTCC CGATTAATGA GCCTGCAGAA GGGAAACGAA AGTCGCAAAT ACAAGAATAC TTAGAGTTTT ACAATGGTGA GGGTGTACAA CACCTTGCAT TGTTAACAAA TGATATTGTC AAAACAGTTG AAGCGCTTCG TGCCAATGGA GTTGAATTTT TAGACACACC AGATACGTAT TATGAAGAAT TAACGGCACG TGTCGGAGAA ATCGATGAGG AAGTTGAAAA ATTAAAAGAG CTTAAGATTT TAGTAGACCG TGATGATGAA GGATACTTAC TACAAATTTT TACAAAACCG ATTGTAGATC GTCCAACCTT ATTTATTGAA ATTATTCAGC GTAAAGGATC TCGTGGATTC GGAGAAGGGA ACTTTAAGGC GTTATTTGAA TCAATTGAAA GAGAGCAAGA ACGTCGCGGA AACCTATAA
|
Protein sequence | MKQQSMDALT AQMEDFFPVR DVDHVEFYVG NAKQASYYLA RAFGFKIVAY SGLETGNREK VSYVLVQKNM RFVVSGALNS DNRIAEFVKL HGDGVKDVAL LVDDVEKAYS EAVKRGAVAI SPPEELTDEN GAIKKAVIGT YGDTIHTLVE RKNYKGTFMP GYEKAEFEVP SEEAGLIAVD HVVGNVEKME EWVSYYENVM GFKQMIHFDD EDISTEYSAL MSKVMTNGSR IKFPINEPAE GKRKSQIQEY LEFYNGEGVQ HLALLTNDIV KTVEALRANG VEFLDTPDTY YEELTARVGE IDEEVEKLKE LKILVDRDDE GYLLQIFTKP IVDRPTLFIE IIQRKGSRGF GEGNFKALFE SIEREQERRG NL
|
| |