Gene Information |
Locus tag | Avi_3954 |
Symbol | hemK |
ID | 7387302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 3330031 |
End bp | 3330912 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643652692 |
Product | protoporphyrinogen oxidase |
Protein accession | YP_002550867 |
Protein GI | 222149910 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG2890] Methylase of polypeptide chain release factors |
TIGRFAM ID | [TIGR00536] HemK family putative methylases; [TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific |
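Note: the coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon, which is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(3330031, 3330912)
print(length_bp)                  # 882
print(protein_length(length_bp))  # 293
```

Under these assumptions the computed values agree with the Gene Length (882 bp) and Protein Length (293 aa) fields.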
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGGCG CACCGGTGAC GCTGAAGCAG GCGATCACTG CCGCTCGTCT TCGGTTTGCG CAAGCGGGTA TCGCCGATGC GCCACGCGAT GCGCGGACCT TGATCGCCGG TCTTCTTGGA CTGACGTCGA CCGATCTTAT CGTGCAGGAC AACCGGGTTT TGAGCGCCGA GGAGACGAGC CTGATCGAGA CGGCAGTCGA GCGACGTCTG TTGTTTGAGC CTGTCCATCG GATTTTGGGA CGGCGGGCCT TTTACGGTCT GGAACTGGCC CTGTCGCCAG CGACATTGGA ACCGCGACCC GATACGGAAA TCCTGATTGA GCGTGTTTTG CCGCATCTTC ATGCCATGGT GGCGAAGAAT GGCAGCGTGC GCCTGCTGGA TATGGGGACG GGCACCGGGG CAATTGCCCT GGCGCTTTTG CAGGAATGTC CGGGTACTAC AGCATTGGCG ACCGATATTT CCGCCGAAGC GTTGGCCATG GCGCGGCAGA ACGCCGCAGC CAACTCACTT TCCGACCGGT TCGAGACGCT GCAAAGCCAT TGGTACGAGG CGTTATCGGG CCGTTTTGAC ATCATTCTGT CGAATCCGCC CTATATTGTC AGCGATGTGA TTAAAGATCT GGCCCCTGAC GTTCGGCTTT ATGATCCTGC CGTAGCACTT GACGGTGGCG ACGATGGGCT GGATGCTTAC CGTGCAATTG CCGCTGGCGC CGCCGATTTT CTGAAACCGG GCGGTCTGGT TGGGGTGGAA ATCGGTTACG ATCAGGCGAT GGCGGTGACG CAACTTTTCG CAAACAATAG CTTCGTTCTT GTGGAAAGCG CAAAGGATCA CGGTGATAAT GATCGGATTC TGTTGTTTGC TCAGACCGGA CCGCAACTAT AG
Protein sequence | MTGAPVTLKQ AITAARLRFA QAGIADAPRD ARTLIAGLLG LTSTDLIVQD NRVLSAEETS LIETAVERRL LFEPVHRILG RRAFYGLELA LSPATLEPRP DTEILIERVL PHLHAMVAKN GSVRLLDMGT GTGAIALALL QECPGTTALA TDISAEALAM ARQNAAANSL SDRFETLQSH WYEALSGRFD IILSNPPYIV SDVIKDLAPD VRLYDPAVAL DGGDDGLDAY RAIAAGAADF LKPGGLVGVE IGYDQAMAVT QLFANNSFVL VESAKDHGDN DRILLFAQTG PQL
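Note: the sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a field might be normalized before analysis, and how a GC fraction like the one in the GC content field could be computed, is given below. The helper names `normalize` and `gc_fraction` are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full field.

```python
def normalize(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# First two 10-base blocks of the gene sequence in the record above.
first_blocks = "ATGACAGGCG CACCGGTGAC"
print(len(normalize(first_blocks)))         # 20
print(round(gc_fraction(first_blocks), 2))  # 0.65
```

Passing the complete Gene sequence field to `gc_fraction` gives the value to compare against the reported GC content.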