Gene Information |
Locus tag | RPD_4099 |
Symbol | metX |
ID | 4024621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4562581 |
End bp | 4563780 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637964307 |
Product | homoserine O-acetyltransferase |
Protein accession | YP_571219 |
Protein GI | 91978560 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2021] Homoserine acetyltransferase |
TIGRFAM ID | [TIGR01392] homoserine O-acetyltransferase |
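
The coordinate and length fields in this record can be checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp, End bp
length = gene_length(4562581, 4563780)
print(length)                  # 1200
print(protein_length(length))  # 399
```

Both results agree with the Gene Length (1200 bp) and Protein Length (399 aa) fields. The strand field does not affect this arithmetic, since the length is the same on either strand.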
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGTAC ATCCGGTAAA GGGACAGCTC GCCGCCAATG GCGAGCGGAC GCACGAGGCG GACCATCCGC ATTCGCAGGT CGCGCTGTTC GGCGCCGACC AGCCGCTGCG GCTCGATTGC GGCGTCGACC TCGCCCCGTT CCAGATCGCC TACCAGACCT ATGGCGAACT CAACGCCGAC AAGAGCAACG CCATCCTGGT TTGCCACGCG CTGACGATGG ATCAGCACAT CGCCAATGTG CATCCGATCA CCGGCAAGCC CGGCGGCTGG CTGACGCTGG TCGGCGCCGG CAAGCCGATC GATACCAGTC GCTATTTCGT GATCTGCTCG AATGTGATCG GCAGCTGCAT GGGCTCCACC GGCCCGGCCT CGACCAATCC GGCCACCGGC AAACCCTGGG GGCTGGATTT TCCGGTCATC ACAATCCCCG ACATGGTCCG GGCGCAGGCG ATGCTGATCG ACCGGCTCGG GATCGAGACG CTGTTCTGCG TAGTCGGCGG CTCGATGGGC GGGATGCAGG CGCTGCAATG GAGCGTGGCC TATCCGGAGC GGGTGTATTC GGCGCTGGCG GTGGCCTGCG CCACGCGGCA CTCGGCGCAG AACATCGCGT TCCACGAACT CGGCCGCCAG GCGGTGATGG CCGATCCGGA CTGGCGGCAC GGCCGCTATT TCGAGGAAGG CTGCTATCCG CATCGCGGCC TCGGCGTGGC GCGGATGGCC GCGCACATCA CCTATCTGTC CGACGCCGCG CTGCATCGCA AGTTCGGCCG CAGGATGCAG GATCGCGACC TGCCGACGTT CTCGTTCGAC GCCGATTTCC AGGTCGAGAG CTATCTGCGC TATCAGGGCT CGTCCTTCGT CGAGCGGTTC GACGCCAACA GCTATTTGTA TCTGACCCGC GCGATGGATT ATTTCGACAT CGCCGCGGAC CACAACGGTG TTCTGGCGGA GGCGTTTCGC GGCACCACGA CGCGGTTCTG CGTGGTGTCG TTCACGTCCG ACTGGCTGTT CCCGACCTCG GAGTCGCGCG CAGTCGTGCA CGCGCTCAAC GCCGGCGGCG CACGCGTGTC GTTCGCCGAG ATCGAGACCG ACCGCGGTCA CGACGCCTTC CTGCTCGACG TGCCGGAGTT CATCGACATC GCCCGCGCCT TCCTGCACTC GGCGGCGACG GCGCGCGGGC TCGGCAAAGC GGGGCGCTGA
|
Protein sequence | MNVHPVKGQL AANGERTHEA DHPHSQVALF GADQPLRLDC GVDLAPFQIA YQTYGELNAD KSNAILVCHA LTMDQHIANV HPITGKPGGW LTLVGAGKPI DTSRYFVICS NVIGSCMGST GPASTNPATG KPWGLDFPVI TIPDMVRAQA MLIDRLGIET LFCVVGGSMG GMQALQWSVA YPERVYSALA VACATRHSAQ NIAFHELGRQ AVMADPDWRH GRYFEEGCYP HRGLGVARMA AHITYLSDAA LHRKFGRRMQ DRDLPTFSFD ADFQVESYLR YQGSSFVERF DANSYLYLTR AMDYFDIAAD HNGVLAEAFR GTTTRFCVVS FTSDWLFPTS ESRAVVHALN AGGARVSFAE IETDRGHDAF LLDVPEFIDI ARAFLHSAAT ARGLGKAGR
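
The sequences above are printed in space-separated blocks of ten characters, so they need whitespace stripped before any computation. The following is a minimal sketch of how the GC content field and the basic shape of the CDS could be checked from the gene sequence. The helper names are hypothetical, and the stop codon set is the standard one for translation table 11; the short input strings are illustrative only, and the full gene sequence would be passed in the same way.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input in the same block layout (not taken from the record)
toy = clean_sequence("ATGGCC TGA")
print(toy, gc_fraction(toy), looks_like_complete_cds(toy))

# First two blocks of the gene sequence above; too short to be a complete CDS
head = clean_sequence("ATGAATGTAC ATCCGGTAAA")
print(len(head), gc_fraction(head), looks_like_complete_cds(head))
```

Applied to the full gene sequence, `gc_fraction` gives a value that can be compared with the GC content field, and `looks_like_complete_cds` confirms that the sequence is a whole number of codons ending in a stop.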