Gene RPD_4099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4099 
SymbolmetX 
ID4024621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4562581 
End bp4563780 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content66% 
IMG OID637964307 
Producthomoserine O-acetyltransferase 
Protein accessionYP_571219 
Protein GI91978560 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTAC ATCCGGTAAA GGGACAGCTC GCCGCCAATG GCGAGCGGAC GCACGAGGCG 
GACCATCCGC ATTCGCAGGT CGCGCTGTTC GGCGCCGACC AGCCGCTGCG GCTCGATTGC
GGCGTCGACC TCGCCCCGTT CCAGATCGCC TACCAGACCT ATGGCGAACT CAACGCCGAC
AAGAGCAACG CCATCCTGGT TTGCCACGCG CTGACGATGG ATCAGCACAT CGCCAATGTG
CATCCGATCA CCGGCAAGCC CGGCGGCTGG CTGACGCTGG TCGGCGCCGG CAAGCCGATC
GATACCAGTC GCTATTTCGT GATCTGCTCG AATGTGATCG GCAGCTGCAT GGGCTCCACC
GGCCCGGCCT CGACCAATCC GGCCACCGGC AAACCCTGGG GGCTGGATTT TCCGGTCATC
ACAATCCCCG ACATGGTCCG GGCGCAGGCG ATGCTGATCG ACCGGCTCGG GATCGAGACG
CTGTTCTGCG TAGTCGGCGG CTCGATGGGC GGGATGCAGG CGCTGCAATG GAGCGTGGCC
TATCCGGAGC GGGTGTATTC GGCGCTGGCG GTGGCCTGCG CCACGCGGCA CTCGGCGCAG
AACATCGCGT TCCACGAACT CGGCCGCCAG GCGGTGATGG CCGATCCGGA CTGGCGGCAC
GGCCGCTATT TCGAGGAAGG CTGCTATCCG CATCGCGGCC TCGGCGTGGC GCGGATGGCC
GCGCACATCA CCTATCTGTC CGACGCCGCG CTGCATCGCA AGTTCGGCCG CAGGATGCAG
GATCGCGACC TGCCGACGTT CTCGTTCGAC GCCGATTTCC AGGTCGAGAG CTATCTGCGC
TATCAGGGCT CGTCCTTCGT CGAGCGGTTC GACGCCAACA GCTATTTGTA TCTGACCCGC
GCGATGGATT ATTTCGACAT CGCCGCGGAC CACAACGGTG TTCTGGCGGA GGCGTTTCGC
GGCACCACGA CGCGGTTCTG CGTGGTGTCG TTCACGTCCG ACTGGCTGTT CCCGACCTCG
GAGTCGCGCG CAGTCGTGCA CGCGCTCAAC GCCGGCGGCG CACGCGTGTC GTTCGCCGAG
ATCGAGACCG ACCGCGGTCA CGACGCCTTC CTGCTCGACG TGCCGGAGTT CATCGACATC
GCCCGCGCCT TCCTGCACTC GGCGGCGACG GCGCGCGGGC TCGGCAAAGC GGGGCGCTGA
 
Protein sequence
MNVHPVKGQL AANGERTHEA DHPHSQVALF GADQPLRLDC GVDLAPFQIA YQTYGELNAD 
KSNAILVCHA LTMDQHIANV HPITGKPGGW LTLVGAGKPI DTSRYFVICS NVIGSCMGST
GPASTNPATG KPWGLDFPVI TIPDMVRAQA MLIDRLGIET LFCVVGGSMG GMQALQWSVA
YPERVYSALA VACATRHSAQ NIAFHELGRQ AVMADPDWRH GRYFEEGCYP HRGLGVARMA
AHITYLSDAA LHRKFGRRMQ DRDLPTFSFD ADFQVESYLR YQGSSFVERF DANSYLYLTR
AMDYFDIAAD HNGVLAEAFR GTTTRFCVVS FTSDWLFPTS ESRAVVHALN AGGARVSFAE
IETDRGHDAF LLDVPEFIDI ARAFLHSAAT ARGLGKAGR