Gene RPB_4252 details

Gene Information

Locus tag: RPB_4252
Symbol: metX
ID: 3912065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4837799
End bp: 4839001
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 67%
IMG OID: 637886157
Product: homoserine O-acetyltransferase
Protein accession: YP_487851
Protein GI: 86751355
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2021] Homoserine acetyltransferase
TIGRFAM ID: [TIGR01392] homoserine O-acetyltransferase
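
The coordinate and length fields above agree with each other, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4837799, 4839001)
print(length)                     # 1203
print(protein_length_aa(length))  # 400
```

Both values match the Gene Length and Protein Length fields of this record.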


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.647342
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAATG TACACCCGGT GAAAGGTCCC GTTGCGACCG GTGGCGAGCG CCCCCACGAG 
GCCGACCATC CGACGTCGCT GGTGGCGTCG TTCGGCGCCG ACCAGCCGCT GCGGCTCGAT
TGCGGCGTCG ACCTCGCCCC GTTCCAGATC GCCTACCAGA CCTATGGCAC GCTGAACGCC
GACAAGAGCA ACGCCATTCT GGTCTGCCAT GCGCTGACCA TGGACCAGCA CATCGCCAAT
GTGCATCCGA TCACCGGCAA GCCGGGCGGA TGGCTGACGC TGGTCGGTCC CGGCAAGCCG
ATCGACACCG ACCGCTATTT CGTCATCTGC TCCAACGTGA TCGGCAGTTG CATGGGCTCG
ACCGGTCCGG CCTCGACCAA TCCGGCCACC GGCAAGGTCT GGGGCCTCGA TTTTCCCGTC
ATCACCATCC CCGACATGGT CCGCGCCCAG GCGATGCTGG TCGACCGGCT CGGCATCGAC
AAATTGTTCT GCGTCGTCGG CGGCTCGATG GGCGGCATGC AGGTGCTGCA ATGGAGCGTC
GCCTATCCGG AGCGGGTGTT CTCGGCGATG CCGATCGCCT GCGCGACGCG GCATTCGGCG
CAGAACATCG CCTTCCACGA GCTCGGCCGC CAGGCGGTGA TGGCCGATCC GGACTGGGCC
CATGGCCGCT ATGTCGAGAC CGGCGCGCAT CCGCATCGCG GCCTCGCGGT GGCGCGGATG
GCCGCGCACA TCACCTATCT GTCCGACGCC GCCTTGCACC GCAAGTTCGG CCGCAGGATG
CAGGACCGCG AACTGCCGAC GTTCTCGTTC GACGCCGACT TCCAGGTCGA GAGCTATCTG
CGCTATCAGG GCTCGTCCTT CGTCGAGCGC TTCGACGCCA ACTCTTATCT CTATCTGACC
CGCGCGATGG ATTATTTCGA CATCGCCGCC GACCATCACG GCGTGCTGGC GGCGGCGTTC
CGCGGCACCC AGACGCGGTT CTGCGTGGTG TCGTTCACCT CCGACTGGCT GTTCCCGACG
CCGGAATCGC GCGCGATCGT GCATGCGCTC AACGCCGGCG GCGCGCGGGT GTCGTTCGCC
GAAGTCGAGA CCGACAAAGG CCACGACGCC TTTCTGCTCG ACGAGCCGGA ATTCATCGAC
ATCGCCCGCG CCTTCCTGCA CTCGGCTGCG ACCGCGCGCG GGCTCGACAA AGCGGGGCGC
TGA
 
Protein sequence
MMNVHPVKGP VATGGERPHE ADHPTSLVAS FGADQPLRLD CGVDLAPFQI AYQTYGTLNA 
DKSNAILVCH ALTMDQHIAN VHPITGKPGG WLTLVGPGKP IDTDRYFVIC SNVIGSCMGS
TGPASTNPAT GKVWGLDFPV ITIPDMVRAQ AMLVDRLGID KLFCVVGGSM GGMQVLQWSV
AYPERVFSAM PIACATRHSA QNIAFHELGR QAVMADPDWA HGRYVETGAH PHRGLAVARM
AAHITYLSDA ALHRKFGRRM QDRELPTFSF DADFQVESYL RYQGSSFVER FDANSYLYLT
RAMDYFDIAA DHHGVLAAAF RGTQTRFCVV SFTSDWLFPT PESRAIVHAL NAGGARVSFA
EVETDKGHDA FLLDEPEFID IARAFLHSAA TARGLDKAGR