Gene RPD_2703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2703 
Symbol 
ID4023201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3019567 
End bp3020847 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content65% 
IMG OID637962902 
ProductO-acetylhomoserine aminocarboxypropyltransferase 
Protein accessionYP_569833 
Protein GI91977174 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.680985 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.221221 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAC GCAGCCCGGG TTTTGCCACG CTCGCGGTCC ACGCAGGCGC GCAGCCCGAT 
CCCACCACAG GCGCACGGGC GACGCCGATC TACCAGACCA CCTCGTTCGT CTTCAACGAC
GCCGACCATG CCGCCTCGCT GTTCGGGCTG CAGGCGTTCG GCAACATCTA CACCCGCATC
ACCAACCCGA CGACCGCGGT GCTGGAAGAG CGCGTCGCCG CGCTGGAGGG CGGCACTGCG
GCGCTTGCGA CCGCGTCCGG CCATGCGGCG CAGCTTGTGG CGTTTCAGCA GCTGCTGCGG
CCCGGCGACG AGTTCATTGC CGCCCGCAAA TTGTACGGTG GCTCGATCAA CCAGTTCACC
CACGCCTTCA AGAGCTTCGG CTGGAACGTG GTGTGGGCCG ATCCTGACGA TCTCGGCAGC
TTCCAGCGCG CGGTGTCGCC GAAGACCAAG GCGATCTTCA TCGAGTCGAT CGCCAATCCC
GGTGGCAGCA TCACCGACAT CGAGGCGATC GCCGAGGTCG CGCGCAACGC CGGTGTGCCG
CTGATCGTCG ACAACACGCT GGCGACGCCC TATCTGATCC GCCCGATCGA CCATGGCGCC
GACATCGTCG TGCATTCGCT GACCAAATTT CTCGGCGGCC ACGGCAATTC GCTGGGCGGC
ATCATCGTCG ACGCCGGCAC CTTCGATTGG TCGCGCGACG GCAAATATCC GATGCTGAGC
GAGCCGCGGC CGGAGTATCA CGGTCTCAAG CTGCAGGAGA CGTTTGGAAA CTTCTCCTTC
GCGATCGCCT GCCGCGTGCT CGGCCTGCGC GATCTCGGTC CGGCGCTGTC GCCGTTCAAC
GCCTTCCTGC TCATGACCGG AATCGAGACG CTGCCGCTGC GGATGCAGAA GCATTGCGAG
AACGCCAAGG CGATCGCCGA GTTCCTCTCG ACCCACAAGG CGGTGTCCTC GGTGAACTAT
GCGGGGCTGG CGTCGAGCAA GTACAACGCG CTCGCGCGCA AATACGCGCC GAAGGGGGCC
GGCGCGGTGT TCACCTTCGG CCTCAAGGGT GGTTATCAGG CCGGCGTCGA TCTGGTCTCG
AAGCTGAAGC TGTTCTCACA CCTCGCCAAT GTCGGCGATA CCCGTTCGCT GATCATTCAT
CCGGCCTCGA CCACCCACAG CCAGCTCGAC GATGCGCAGA AGACCGCCGC CGGCGCCGCG
CCCGACATGG TGCGCGTCTC GATCGGCATC GAGGACAAGG AAGACCTGAT CGCGGATCTC
GACGAGGCGC TCGGCGGTTG A
 
Protein sequence
MTERSPGFAT LAVHAGAQPD PTTGARATPI YQTTSFVFND ADHAASLFGL QAFGNIYTRI 
TNPTTAVLEE RVAALEGGTA ALATASGHAA QLVAFQQLLR PGDEFIAARK LYGGSINQFT
HAFKSFGWNV VWADPDDLGS FQRAVSPKTK AIFIESIANP GGSITDIEAI AEVARNAGVP
LIVDNTLATP YLIRPIDHGA DIVVHSLTKF LGGHGNSLGG IIVDAGTFDW SRDGKYPMLS
EPRPEYHGLK LQETFGNFSF AIACRVLGLR DLGPALSPFN AFLLMTGIET LPLRMQKHCE
NAKAIAEFLS THKAVSSVNY AGLASSKYNA LARKYAPKGA GAVFTFGLKG GYQAGVDLVS
KLKLFSHLAN VGDTRSLIIH PASTTHSQLD DAQKTAAGAA PDMVRVSIGI EDKEDLIADL
DEALGG