Gene RPD_3718 details

Gene Information

Locus tag: RPD_3718
Symbol:
ID: 4024234
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4152701
End bp: 4153996
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 65%
IMG OID: 637963922
Product: O-acetylhomoserine aminocarboxypropyltransferase
Protein accession: YP_570840
Protein GI: 91978181
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
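
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon. The function names span_length and expected_protein_length are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information fields above.
start_bp = 4152701
end_bp = 4153996
gene_length_bp = 1296
protein_length_aa = 431


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon.

    Assumption: the stop codon is counted in the CDS length, so it is
    subtracted from the codon count.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1296, matches Gene Length
print(expected_protein_length(gene_length_bp))  # 431, matches Protein Length
```

Under these assumptions the three fields agree: 4153996 - 4152701 + 1 = 1296 bp, and 1296 / 3 - 1 = 431 aa.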


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00194867
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCGCAC CCAAACCGCC CGGATTCGAA ACGCTCAGCC TGCACGCCGG GCAACAGCCC 
GACCCGCTGA CCGGCTCGCG CGCAGTGCCG ATCTATCAGA CCACATCCTA CGTGTTTCAG
GACACCGACC ATGCGGCCGC GCTGTTCAAC ATGGAGCGCG CCGGGCATCT GTATACGCGG
ATCTCGAACC CGACGATCGC CGTGCTGGAA GAGCGTGTCG CCGCGCTGGA GAACGGCGTC
GGCGCGGTGG CGACCGCGAG CGGCATGGCG GCGTTACATC TGGCGATCGC GACGCTGCTC
AACGCCGGCG ACCACATCGT CGCCTCCAGC TCGCTCTATG GCGGCACCAT CAATCTCTTG
ACGCACACGC TGCCGCGTTT CGGCATCACG ACCAGTTTCG TCAAGCCGCG CGATCACGCC
GGGCTCAAAG CCGCGATCAA GCCGAACACC AGACTGGTGA TCGGCGAGAC GATCGGCAAT
CCCGGGCTCG AAGTGCTCGA CATTCCGAAG GTCGCGGCGA TCGCCCACGA CGCGAAGATC
CCGCTGTTGA TCGACAACAC GTTTGCGACG CCCTATTTGA GCAGACCGAT CGAGCTCGGC
GCCGACATTG TGATGCATTC GGCGACCAAA TGGCTCGGCG GCCACGGCAT CGCGATCGGC
GGCGTGCTGG TCGATGGCGG GCGATTCGAC TGGCGCGGCT CCGGCAAATT CCCGACACTG
ACCGAGCCCT ATGCCGGCTA TCACGACATC GTCTTCGACG AACAGTTCGG GCCGCCGGCC
TTTATCATCC GCGCGCGGAT GGAAGGCCTG CGTGATTTCG GCGCCTGTCT GTCGCCGACC
AACGCGTTCC AGCTCATTCA AGGCGTCGAG ACGCTCCCGG TGCGGATGGA CCGCCATCTC
GCGAATACCA AGGCGGTGCT CGACTTCCTC GGCGGCAACA AAGCAGTCGA GTGGGTGCTG
CATCCGACGC TGCAGAGTCA TCCGGACTAC GCGCTGGCGA AAGAGCTGCT GCCGAAGGGC
GCCGGTTCGA TCATATCGTT CGGCATCAAA GGCGGCCGCG CCGCGGGGCG GAAGTTCATC
GAGGCGCTGC GGCTCACCAG CCATCTCGCC AATGTCGGCG ACGCCAAGAC GCTGGTGATC
CATCCGGCCT CGACCACGCA TCAGCAGATG GACGCCGCGC AACTCGCGAC CGCCGGCATC
GGCGAGGAAT TGATCCGGCT GTCGGTCGGC ATCGAGACCG CCGACGACAT CATCGCCGAC
CTTGCGCAGG CGCTGCGCGT TTCGCAGAAG GTGTAA
 
Protein sequence
MAAPKPPGFE TLSLHAGQQP DPLTGSRAVP IYQTTSYVFQ DTDHAAALFN MERAGHLYTR 
ISNPTIAVLE ERVAALENGV GAVATASGMA ALHLAIATLL NAGDHIVASS SLYGGTINLL
THTLPRFGIT TSFVKPRDHA GLKAAIKPNT RLVIGETIGN PGLEVLDIPK VAAIAHDAKI
PLLIDNTFAT PYLSRPIELG ADIVMHSATK WLGGHGIAIG GVLVDGGRFD WRGSGKFPTL
TEPYAGYHDI VFDEQFGPPA FIIRARMEGL RDFGACLSPT NAFQLIQGVE TLPVRMDRHL
ANTKAVLDFL GGNKAVEWVL HPTLQSHPDY ALAKELLPKG AGSIISFGIK GGRAAGRKFI
EALRLTSHLA NVGDAKTLVI HPASTTHQQM DAAQLATAGI GEELIRLSVG IETADDIIAD
LAQALRVSQK V
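
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method the database used. It assumes the codon assignments of NCBI translation table 11, which are the same as those of the standard code for elongation codons; table 11 differs mainly in which start codons it accepts, and that does not matter here because this gene begins with ATG. The names CODON_TABLE, clean, translate, and gc_fraction are hypothetical helpers. Only the first 30 bases and the final line of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, with the third position
# varying fastest. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and the last line of the gene sequence shown above.
FIRST_BLOCK = "ATGGCCGCAC CCAAACCGCC CGGATTCGAA"
LAST_LINE = "CTTGCGCAGG CGCTGCGCGT TTCGCAGAAG GTGTAA"

print(translate(FIRST_BLOCK))  # MAAPKPPGFE
print(translate(LAST_LINE))    # LAQALRVSQKV
```

The two outputs match the first ten and last eleven residues of the protein sequence. The last line starts in frame because the 21 preceding lines hold 60 bases each (1260 bases, a multiple of 3). Passing the full gene sequence to gc_fraction would give a value to compare with the GC content field.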