Gene RPC_1331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1331 
Symbol 
ID3974070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1447382 
End bp1448677 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content65% 
IMG OID637924441 
ProductO-acetylhomoserine aminocarboxypropyltransferase 
Protein accessionYP_531212 
Protein GI90422842 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.351506 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCAC CCAAACCACC CGGTTTCGAA ACCCTCAGCC TGCACGCCGG CCAACAGCCC 
GATCCGCTCA CCGGATCGCG CGCAGTGCCG ATCTATCAAA CCACCTCCTA CGTGTTTCAG
GACGCCGATC ACGCCGCGGC CTTATTCAAT CTGGAACGCC CCGGCCACAT CTACACGCGC
ATATCAAACC CAACGACCGC GGTCCTTGAA GAACGGCTCG CAGCACTCGA AGCCGGCGTC
GGCGCGGTCT GCACCGCGAG CGGCATGGCG GCGCTGCATC TCGCCATCGC CACGCTGCTC
AGCGCCGGCG ATCACATCGT CGCGTCATCA TCGCTCTATG GCGGCACCAT CAATCTGCTC
ACCCACACGC TGCCGCGGTT CGGCATCACG ACAAGCTTCG TGAAGCCGCG CGATCTCGAC
GGCATTGCAG CAGCAATCCA ACCCAACACG CGTCTGGTGA TCGGCGAAAC CATCGGCAAT
CCCGGGCTCG AAGTCTTGGA CATCCCCAAG GTCGCGGCGA TTGCGCACGC TGCGAAGATT
CCGCTGTTGA TCGACAACAC CTTCGCCACG CCGTATTTAT GCAGGCCGAT CGAGCACGGC
GCCGACATCG TGATGAACTC CGCCACCAAA TGGATCGGCG GCCACGGCAT CGCGATCGGC
GGCGTGATCG TCGACGGCGG CCGCTTCGAC TGGCGCGGCT CCGGCAAATT CCCGACGCTG
ACCGAGCCCT ATGCCGGCTA TCACGACATC GTCTTCGACG AACAATTCGG CCCGCCCGCC
TTCATCATGC GGGCACGGAT GGAAGGGCTG CGCGACTTCG GCGCGTGCCT GTCGCCAACC
AACGCCTTTC AACTGCTGCA AGGCGTCGAA ACGCTGGGAC TGCGGATGGA TCGCCACATC
GCCAACACCG CGGCGGTGAT CGCGTTCTTG GCCACGCACA AGGCGGTGGA GTGGCTGCTG
CATCCGTCGC TGGAGAATCA CCCGGACCAC GCATTGGCGA AGCAGCTGCT GCCGAAAGGC
GCCGGTTCGA TCGTCTCGTT CGGCATCAAG GGCGGCCGCG CCGCCGGCAA GAAATTCATC
GAGGCGCTGC GGCTGACCAG CCATCTTGCC AATGTCGGCG ACGCCAAGAC GCTGGTGATC
CATCCGGCCT CGACCACGCA TCAGCAGATG AGTGCGGAGC AGCTTGCCGC CGCCGGCGTC
GGCGAGGAGT TAGTTCGGCT GTCGGTCGGG CTGGAGTCGG CACAGGACAT CACCGACGAC
CTTGGCCAGG CGCTGCGCGC CTCGCAGAAA GGCTGA
 
Protein sequence
MPAPKPPGFE TLSLHAGQQP DPLTGSRAVP IYQTTSYVFQ DADHAAALFN LERPGHIYTR 
ISNPTTAVLE ERLAALEAGV GAVCTASGMA ALHLAIATLL SAGDHIVASS SLYGGTINLL
THTLPRFGIT TSFVKPRDLD GIAAAIQPNT RLVIGETIGN PGLEVLDIPK VAAIAHAAKI
PLLIDNTFAT PYLCRPIEHG ADIVMNSATK WIGGHGIAIG GVIVDGGRFD WRGSGKFPTL
TEPYAGYHDI VFDEQFGPPA FIMRARMEGL RDFGACLSPT NAFQLLQGVE TLGLRMDRHI
ANTAAVIAFL ATHKAVEWLL HPSLENHPDH ALAKQLLPKG AGSIVSFGIK GGRAAGKKFI
EALRLTSHLA NVGDAKTLVI HPASTTHQQM SAEQLAAAGV GEELVRLSVG LESAQDITDD
LGQALRASQK G