Gene Rpal_1757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1757 
Symbol 
ID6409414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1885931 
End bp1887226 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content66% 
IMG OID642711645 
ProductO-acetylhomoserine aminocarboxypropyltransferase 
Protein accessionYP_001990760 
Protein GI192290155 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGCAC CCAAACCGCC CGGATTTGAA ACCCTCAGCC TGCACGCCGG GCAACATCCC 
GATCCGCTGA CCGGCTCGCG CGCGGTGCCG ATCTATCAGA CCACGTCCTA CGTGTTTCAG
GACACCGACC ACGCCGCGGC GCTGTTCAAT ATGGAGCGGC CCGGCCACCT CTACACGCGG
ATCTCCAACC CGACGATCGC GGTGCTGGAA GAGCGGATCG CCGCGCTGGA GAACGGCGTC
GGCGCGGTGG CGACCGCCAG CGGCATGGCG GCGCTGCATC TGGCGATCGC GACGCTGCTG
AATGCCGGCG ATCACATCGT CGCGTCGTCG TCGCTGTACG GCGGCACGAT CAACCTCTTG
GCGCACACGC TGCCGCGGTT CGGCATCACC ACCACCTTCG TGGCGCCTCG CGACCACGCC
GGCCTCGCAG CGGCGATCCA GCCGAACACC AGGCTGGTGA TCGGCGAGAC CATCGGCAAT
CCGGGCCTCG AAGTGCTCGA CATCCCGAAG GTCGCGGCCA TCGCCCATGA GGCGAAAATC
CCGCTCTTGA TCGACAACAC CTTCGCGACG CCGTACCTCA GCAAGCCGAT CGAGCTCGGC
GCCGACATCG TGATGCATTC GGTCACAAAA TGGCTCGGCG GCCACGGCAT CGCCATCGGC
GGCGTGCTGG TCGACGGCGG CCGGTTCGAC TGGCGCGGCT CCGGCAAGTT TCCGACGCTG
ACCGAGCCCT ACGCCGGCTA TCACGACGTC GTCTTCGACG AGCAGTTCGG CCCGCCCGCT
TTCGTCATTC GCGCGCGGAT GGAAGGTCTG CGCGATTTCG GCGCCTGCCT GTCGCCGACC
AACGCTTTCC AGCTGCTGCA GGGCATCGAG ACGCTGCCGG TTCGGATGGA TCGGCATGTC
GCCAACACCA AGGCGGTGCT CGACTTCCTT CAGACCAACA AGGCGGTCGA TTGGGTTCTG
CATCCGACGC TGGACAACCA CCCCGACTAC GAACTGGCAA AGACGCTGCT GCCGAACGGC
GCCGGCTCGA TCATCTCGTT CGGCATCAAG GGCGGCCGTG CCGCCGGGCG CAAGTTCATC
GAGGCGCTGC GGCTGACCAG CCATCTCGCC AATGTCGGCG ACGCCAAGAC GCTGGTGATC
CATCCGGCAT CGACCACGCA TCAGCAGATG AGCGCCGAGC AGCTCGAGGC CGCCGGCATC
GGTGAAGAGC TGATCCGGCT GTCGGTCGGC ATCGAAACCG CCGACGACAT TATCGCTGAC
CTGGCGCAGG CGCTGCGCGT TTCGCAGAAG GGCTGA
 
Protein sequence
MAAPKPPGFE TLSLHAGQHP DPLTGSRAVP IYQTTSYVFQ DTDHAAALFN MERPGHLYTR 
ISNPTIAVLE ERIAALENGV GAVATASGMA ALHLAIATLL NAGDHIVASS SLYGGTINLL
AHTLPRFGIT TTFVAPRDHA GLAAAIQPNT RLVIGETIGN PGLEVLDIPK VAAIAHEAKI
PLLIDNTFAT PYLSKPIELG ADIVMHSVTK WLGGHGIAIG GVLVDGGRFD WRGSGKFPTL
TEPYAGYHDV VFDEQFGPPA FVIRARMEGL RDFGACLSPT NAFQLLQGIE TLPVRMDRHV
ANTKAVLDFL QTNKAVDWVL HPTLDNHPDY ELAKTLLPNG AGSIISFGIK GGRAAGRKFI
EALRLTSHLA NVGDAKTLVI HPASTTHQQM SAEQLEAAGI GEELIRLSVG IETADDIIAD
LAQALRVSQK G