Gene Rpal_3102 details

Gene Information

Locus tag: Rpal_3102
Symbol:
ID: 6410773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3346611
End bp: 3347891
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 66%
IMG OID: 642712982
Product: O-acetylhomoserine aminocarboxypropyltransferase
Protein accession: YP_001992083
Protein GI: 192291478
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
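
The start, end, gene length and protein length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the protein length excludes the stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 3346611
END_BP = 3347891
GENE_LENGTH_BP = 1281
PROTEIN_LENGTH_AA = 426


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the stop codon is not counted."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1281
print(expected_protein_length(GENE_LENGTH_BP))  # 426
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.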


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.444026
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAAC GCAACCCAGG ATTCGCGACG CTCGCGGTCC ATGCCGGCGC CCAGCCCGAT 
CCCACCACCG GTGCGCGCGC GACGCCGATC TATCAGACCA CATCGTTCGT CTTCAACGAC
GCCGACCATG CCGCCTCGCT GTTCGGCCTG CAGGCGTTCG GCAACATCTA CACCCGCATC
ACCAACCCGA CGACGTCGGT GCTCGAAGAG CGCGTCGCCG CGCTCGAAGG CGGCACCGCC
GCGCTCGCCA CCGCTTCGGG CCACGCCGCC CAGCTCGTCG CGCTGCAGCA ACTGATGCAG
CCCGGCGACG AGTTTATCGC CGCGCGCAAA CTCTACGGCG GCTCGATCAA TCAGTTCACC
CACGCCTTCA AGAGCTTCGG CTGGAACGTG GTGTGGGCCG ACACCGACGA CCTCGCCAGC
TTCCAGCGGG CGGTGTCGCC GAAGACCAAG GCGATCTTCA TCGAGTCGAT CGCCAATCCG
GGCGGCAGCA TCACCGACAT CGAGGCGGTG GCGGAAGTCG CGCGCAATGC GGGCGTGCCG
CTGATCGTCG ACAACACGTT GGCGACGCCC TACCTGATCC GCCCGATCGA CCATGGCGCG
GACATCGTGG TGCATTCGCT CACCAAGTTC CTGGGCGGCC ACGGCAACTC GCTCGGTGGT
ATCATCGTCG ATGCCGGCAC CTTCGACTGG TCGAAGGACG GCAAATATCC GATGCTGAGC
GAGCCGCGCC CCGAATATCA CGGCCTGAAG ATCCAGGAGA CGTTCGGCAA CTTCTCGTTC
GCGATTGCCT GCCGCGTGCT CGGCCTGCGC GATCTCGGGC CCGCGCTGTC GCCGTTCAAC
GCCTTTCTGC TGCTGACCGG CATCGAGACG CTGCCGCTGC GTATGCAGAA GCACTGCGAG
AACGCCAAGG CGATTGCTGA ATTCCTCTCC ACCCACAAGG CGGTGGACGA GGTCAACTAC
TCCGGCCTCG CCTCCAGCAA ATACGCCGCC CTCGCCCGCA AATATGCGCC GAAGGGCGCC
GGCGCGGTGT TCACCTTCAG CCTCAAGGGC GGCTATCAGG CGGGCATCGA TCTGGTCGCC
AATCTGAAGC TGTTCTCGCA TTTGGCCAAT GTCGGCGACA CTCGCTCGCT GATCATCCAC
CCGGCCTCGA CCACCCACAG CCAGCTTGAC GACGCCCAGA AGACCGCCGC CGGCGCCGCG
CCCAACATGG TGCGGGTGTC GATCGGCATC GAGGACAAGG ACGACCTGAT CGCCGACCTC
GATCAGGCAC TCGGCGGCTG A
 
Protein sequence
MTERNPGFAT LAVHAGAQPD PTTGARATPI YQTTSFVFND ADHAASLFGL QAFGNIYTRI 
TNPTTSVLEE RVAALEGGTA ALATASGHAA QLVALQQLMQ PGDEFIAARK LYGGSINQFT
HAFKSFGWNV VWADTDDLAS FQRAVSPKTK AIFIESIANP GGSITDIEAV AEVARNAGVP
LIVDNTLATP YLIRPIDHGA DIVVHSLTKF LGGHGNSLGG IIVDAGTFDW SKDGKYPMLS
EPRPEYHGLK IQETFGNFSF AIACRVLGLR DLGPALSPFN AFLLLTGIET LPLRMQKHCE
NAKAIAEFLS THKAVDEVNY SGLASSKYAA LARKYAPKGA GAVFTFSLKG GYQAGIDLVA
NLKLFSHLAN VGDTRSLIIH PASTTHSQLD DAQKTAAGAA PNMVRVSIGI EDKDDLIADL
DQALGG
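
The sequences above are printed in space-separated blocks of ten letters, so they need to be joined before any analysis. The following is a minimal sketch of that step, with a simple GC-fraction helper. The function names are hypothetical, and only the first and last lines of the gene sequence are used here as sample input.

```python
def join_blocks(lines):
    """Join sequence lines printed in space-separated blocks into one string."""
    return "".join("".join(line.split()) for line in lines).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the listing above.
first_line = "ATGACCGAAC GCAACCCAGG ATTCGCGACG CTCGCGGTCC ATGCCGGCGC CCAGCCCGAT"
last_line = "GATCAGGCAC TCGGCGGCTG A"

head = join_blocks([first_line])
tail = join_blocks([last_line])

print(len(head), head[:3])  # 60 ATG
print(tail[-3:])            # TGA
print(round(gc_fraction(head), 2))
```

Passing all gene sequence lines to `join_blocks` would give the full coding sequence, whose GC fraction could then be compared with the GC content field. The sample lines show that the sequence opens with an ATG start codon and closes with TGA, which is a stop codon in translation table 11.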