Gene RPB_2667 details


Gene Information

Locus tag: RPB_2667
Symbol:
ID: 3910460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3049965
End bp: 3051245
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 66%
IMG OID: 637884567
Product: O-acetylhomoserine aminocarboxypropyltransferase
Protein accession: YP_486280
Protein GI: 86749784
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
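
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive (an assumption that matches the listed 1281 bp) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(3049965, 3051245)
print(length, protein_length_aa(length))  # 1281 426
```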


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.338224
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.137932
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAAC GCAGCCCGGG ATTTGCCACG CTCGCGGTTC ACGCCGGCGC GCAGCCCGAT 
CCCACCACCG GGGCGCGGGC GACGCCGATC TACCAGACCA CCTCCTTCGT GTTCAACGAC
GCCGACCACG CCGCCTCGCT GTTCGGCCTG CAGGCGTTCG GCAACATCTA CACCCGCATC
ACCAATCCGA CGACGGCGGT GCTCGAGGAG CGCGTCGCCG CGCTCGAAGG CGGCACCGCG
GCGCTCGCGA CCGCGTCCGG CCACGCCGCG CAGCTCGTGG TGATGCAGCA ATTGCTGATG
CCCGGCGACG AATTCATCGC CGCGCGAAAA CTCTACGGCG GCTCGATCAA CCAGTTCACC
CACGCCTTCA AGAGCTTCGG CTGGAACGTG GTGTGGGCCG ATCCCGACGA CATCGACAGC
TTCCAGCGCG CGGTGACGCC GAAGACCAAG GCGATCTTCA TCGAATCGAT CGCCAATCCG
GCGGGCTCCA TCACCGATAT CGAGGCGATC GCCGAAGTCG CGCGCAGTGC CGGCGTGCCG
CTGATCGTCG ACAACACCCT GGCGACGCCC TATCTGATCC GCCCGATCGA CCACGGCGCC
GACATCGTCG TGCATTCGCT GACGAAATTT CTCGGCGGCC ACGGCAATTC GCTCGGCGGC
ATCATCGTCG ACGCCGGCAC CTTCGACTGG TCGAAGGGCG GCAAATATCC GATGCTGAGC
GAGCCGCGGC CGGAATATCA CGGGCTGAAG CTGCAGGAGA CGTTCGGCAA TTTCGCCTTC
GCGATCGCCT GCCGCGTGCT CGGCCTGCGC GACCTCGGCC CGGCGCTGTC GCCGTTCAAC
GCCTTCCTGC TGATGACCGG CATCGAGACG CTGCCGCTGC GGATGCAGAA GCATTGCGAG
AACGCCAAGG CGATCGCCGA ATTCCTGGCG ACCCACAAGG CGGTGTCGGC GGTGAACTAT
TCCGGCCTGG CGTCGAGCAA GTACAATGCG CTGGCCCGCA AATACGCGCC GAAGGGCGCC
GGCGCGGTGT TCACCTTCAG CCTCAAGGGT GGCTACCAGG CCGGCGTCGA TCTGGTCTCC
AATGTGAAGC TGTTCTCGCA TCTCGCCAAT GTCGGCGACA CCCGTTCGCT GATCATCCAT
CCGGCCTCGA CCACCCACAG CCAGCTCGAC GACGCGCAGA AGACGGCGGC CGGCGCCGCG
CCGGACATGG TGCGGGTGTC GATCGGCATC GAGGACAAGG AAGATCTGAT CGCGGATCTC
GACGGAGCGC TCGGCGGCTG A
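
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. The following is a minimal sketch of how the GC content figure in the record could be recomputed from such text. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGACCGAAC GCAGCCCGGG ATTTGCCACG CTCGCGGTTC ACGCCGGCGC GCAGCCCGAT"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

Passing the full 1281 bp sequence to `gc_percent` would give the gene-wide value to compare against the GC content field.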
 
Protein sequence
MTERSPGFAT LAVHAGAQPD PTTGARATPI YQTTSFVFND ADHAASLFGL QAFGNIYTRI 
TNPTTAVLEE RVAALEGGTA ALATASGHAA QLVVMQQLLM PGDEFIAARK LYGGSINQFT
HAFKSFGWNV VWADPDDIDS FQRAVTPKTK AIFIESIANP AGSITDIEAI AEVARSAGVP
LIVDNTLATP YLIRPIDHGA DIVVHSLTKF LGGHGNSLGG IIVDAGTFDW SKGGKYPMLS
EPRPEYHGLK LQETFGNFAF AIACRVLGLR DLGPALSPFN AFLLMTGIET LPLRMQKHCE
NAKAIAEFLA THKAVSAVNY SGLASSKYNA LARKYAPKGA GAVFTFSLKG GYQAGVDLVS
NVKLFSHLAN VGDTRSLIIH PASTTHSQLD DAQKTAAGAA PDMVRVSIGI EDKEDLIADL
DGALGG
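
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is a simplified illustration, not the pipeline that produced this record: it uses the amino acid assignments shared by the standard code and translation table 11, stops at the first stop codon, and does not implement the alternative start codon handling of table 11 (this gene begins with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids (stop codons as "*").
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1 until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence in this record.
print(translate("ATGACCGAAC GCAGCCCGGG ATTTGCCACG"))  # MTERSPGFAT
print(translate("GACGGAGCGC TCGGCGGCTG A"))           # DGALGG
```

Both fragments reproduce the corresponding ends of the listed protein sequence.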