Gene RPB_2954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2954 
Symbol 
ID3910753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3363230 
End bp3364321 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content67% 
IMG OID637884860 
Producthypothetical protein 
Protein accessionYP_486567 
Protein GI86750071 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1509] Lysine 2,3-aminomutase 
TIGRFAM ID[TIGR00238] KamA family protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.583634 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGGG TCACACGATT GCAAACGCCC GCCGCCACAA CACTGCGGCA ACCCTCTGAG 
CTGATCGCGC AAGGTCTCGC GCCGGCGGAG TCGCGCGACG ATCTCGAACA GGTCGCGGCA
CGCTACGCGA TCGCGGTGAC GCCCGATGTC GCGGCGCTGA TCGATCCGAA CGATCCCCAC
GATCCGATCG CGCGGCAATA CATCCCGCGT GCCGACGAAC TCGTCACGCT GCCGATCGAG
CGCGACGATC CGATCGGCGA CGGCGCCCAT GCGCCGGTCG AGGGCATCGT GCATCGCCAC
CGCGACCGCG TGCTGCTGAA GCTGGTGCAT GTCTGCGCGG TGTATTGCCG GTTCTGTTTC
CGCCGCGAGA CGATCGGCCC CGGCAAGGAC AACGCGCTGT CGCGCGAGGC GACCGCGGCG
GCGCTCGACT ACATCCGGGC CCACCCGGAA ATCTGGGAGG TGATCTTCAC CGGCGGCGAT
CCGTTGATGC TGTCGCCGCG CCGGATGGCC GAGATCATGG CCGAACTGGC GACCATCGCG
CACGTCAAGA TCATCCGCTT CCACACCCGC GTGCCGGTTG CCGATCCTGC GCGGATCACG
CCGGAGCTGG TGCGGGCGTT GCAGACACCC GGGAAAACCA CCTGGGTCGC GCTTCACGCC
AACCACCCGC GCGAGCTCAC CGCGGCTGCG CGCGCCGCGT GTGCGATGCT GATAGATGCC
GGGATTCCGA TGGTCAGCCA GTCGGTGCTG TTGCGCGGTG TGAACGATGA TTCGGAGACG
CTGGAAGCCT TGATGCGCGG CTTCGTCGAA TGCCGGATCA AGCCGTATTA TCTGCATCAC
GGCGACCTCG CGCCGGGCAC CGCGCATCTG CGCACCACGA TCGCGGAAGG GCAGGCGCTG
ATGCGGGCGC TGCGCGGCCG CGTCTCCGGC CTGTGCCAGC CGGAATACGT GCTCGACATT
CCCGGCGGCT ACGGCAAGGC GCCGATCGGT CCGAACTATT TGACGGGCGA GGATGGAACA
GTCGCGGATT CGCGCTATCG TGTCCGCGAC TATTGCGGTG ACGTCCATCT CTATCCGCCC
GGCTCGTGCT GA
 
Protein sequence
MSRVTRLQTP AATTLRQPSE LIAQGLAPAE SRDDLEQVAA RYAIAVTPDV AALIDPNDPH 
DPIARQYIPR ADELVTLPIE RDDPIGDGAH APVEGIVHRH RDRVLLKLVH VCAVYCRFCF
RRETIGPGKD NALSREATAA ALDYIRAHPE IWEVIFTGGD PLMLSPRRMA EIMAELATIA
HVKIIRFHTR VPVADPARIT PELVRALQTP GKTTWVALHA NHPRELTAAA RAACAMLIDA
GIPMVSQSVL LRGVNDDSET LEALMRGFVE CRIKPYYLHH GDLAPGTAHL RTTIAEGQAL
MRALRGRVSG LCQPEYVLDI PGGYGKAPIG PNYLTGEDGT VADSRYRVRD YCGDVHLYPP
GSC