Gene RPC_4536 details


Gene Information

Locus tag: RPC_4536
Symbol:
ID: 3972085
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 5066511
End bp: 5067632
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 61%
IMG OID: 637927647
Product: NLPA lipoprotein
Protein accession: YP_534377
Protein GI: 90426007
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR03427] ABC transporter periplasmic binding protein, urea carboxylase region
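
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon (the function names are hypothetical, chosen only for illustration):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(5066511, 5067632)
print(length)                  # 1122
print(protein_length(length))  # 373
```

Both values agree with the Gene Length and Protein Length fields of this record.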


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.996244
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCTATC GCCATTCCCT TTTTGTATTT TTTACAAGAG GGATCGCAAT GAACAAACTC 
AGCTCCATTC TCCGAAACAT GGCTCTGGCG TCGGCAATGA TCGCCGTAGC CGTGTCGAGT
GCCGACGCCG CGCCGAAGAA GGATTTCAAG GTCGCTTGGT CGATCTATGT CGGCTGGATG
CCGTGGGGCT ATGCCGCCGA CACCGGCATC GTCAAGAAAT GGGCCGACAA ATACGGCATC
ACCATCGAGG TCAAGCAGTT CAACGACTAC GTCGAATCGA TCAACCAATA CACCGCCGGC
TCCTATGACG CGGTGACCAT CACCAACATG GACGCGCTGT CGATCCCCGC CGCCGGCGGC
GTCGACACCA CCGCGCTGGT GATGGGCGAC TTTTCCAACG GCAACGACGC GGTGATCCTG
AAGGGCAAGT CGGATCTCGC CGCGATCAAG GGCCAGAAGG TCAACCTGGT CGAGTTCTCG
GTGTCGCATT ATCTCTTGGC GCGCGCGCTG GAGACCAAGA AGCTCGCCGA AAAGGACATC
AAGGTCGTCA ACACCTCCGA CGCCGATCTC GCCGCCGCCT ACAAGACCCC CGAAGTCACC
GCGGTGGTGA CCTGGAATCC GATCGTCGCC GAAATCCTCA CCGCGCCGGA CGCCAAGAAG
GTGTTCGATT CCTCGCAGAT CCCCGGCGAG ATCATCGACC TGATGGTCGC CAACACCGCG
GTGGTGAAGG ACAATCCGGA CTTCGCCAAG GCGCTGGTCG GGATCTGGTA CGAGACCATC
GCCAAGATGA CCGCGGCCGG CGCCGACGGC AAGGCCGCCA AGGAAGCGAT GGCCAAGGCC
TCCGGCACCG ATCTCGCCGG CTTCGACAGC CAGCTCGCCT CGACCAAATT GTTCGACAAG
CCGGCCGACG CCGAAGCCTT CACCAAGAGC GCCACGGTCG GCAAGACCAT GGACCGGGTG
CGCAAATTCC TGTTCGAGAA GGAACTGCTC GGCAAGGGCG CGAAGTCGGC CGATGCGGTC
GGCATCGAGC TCGGCGACAA GACCGTGCTC GGCGACAAGG CCAATGTGAA GCTGCGCTTC
GACGCCAGCT ACATGGACCT CGCCGCCAAG GGCAAGCTGT AA
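
A GC content figure like the one in this record can be computed from a sequence printed in blocks of ten, as above. The sketch below is one way to do it, assuming whitespace is only layout and that GC content means the percentage of G and C among all bases; `gc_percent` is a hypothetical helper name, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence above.
print(gc_percent("ATGGTCTATC GCCATTCCCT"))  # 50.0
```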
 
Protein sequence
MVYRHSLFVF FTRGIAMNKL SSILRNMALA SAMIAVAVSS ADAAPKKDFK VAWSIYVGWM 
PWGYAADTGI VKKWADKYGI TIEVKQFNDY VESINQYTAG SYDAVTITNM DALSIPAAGG
VDTTALVMGD FSNGNDAVIL KGKSDLAAIK GQKVNLVEFS VSHYLLARAL ETKKLAEKDI
KVVNTSDADL AAAYKTPEVT AVVTWNPIVA EILTAPDAKK VFDSSQIPGE IIDLMVANTA
VVKDNPDFAK ALVGIWYETI AKMTAAGADG KAAKEAMAKA SGTDLAGFDS QLASTKLFDK
PADAEAFTKS ATVGKTMDRV RKFLFEKELL GKGAKSADAV GIELGDKTVL GDKANVKLRF
DASYMDLAAK GKL
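
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which does not matter here because this gene begins with ATG). The names `BASES`, `AMINO_ACIDS` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGGTCTATC GCCATTCCCT TTTTGTATTT"))  # MVYRHSLFVF
```

The output matches the first block of the protein sequence, and the final codons of the gene (GGC AAG CTG TAA) translate to the closing residues GKL followed by the stop.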