Gene RPD_4292 details


Gene Information

Locus tag: RPD_4292
Symbol:
ID: 4024815
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4753184
End bp: 4754422
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 62%
IMG OID: 637964500
Product: putative urea/short-chain amide transport system substrate-binding protein
Protein accession: YP_571410
Protein GI: 91978751
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID: [TIGR03669] urea ABC transporter, substrate-binding protein, archaeal type
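
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4753184, 4754422)
print(length)                     # 1239
print(protein_length_aa(length))  # 412
```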


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0103855
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGATCGA GACGACTGCT CCAAACTGCT TTTCTCGGCC TCGCGCTCGG CGCCATTGCG 
CCGTTGGCGA CGCAGGCGGC CGAACCGCCG CTGAAGGTCG GCCTGCTCGA AGACATCTCC
GGCGATCTCG CCTTCATGGG TATGCCGAAG TTGCACGGCT CGCAGCTCGC GGTCGAGGAA
ATCAACAAGA GCGGCGGCAT CCTCGGCCGG CAGATCGAGC TGATCCATCT CGACCCGCAG
GGCGACAACG CCCGTTACCA GGAGTTTGCC CGGCGGCTGC TCAATCGCGA CAAGGTCGAC
GTCCTGATCG GCGGCATCAC CTCGGCGGCG CGCGAAGCAT TGCGTCCGAT CGTCGGCCGC
ACCTCGACGC CGTATTTCTA CACGAACCAG TATGAAGGCG GCGTCTGCGA CGCCAGCATG
ATCAGCATGG GCGCGGTGCC CGAGCAGCAG TTCTCGACGC TGGTTCCCTG GATGGTGGAG
AAGTTCGGCA AGAAGGTCTA CGTCGTCGCC GCCGACTACA ATTTCGGCCA GATCTCGGCG
GAATGGAACC GCAAGATCAT CAAGGATCTC GGCGGGCAGG TGGTCGGCGA GGAGTTCATC
CCACTCGGAG TCTCGCAATT CGCGCAGACC ATCCAGAACA TCCAGAAGGC GAAGCCCGAC
TGGTTGCTGA CGATCAATGT CGGCGCCGCG CAGGATTCGT TCTTCGAACA GGCGGCCGCG
GCCAATCTCA ATCTGCCGAT GGGGTCGTCG ATCAAGGTGA TGCTCGGCTT CGAGCACAAG
CGCTTCAAGC CGCCGGCGCT CAACAACATG CACGCCACCG CGAACTGGTT CGAGGAAATC
GCCACGCCCG AGGCGGAGGC TTTCAAGAAG CGCTGGCGCG CCAAGTTCCC CGACGAAACC
TACATCAACG ACATGGGCTA CAACGCCTAC AACGCGCTGT ACATGTACAA GACGCTGGCG
GAAAAGGCGA AGTCGACCAA GCTCGAAGAC CTCCGCAAGG TGATCGCGAC CGGCGAAGCC
TGCATCGATG CGCCCGAAGG CAAGGTCTGT ATCGATCCGA AGAGCCAGCA CACGTCGCAC
CGGATGCGTC TGATCTCGGT CGGACCCAAG CACGACGTCA CGGTCGTCAA GGACTACGGC
ACGATCCAGC CCTACTGGCT CGGCGAGGTC GGCTGCGACC TCACCAAGAA GAACGACAAG
GAACAGTACA CGCCCAATCA GCTGCCGAAG AAGTCGTGA
 
Protein sequence
MRSRRLLQTA FLGLALGAIA PLATQAAEPP LKVGLLEDIS GDLAFMGMPK LHGSQLAVEE 
INKSGGILGR QIELIHLDPQ GDNARYQEFA RRLLNRDKVD VLIGGITSAA REALRPIVGR
TSTPYFYTNQ YEGGVCDASM ISMGAVPEQQ FSTLVPWMVE KFGKKVYVVA ADYNFGQISA
EWNRKIIKDL GGQVVGEEFI PLGVSQFAQT IQNIQKAKPD WLLTINVGAA QDSFFEQAAA
ANLNLPMGSS IKVMLGFEHK RFKPPALNNM HATANWFEEI ATPEAEAFKK RWRAKFPDET
YINDMGYNAY NALYMYKTLA EKAKSTKLED LRKVIATGEA CIDAPEGKVC IDPKSQHTSH
RMRLISVGPK HDVTVVKDYG TIQPYWLGEV GCDLTKKNDK EQYTPNQLPK KS
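
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a minimal sketch, not the method used by the source database. The helper names are hypothetical. The codon table is built from the amino acid string for NCBI translation table 11, and the special handling of alternative start codons in that table is ignored, which does not matter here because this gene starts with ATG.

```python
# Sketch only: helper names are hypothetical, not part of the source record.
BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record's 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGCGATCGA GACGACTGCT CCAAACTGCT"
print(translate(first_blocks))  # MRSRRLLQTA
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the protein sequence and the GC content field listed in the record.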