Gene RPC_1931 details


Gene Information

Locus tag: RPC_1931
Symbol:
ID: 3973563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2102321
End bp: 2103313
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 66%
IMG OID: 637925042
Product: ABC sulfate transport system, periplasmic binding protein
Protein accession: YP_531807
Protein GI: 90423437
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
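
The coordinates and lengths listed above are mutually consistent, which the minimal sketch below checks. It assumes 1-based, inclusive coordinates and a terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one terminal stop codon (assumed to be present)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2102321, 2103313)
print(length, protein_length_aa(length))  # prints: 993 330
```

Both results match the Gene Length and Protein Length fields.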


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.10366
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCTGCCG TTGCGACACG CCGCTCACTC ATGGTCGGAC TAGGCTCCTT GGCGCTGGCC 
GGCTTGGCGC CGCGTCGCGC ATTGTCCACG CCGTCCCGCG GGCTGCAGAT TTTGGGAGCA
CCGAACGGCT CGACCGTCGT GCTGGTCGAT CTGATCGAAT CCGGGGCGCT GGCGGCGGCG
GCGCCGGACG TGAGCTTCCG GCTGTGGCGA ACCACCGACG ATCTGCGCGC CGGCATCGTC
TCCGGCAATA CCAAGATCTT TTCGACGCCG AGCCATGTGC CCGCCAACCT CGCGAGCCGC
GGCATGCCGC TGAAGATGCT TTGCCTGCTC GGCATGGGGC ATCTGTCGGT GATCACCAGC
GACGACAGCA TCCGGGATTT TCATGACCTG GCCGGCAAGC CGATGCTCGG CTTCTTCCGC
AACGACATGC CGGATTTGAC CTTCCGGGCG ATCGCAAAAA TGGAAGGGCT CGATCCCGAC
AAGGACATTC AGCTGAGCTA CGTGCAAACC GCGATGGAAG CCGCGCAGAT GCTCGCCGCC
GGCCGCGCCA CCACCGCGAT CCTGTCCGAG CCGCCGGCCA CCGCCGCCAT GGTGATGGCC
GCGCAGCAGG AGCGCAAATT GCGTCGCGCC TTCGAACTCA CCACGATCTG GGGCCGACAC
AAGCCGAAGC CGCGGATTCC GATGGCGGGG ATCGCGCTGC ATGCCAGCCT GCTCGACGAC
GCGCCGGATT TGGTTGCAGC ATTGCGTGCC GGGCTGTTGC CGGCCAAGCA GCGCGTGCTG
GCCGATCCCG CGGCGGCGGC GAAGCTCGCC GAACGCCGCA TGGAGATGCG GCCGCAGATT
TTTGAGAAGG CGTTTCCCTA TATGCATATC GACGTGGTGT CGGCGAAGGA GGCCAAGGCC
GAACTGATCG ACTTCTACAC CACGCTGCTC GCGCTCGAGC CGGAAGCGCT CGGCGGCAAG
CTGCCGCCCG ACGATTTCTA TCTCGACCTC TGA
 
Protein sequence
MPAVATRRSL MVGLGSLALA GLAPRRALST PSRGLQILGA PNGSTVVLVD LIESGALAAA 
APDVSFRLWR TTDDLRAGIV SGNTKIFSTP SHVPANLASR GMPLKMLCLL GMGHLSVITS
DDSIRDFHDL AGKPMLGFFR NDMPDLTFRA IAKMEGLDPD KDIQLSYVQT AMEAAQMLAA
GRATTAILSE PPATAAMVMA AQQERKLRRA FELTTIWGRH KPKPRIPMAG IALHASLLDD
APDLVAALRA GLLPAKQRVL ADPAAAAKLA ERRMEMRPQI FEKAFPYMHI DVVSAKEAKA
ELIDFYTTLL ALEPEALGGK LPPDDFYLDL
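
The sequences above are printed in groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, with a GC-content calculation and a basic check that a coding sequence ends in a stop codon. The helper names are hypothetical, and the stop codon set is the one for translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# The last line of the gene sequence above, used here as a short sample.
tail = clean_sequence("CTGCCGCCCG ACGATTTCTA TCTCGACCTC TGA")
print(tail[-3:] in STOP_CODONS)  # prints: True
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_percent` would give a value comparable to the GC content field in the record.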