Gene RPC_4020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4020 
Symbol 
ID3969210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4469269 
End bp4470141 
Gene Length873 bp 
Protein Length290 aa 
Translation table11 
GC content67% 
IMG OID637927124 
ProductSulfate ABC transporter, permease protein CysW 
Protein accessionYP_533865 
Protein GI90425495 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4208] ABC-type sulfate transport system, permease component 
TIGRFAM ID[TIGR00969] sulfate ABC transporter, permease protein
[TIGR02140] sulfate ABC transporter, permease protein CysW 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.144703 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.19248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAC AGCCGCAGCC TCCGATCAGA AAGACCGACT GGATGGTGAC GCCGGTCGGC 
GCCGGCCCGG TCGCCCGCCG CGTCGTCTTA AGCATCGTCG GCGTCACCAC GGCGCTGTTC
CTGCTGGCGC CGCTGGCGCT GATCGTCGCT TCGGCGTTCT CGCAAGGCGC CGGCGTGTTC
TTCCGCAGCC TCGGCGATCC GGGAACGCTG CACGCCATCA AGCTGACGCT GATCACCGCG
GCGATCGCGG TGCCGATGAA CATCCTGTTC GGCCTCGCCG CGGCCTGGAC CGTGACCAAA
TTCGTCTTCC CCGGCCGCAC CTTGCTGATC GCGCTGATCG AACTGCCGTA TTCGATTTCG
CCGATCGTCG CCGGCGTGGC GTTCCTTTTC GTCTACGGCT CGCAGGGGCT GTTCGGGCCA
CTCCTGGAGC AGCTCGGCCT CAAGGTGATG TTCGCGCTGC CGGCGATCGT GCTCGCCAGC
ATGTTCGTCA CCGCGCCGTT CGTGGCGCGC GAGCTGATCC CGTTGATGCA GGTGCAGGGC
ACCGACGAGG AGGAAGCCGC GGTGACGCTC GGCGCCTCCG GCTTTGCGAC TTTCGTCCGG
GTGACGCTGC CGAATATCCG CTGGGCGGTG CTGTACGGCG CCATCCTCTG CAACGCGCGG
GTGATGGGCG AATTCGGCGC GGTGTCGGTG GTGTCGGGCA ATATCCGCGG CCAGACCACC
ACGCTGCCGC TGCAGATCGA ACTTTTGTAC CAAGACTACA ACGTCGCCGG CGCCTTCGCC
GCAGCCACTG TGCTCACCGC GGTGGCGCTG TTGACCATCG TCATCAAGGC GGGGCTGGAG
CGGCTGGCCC GGGTCGAACA GGTCCAGCCC TGA
 
Protein sequence
MTKQPQPPIR KTDWMVTPVG AGPVARRVVL SIVGVTTALF LLAPLALIVA SAFSQGAGVF 
FRSLGDPGTL HAIKLTLITA AIAVPMNILF GLAAAWTVTK FVFPGRTLLI ALIELPYSIS
PIVAGVAFLF VYGSQGLFGP LLEQLGLKVM FALPAIVLAS MFVTAPFVAR ELIPLMQVQG
TDEEEAAVTL GASGFATFVR VTLPNIRWAV LYGAILCNAR VMGEFGAVSV VSGNIRGQTT
TLPLQIELLY QDYNVAGAFA AATVLTAVAL LTIVIKAGLE RLARVEQVQP