Gene RPC_4018 details


Gene Information

Locus tag: RPC_4018
Symbol:
ID: 3969208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4467088
End bp: 4468095
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 63%
IMG OID: 637927122
Product: thiosulphate-binding protein
Protein accession: YP_533863
Protein GI: 90425493
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4150] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
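
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4467088, 4468095)
print(length, protein_length_aa(length))  # 1008 335
```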


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.206129
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.132982
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCGTC TTGCCCTCGC CTTCATCGCC GGTCTCGGCG CCCTCACCGC GGTGTCCGCG 
GCGTGGGCGC AGACGCCGGC CACGCTGCTC AACGTCTCCT ACGACATTTC GCGCGAGCTC
TATGTCGAGA TCAACGCCGC CTTCACCAAG CAATGGAAGG CCAAGACCGG CCAGGACGTC
ACCATCAACC AGTCGCACAA CGGCTCGTCG CGGCAGGCCC GCTCGATCCT CGAAGGGCTC
GAGGCCGACG TCGTGACCTT CAACCAGGTC ACCGACGTGC AGGTGCTGTA CGACAAGGGC
AAGCTGATCC CGGCGGACTG GGCCAAGCGG CTGCCGAACA ATTCCTCGCC GTATTACTCG
CTGCCGGCAT TCCTGGTGCG CGCCGGAAAT CCCAAGGCCA TCAAGGATTG GGACGATCTG
GTGAAGCCCG GCGTGCAGGT GATCTTCCCC AACCCGAAGA CCTCGGGCAA TGCCCGCTAC
ACCTATCTCG CCGCCTATGC CTTCGCGAAG CACAAGTACG GCAACGAGGC CGAGGCTGAT
GCCTTCATCA AGAAGCTGTT CGGCAACGTG CCGGTGTTCG ACACCGGCGG TCGCGCCGCC
ACCACCACCT TCATCGAGCG GCAGACCGGC GACGTGCTGA TTTCCTTCGA AGCCGAGACC
AGCGCGATCC GCGACATCGC CGGCAAGGAC AAGTACCAAG TGGTGGTGCC GCCGACCAGC
CTCTTGGCGG AATTCCCGGT CAGCGTGGTC GACAAATACG CCGACAAGCA CGGCACCAGG
CCGCTCGCCA CCGCCTATCT GGAATATCTT TACTCGCCGG AGGGACAGAC CATTTTGGCC
AAGGCCTATA ACCGGGTGAA CGACAAGGCC GTGGCGGAGC AATTCAAGGA CAAGTTCCCC
GAGGTCAAGC TGTACCGGGT CGAGGACGAA TTCGGCGGCT GGGACAAGCT CACCGCCGAT
CACCTCGCCT CCGGCGCCAA GCTCGATCAA TTGTTCGGCG GACGCTAG
 
Protein sequence
MNRLALAFIA GLGALTAVSA AWAQTPATLL NVSYDISREL YVEINAAFTK QWKAKTGQDV 
TINQSHNGSS RQARSILEGL EADVVTFNQV TDVQVLYDKG KLIPADWAKR LPNNSSPYYS
LPAFLVRAGN PKAIKDWDDL VKPGVQVIFP NPKTSGNARY TYLAAYAFAK HKYGNEAEAD
AFIKKLFGNV PVFDTGGRAA TTTFIERQTG DVLISFEAET SAIRDIAGKD KYQVVVPPTS
LLAEFPVSVV DKYADKHGTR PLATAYLEYL YSPEGQTILA KAYNRVNDKA VAEQFKDKFP
EVKLYRVEDE FGGWDKLTAD HLASGAKLDQ LFGGR
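
The protein sequence follows from the gene sequence under translation table 11. As a minimal sketch of that step, the code below builds the codon table from the standard TCAG-ordered amino acid string (shared by tables 1 and 11) and translates the first line of the gene sequence. The names `translate_cds` and `START_CODONS` are illustrative, and the start codon set is a deliberately small subset of the starts table 11 allows; a start codon in the first position is read as methionine, which is why the leading GTG gives M rather than V.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a small subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_line = "GTGAACCGTC TTGCCCTCGC CTTCATCGCC GGTCTCGGCG CCCTCACCGC GGTGTCCGCG"
print(translate_cds(first_line))  # MNRLALAFIAGLGALTAVSA
```

The printed residues match the first 20 residues of the protein sequence above. For real work a maintained library such as Biopython is a better choice than a hand-built table.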