Gene RPD_1378 details


Gene Information

Locus tag: RPD_1378
Symbol:
ID: 4021855
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1547688
End bp: 1548695
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 63%
IMG OID: 637961571
Product: thiosulphate-binding protein
Protein accession: YP_568517
Protein GI: 91975858
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4150] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
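
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 1547688
end_bp = 1548695
gene_length_bp = 1008
protein_length_aa = 335

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# so the number of codons is one more than the protein length.
codon_count = gene_length_bp // 3

print(span_bp == gene_length_bp)             # span matches the reported gene length
print(gene_length_bp % 3 == 0)               # length is a whole number of codons
print(codon_count - 1 == protein_length_aa)  # codons minus stop = protein length
```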


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCGTT TCGCCCTTGC ATTGTTCGCT GGTGTCGGCG CGCTCGTAAC CGCGCTGCCT 
GCGCAGGCGC AGACGTCGAC CACGCTGCTC AACGTCTCCT ACGATATCTC CCGCGAGCTC
TACGCCGAGA TCAACGCGGC GTTCATTCCG CAATGGAAGG CGAAAACCGG ACAGGACATC
GCCATCAACC AGTCGCATGG CGGCTCGTCC CGGCAGGCGC GCTCGATCCT CGAAGGGCTC
GAAGCCGACG TGGTGACCTT CAACCAGGTC ACCGACGTCC AGGTGCTGCA TGACAAAGGC
AAGCTGATCC CGGCCGACTG GGCGAAGCGG CTACCGAACA ATTCCTCGCC GTATTATTCG
CTGCCGGCGT TTCTGGTTCG CGCCGGCAAC CCGAAAGGGA TCAAGGATTG GGACGATCTG
GTGAAGCCGG ACGTCAAGGT GATCTTCCCG AATCCGAAGA CCTCGGGCAA CGGCCGCTAT
ACCTATCTGG CGGCCTATGC CTTCGCGAAG CAGAAATACG GCAACGAGGC GGAGGCCGAC
GCGTTCATCA AGAAGCTGTT CGCCAATGTG CCGGTGTTCG ACACCGGCGG CCGCGCCGCG
ACGACGACCT TCGTCGAGCG TCAGACCGGC GACGTGCTGA TCACTTTCGA GGCCGAAACC
AGCTCGATCC GCGACCTCGC CGGAGCCGAC AAGTATCAAG TCGTGGTGCC GCCGACCAGC
CTGCTGGCCG AATTCCCGGT CAGCGTGGTC GACAAATACG CCGACAAGCA CGGTACCCGC
GCGCTCGCCA CCGCCTATCT CGAATATCTG TATTCGCCCG AGGGCCAGAC CATCCTCGCC
AAGGCGTATA ACCGCGTGCA AGACAAGGCC GTGATCGAGA AGTTCAAGGA CAAGTTCCCG
GAGGTGAAGC TGTACCGGGT CGAGGACGAA TTCGGCGGCT GGGACAGGCT CAACGCCGCG
CACCTCGCCT CCGGCGCCAA GCTCGATCAG CTGTTCGGCG GACGGTGA
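
The GC content reported for this gene is the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name gc_content is hypothetical, and the whitespace handling assumes the sequence is pasted in the blocks-of-ten layout used above.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the spaces and newlines of the block layout) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Example on the first ten bases of the gene sequence only.
first_block = "GTGAACCGTT"
print(gc_content(first_block))
```

Passing the full gene sequence to the same function gives the value to compare against the reported percentage.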
 
Protein sequence
MNRFALALFA GVGALVTALP AQAQTSTTLL NVSYDISREL YAEINAAFIP QWKAKTGQDI 
AINQSHGGSS RQARSILEGL EADVVTFNQV TDVQVLHDKG KLIPADWAKR LPNNSSPYYS
LPAFLVRAGN PKGIKDWDDL VKPDVKVIFP NPKTSGNGRY TYLAAYAFAK QKYGNEAEAD
AFIKKLFANV PVFDTGGRAA TTTFVERQTG DVLITFEAET SSIRDLAGAD KYQVVVPPTS
LLAEFPVSVV DKYADKHGTR ALATAYLEYL YSPEGQTILA KAYNRVQDKA VIEKFKDKFP
EVKLYRVEDE FGGWDRLNAA HLASGAKLDQ LFGGR
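
The protein sequence corresponds to the gene sequence translated with translation table 11, in which an alternative start codon such as GTG is read as methionine at the first position. The following is a rough sketch of that translation, not the database's own pipeline. The names translate_cds, CODON_TABLE and START_CODONS are hypothetical, and the set of start codons is a deliberately small subset chosen for this example.

```python
from itertools import product

# Codon-to-amino-acid assignments, listed in TCAG order.
# Table 11 uses the same assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small subset of start codons, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # A start codon in the first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAACCGTT TCGCCCTTGC ATTGTTCGCT"))  # MNRFALALFA
```

The output matches the first ten residues of the protein sequence, including the leading M that comes from the GTG start codon.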