Gene RPC_4007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4007 
Symbol 
ID3969197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4455196 
End bp4456245 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content68% 
IMG OID637927111 
Productsulphate transport system permease protein 1 
Protein accessionYP_533852 
Protein GI90425482 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1118] ABC-type sulfate/molybdate transport systems, ATPase component 
TIGRFAM ID[TIGR00968] sulfate ABC transporter, ATP-binding protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.451546 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGATTG AGGTTCGCAA TATCGTAAAG CGCTTCGGCG CCTTCACCGC GCTGGACAAT 
GTCGACCTGC GGGTCGAGAC CGGCGAGCTG TTGGCGCTGC TCGGTCCCTC GGGCTCCGGC
AAGACCACGC TGCTGCGCAT CATCGCCGGG CTGGATTGGC CCGACGCCGG CTCGGTGTCG
TTCGACGGCG AGGACGCGCT GTCGCGCGGC GCCGGCGAGC GCCATGTCGG CTTCGTGTTC
CAGCACTACG CGCTGTTCCG CCATATGAGC GTGTTCGAGA ACGTCGCTTT CGGGCTGCGG
GTGCAGCCGC GCAGCATTCG TAAATCCGAG GCGCAGATCA AGGCGCGGGT CAAAGACCTG
TTGGACCTGG TGCAGCTCGA CTGGCTGGCC GACCGTTATC CGAGCCAGCT CTCCGGCGGC
CAGCGCCAGC GCATCGCGCT GGCGAGAGCG CTGGCGATCG AACCGCGCAT CCTGTTGCTC
GACGAGCCGT TCGGCGCGCT CGACGCCAAG GTGCGTAAAG AGCTGCGCAA ATGGCTGCGC
ACGCTGCACG AGGAAATCCA CGTCACCTCG ATCTTCGTCA CCCACGACCA GGAAGAGGCG
CTGGAAGTCG CCAACCGCGT GGTGGTGATG GACAAGGGCA GGATCGAACA GATCGGCAGC
CCCGGCGACG TCTATGAGGA CCCGGCGACC GCCTTCGTGC ACGGCTTCAT CGGCGAATCC
ATCGTGCTGC CGGTCGACAT CCGCGACGGA CGGGTGCGGC TCGGCGACCG CGAGCTCAAC
CTGGACTCTC GCGACGCCCG GCCGGGGCCG TCGAAACTGT TCATCCGCCG CCACGATCTG
GCGATCGGGC CGTCCGGCGC CGGCGCGCTG GAAGGTGCGG TGAAGCACGT CCGCGCCTTC
GGGCCGACGC AGCGCGCCGA CATCCTGCTC AGCGCGGTGG GCGAGGGCAC GCTGATCGAG
ATCGACGCGC CGCGCGATCG CGACCTGAAA CCCGGCGACA TCGTCAGCCT GCAGCCGCGC
CGCTACCGGA TTTTCGCCGA GACGGGTTAG
 
Protein sequence
MTIEVRNIVK RFGAFTALDN VDLRVETGEL LALLGPSGSG KTTLLRIIAG LDWPDAGSVS 
FDGEDALSRG AGERHVGFVF QHYALFRHMS VFENVAFGLR VQPRSIRKSE AQIKARVKDL
LDLVQLDWLA DRYPSQLSGG QRQRIALARA LAIEPRILLL DEPFGALDAK VRKELRKWLR
TLHEEIHVTS IFVTHDQEEA LEVANRVVVM DKGRIEQIGS PGDVYEDPAT AFVHGFIGES
IVLPVDIRDG RVRLGDRELN LDSRDARPGP SKLFIRRHDL AIGPSGAGAL EGAVKHVRAF
GPTQRADILL SAVGEGTLIE IDAPRDRDLK PGDIVSLQPR RYRIFAETG