Gene RPD_1159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1159 
Symbol 
ID4021635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1318574 
End bp1319620 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content68% 
IMG OID637961351 
Productsulphate transport system permease protein 1 
Protein accessionYP_568298 
Protein GI91975639 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1118] ABC-type sulfate/molybdate transport systems, ATPase component 
TIGRFAM ID[TIGR00968] sulfate ABC transporter, ATP-binding protein 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.940545 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATTG AAGTCCGCAA TATCGTCAAG GAATTCGGCA GCTTCCGCGC GCTCGACAAT 
GTCGACCTGC GGGTCGAGAC CGGCGAGCTG ATGGCGCTGC TCGGCCCCAG CGGCTCCGGC
AAGACCACGC TGCTGCGGAT CATCGCCGGG CTGGAATGGC CCGACGCCGG GTCGATCGCG
TTCGACGGCG AGGACGCGCT GGCGCGCGGC GCGGCCGAGC GGCATGTCGG CTTCGTGTTC
CAGCACTACG CGCTGTTCCG GCACATGAGT GTGTTCGAGA ACGTCGCCTT CGGGCTGCGG
GTGCAGCCGC GCAGGATCCG CAGGAGCGAG GCGGAGATCA GGAAGCGTGT CGGCGATCTG
CTCGATCTGG TGCAACTCGG CTGGCTGGCC GACCGCTATC CCAACCAGCT CTCAGGTGGC
CAGCGCCAGC GTATCGCGCT GGCCCGCGCG CTGGCGATCG AGCCGCGCAT CCTGCTGCTC
GACGAGCCGT TCGGCGCGCT CGACGCCAAG GTCCGCAAGG AGCTGCGCGC CTGGCTGCGC
AATCTGCACG AGGAGATCCA CGTCACCTCG ATCTTCGTCA CCCACGATCA GGAGGAGGCG
CTCGAAGTCG CCAACCGGGT GGTAGTGATG GACAAGGGCA AGATCGAACA GATCGGCTCG
CCGGGCGACG TCTATGAGCG CCCCGCCTCC GCCTTCGTGC ACAGCTTCAT CGGCGAATCC
ATCGTACTGC CGGTCGAGGT CCGCGACGGG CGGGTTCAAC TCGGCGACCG CGTGCTCGAT
CTCGCGCCGC CCGAGACCGG GGGCGGTCCG TCGAAGCTGT TCGTTCGCCG CCACGACATC
GCGGTGGGGC CGAGCGGCAG CGGCGTGTTC GAAGGCGCGG TCAGGTCGGT GCGCGCGTTC
GGCCCGATGC AGCGCGCCGA TATCCTGCTG CAGGGCGTCG ACGGTGACAT GCTGGTCGAG
ATCGACGCGC CGCGCGACCA TTCGCTCAAG GTCGGCGACC GGATCGGCCT GCAGCCGCAG
CGCTACCGGA TCTTCGCTGA TCACTGA
 
Protein sequence
MTIEVRNIVK EFGSFRALDN VDLRVETGEL MALLGPSGSG KTTLLRIIAG LEWPDAGSIA 
FDGEDALARG AAERHVGFVF QHYALFRHMS VFENVAFGLR VQPRRIRRSE AEIRKRVGDL
LDLVQLGWLA DRYPNQLSGG QRQRIALARA LAIEPRILLL DEPFGALDAK VRKELRAWLR
NLHEEIHVTS IFVTHDQEEA LEVANRVVVM DKGKIEQIGS PGDVYERPAS AFVHSFIGES
IVLPVEVRDG RVQLGDRVLD LAPPETGGGP SKLFVRRHDI AVGPSGSGVF EGAVRSVRAF
GPMQRADILL QGVDGDMLVE IDAPRDHSLK VGDRIGLQPQ RYRIFADH