Gene RPB_1048 details

Gene Information

Locus tag: RPB_1048
Symbol:
ID: 3908900
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1204880
End bp: 1205926
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 69%
IMG OID: 637882941
Product: sulphate transport system permease protein 1
Protein accession: YP_484669
Protein GI: 86748173
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1118] ABC-type sulfate/molybdate transport systems, ATPase component
TIGRFAM ID: [TIGR00968] sulfate ABC transporter, ATP-binding protein
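
The start, end, gene length, and protein length fields above are mutually consistent, which the short sketch below checks. It assumes that coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1204880
end_bp = 1205926

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the result matches the reported 1047 bp and 348 aa.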


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATTG AAGTCCGCAA CATCGTCAAG CAATTCGGCA GTTTCCGCGC GCTCGACAAT 
GTCGACCTGC GGGTCGAGAC CGGCGAGCTG ATGGCGCTGC TCGGCCCCTC CGGCTCCGGC
AAGACCACGC TGCTGCGGAT CATCGCCGGG CTGGAATGGC CCGACGCCGG CTCGATCGCG
TTCGACGGCG AGGACGCGCT GGCGCGCGGC GCCGCCGAGC GCCATGTCGG CTTCGTGTTC
CAGCACTACG CGCTGTTCCG GCACATGAGC GTGTTCGAGA ACGTCGCCTT CGGTCTGCGG
GTGCAGCCGC GCAAGATCCG CAAGAGCGAG GCGGAGATCA GAAAGCGCGT CGGCGATCTG
CTCGATCTGG TGCAGCTCGG CTGGCTCGCC GACCGCTATC CGAACCAGCT CTCCGGCGGC
CAGCGCCAGC GCATCGCGCT CGCCCGCGCG CTGGCGATCG AGCCGCGCAT CCTGCTGCTC
GACGAGCCGT TCGGCGCGCT CGACGCCAAG GTGCGCAAGG AACTACGCGC CTGGCTGCGC
AATCTGCACG AGGAGATCCA CGTCACCTCG ATCTTCGTCA CCCACGATCA GGAAGAGGCG
CTCGAAGTCG CCAACCGCGT GGTGGTGATG GACAAGGGCC GGATCGAACA GATCGGCTCG
CCCGGCGACG TCTACGAGCG CCCGGCCTCG GCCTTCGTGC ACGGCTTCAT CGGCGAATCC
ATCGTGCTGC CGGTCGAGGT GCGCGACGGC CGCGTGCGAT TGGGCGACCG CGTGCTCGAT
CTGGCGCCGA CCGACACGGC CTCCGGCCCG TCGAAACTGT TCGTCCGCCG CCACGATGTC
GCGGTCGGCC CCAGCGGCAG CGGCGTGTTC GAGGGCGCGG TCAAGTCGGT GCGCGCGTTC
GGCCCGATGC AGCGCGCCGA TATCGTGCTG CAAGGCGTCG GCGGCGACAC GCTGGTCGAG
ATCGACGCGC CGCGCGACCA CTCACTCAAG GTCGGCGACC GCATCGGCCT GCAGCCGCAG
CGCTACCGGA TTTTCGCCGA TCGCTGA
 
Protein sequence
MTIEVRNIVK QFGSFRALDN VDLRVETGEL MALLGPSGSG KTTLLRIIAG LEWPDAGSIA 
FDGEDALARG AAERHVGFVF QHYALFRHMS VFENVAFGLR VQPRKIRKSE AEIRKRVGDL
LDLVQLGWLA DRYPNQLSGG QRQRIALARA LAIEPRILLL DEPFGALDAK VRKELRAWLR
NLHEEIHVTS IFVTHDQEEA LEVANRVVVM DKGRIEQIGS PGDVYERPAS AFVHGFIGES
IVLPVEVRDG RVRLGDRVLD LAPTDTASGP SKLFVRRHDV AVGPSGSGVF EGAVKSVRAF
GPMQRADIVL QGVGGDTLVE IDAPRDHSLK VGDRIGLQPQ RYRIFADR
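
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method, of how one might strip the blocks, compute GC content, and translate the gene. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard genetic code; translation table 11 uses the same codon-to-amino-acid assignments and differs only in which start codons are permitted, which this sketch does not model. Only the first and last lines of the gene sequence are included here as excerpts.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the record: first and last lines of the gene sequence.
first_line = "ATGACCATTG AAGTCCGCAA CATCGTCAAG CAATTCGGCA GTTTCCGCGC GCTCGACAAT"
last_line = "CGCTACCGGA TTTTCGCCGA TCGCTGA"

print(translate(first_line))  # compare with the start of the protein sequence
print(translate(last_line))   # compare with the end of the protein sequence
```

Passing the full gene sequence to `gc_percent` and `translate` would allow a comparison with the GC content and protein sequence reported in this record.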