Gene RPB_2549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2549 
Symbol 
ID3910338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2924051 
End bp2925151 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content64% 
IMG OID637884447 
ProductABC transporter related 
Protein accessionYP_486164 
Protein GI86749668 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.071435 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAG TGAACCTGCG CAAGGTGGTG AAGCGCTACG ACGACGTCGA GGCCGTCCGC 
GGCATCGATC TCGACATTCC GGACAAGGAA TTCGTGGTGT TCGTCGGCCC GTCCGGCTGC
GGCAAGTCGA CGACGCTGCG GATGATCGCC GGGCTCGAGG AAATCTCGGA CGGCGACATC
GTGATCGGCG GCGACGTCGT CAACGACGTT CCGCCGAAGG ACCGCGACAT CGCCATGGTG
TTCCAGAACT ACGCGCTGTA TCCGCACATG ACGGTCGCCG AGAACATGTC GTTCGGGTTG
CGGCTGAAGA AATATCCCAA GGCCGAGATC AAGCAGCGCG TCGACGAGGC GGCGCGGATG
CTGGACATCA CCGACCTGAT CCACCGCAAG CCGAAGCAGC TCTCCGGCGG CCAGCGCCAG
CGCGTTGCGA TGGGCCGCGC CATCGTCCGC AATCCCAAAG TGTTTTTGTT CGACGAGCCG
CTGTCGAATC TCGATGCGCA ATTGCGGGTG CAGATGCGGT TCGAGATCAA GCGGGTGCAC
CAGAAGGTGC GCACCACCAC GGTCTACGTC ACCCACGACC AGGTCGAGGC GATGACGCTG
GCGGATCGCG TCGTGGTGAT GAACAACGGC CGGATCGAGC AGGTCGGCAC GCCGAACGAG
CTGTATCACC GCCCGGCGAC GCGTTTCGTC GCCGGCTTCA TCGGCTCGCC GTCGATGAAT
TTCATTCCCT GTACGCTGGA GGACAATGCC GGGCGGCTGC AGATTCGGCT GTCGGACACG
CTGGCGATGC CGGTGCCGGA GCAGAAGGCG GCGCATTATC GTGGGCTCGC CCGCGACAAG
AAACTGCAGC TCGGCATCCG GCCGGAGCAC ATCGCCGACG CCAGAGCCAC GCTCGAACCC
GGCGTCGCGG CGTTCGACGC GCTGCTCGAT ATCACCGAGC CGATGGGGAT GGAGACGCTG
ATCTATTTCA ATCTCAACGG CAGCGAGGTC TGCGGCCGGG TCAGCCCGAA TGCCGGGGCG
CGGGATGGCG GCATGTTGCG TTTGGCGGTG GACCTCAACA ATATGCACCT GATAGACGAG
GGGACCGGCC TCGTGATCTG A
 
Protein sequence
MAEVNLRKVV KRYDDVEAVR GIDLDIPDKE FVVFVGPSGC GKSTTLRMIA GLEEISDGDI 
VIGGDVVNDV PPKDRDIAMV FQNYALYPHM TVAENMSFGL RLKKYPKAEI KQRVDEAARM
LDITDLIHRK PKQLSGGQRQ RVAMGRAIVR NPKVFLFDEP LSNLDAQLRV QMRFEIKRVH
QKVRTTTVYV THDQVEAMTL ADRVVVMNNG RIEQVGTPNE LYHRPATRFV AGFIGSPSMN
FIPCTLEDNA GRLQIRLSDT LAMPVPEQKA AHYRGLARDK KLQLGIRPEH IADARATLEP
GVAAFDALLD ITEPMGMETL IYFNLNGSEV CGRVSPNAGA RDGGMLRLAV DLNNMHLIDE
GTGLVI