Gene RPB_2096 details


Gene Information

Locus tag: RPB_2096
Symbol:
ID: 3908510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2382584
End bp: 2383648
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 66%
IMG OID: 637883989
Product: ABC transporter related
Protein accession: YP_485713
Protein GI: 86749217
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
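
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(2382584, 2383648)
print(length)                     # 1065
print(protein_length_aa(length))  # 354
```

Both values agree with the Gene Length and Protein Length fields of this record.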


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00542123
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGTCGG TGCAGATTCA CGACGTGCGG AAATCATTCG GCGGCTTCGA AGTCCTGCAT 
GGCGTGACGG TTCCGATCGA GGACGGCGCC TTCGTGGTGC TGGTCGGCCC TTCGGGCTGC
GGCAAGTCGA CTTTGCTGCG AATGCTCGCG GGGCTGGAAA AAATCACTTC CGGGACGATC
TCGATCGGCG ACCGCATCGT CAACGACGTG CAGCCGAAGG AACGCGACAT CGCGATGGTG
TTCCAGAACT ACGCGCTGTA TCCGCACATG ACCGTCGCCC AGAACATGGG CTTCTCGCTC
AAGCTGCGCG GTGCCGACCA GAAGGCGATC GACGACAAGG TCAATCGCGC CGCCGACATT
CTCGATCTGC GCAAACTGCT CGACCGCTTC CCGCGGCAGC TCTCCGGCGG CCAGCGCCAG
CGCGTCGCGA TGGGCCGGGC GATCGTGCGC GATCCGCAGG TGTTCCTGTT CGACGAGCCG
CTGTCGAATC TCGACGCCAA GCTGCGCGTG GCGATGCGCA CCGAAATCAA GGAGCTGCAT
CAGCGGCTGA AGACCACGAC GGTGTACGTC ACCCACGACC AGATCGAGGC GATGACCATG
GCCGACAAGA TCGTGGTGAT GCAGGACGGC ATCGTCGAGC AGATCGGCGC ACCGCTCGAT
CTCTACGACA ACCCCGCCAA CAAATTCGTC GCCGGCTTCA TCGGCTCGCC GGCGATGAAC
TTTCTCGACG GCACGCTGAC GGTCGATGGC GGCCAGCCCT TCGTCGAGAC CGCGAACGGC
GCGCGGCTGC CGATCACCGA GGCGCCGGCG GGCGGCAACG GGCGTCCGAT CACTTACGGC
ATCCGCCCCG AGCATCTCGA CTTCGCCGAC ACCGGCATCG CGGCGGAGGT GGTGGTGGTC
GAGCCGACCG GATCGGAAAC CCAGATCGTC GCCCGCGTCG GCGCGCAGGA GATCATCGCG
GTGTTTCGCG AGCGGCACCG GGTGCAGCCC GGTGACGTCA TCCATCTGCA GCCGCGGCCG
CAGGTCGCTC ATCTGTTCGA CAGGGAGACC GGCGCGCGGC TCTGA
 
Protein sequence
MASVQIHDVR KSFGGFEVLH GVTVPIEDGA FVVLVGPSGC GKSTLLRMLA GLEKITSGTI 
SIGDRIVNDV QPKERDIAMV FQNYALYPHM TVAQNMGFSL KLRGADQKAI DDKVNRAADI
LDLRKLLDRF PRQLSGGQRQ RVAMGRAIVR DPQVFLFDEP LSNLDAKLRV AMRTEIKELH
QRLKTTTVYV THDQIEAMTM ADKIVVMQDG IVEQIGAPLD LYDNPANKFV AGFIGSPAMN
FLDGTLTVDG GQPFVETANG ARLPITEAPA GGNGRPITYG IRPEHLDFAD TGIAAEVVVV
EPTGSETQIV ARVGAQEIIA VFRERHRVQP GDVIHLQPRP QVAHLFDRET GARL
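
The two sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that layout, compute a GC fraction, and translate the gene sequence so it can be compared with the protein sequence. It rests on stated assumptions: the helper names are hypothetical, the codon table is written out by hand rather than taken from a library, and translation table 11 is treated as having the same amino-acid assignments as the standard code (it differs in the set of permitted start codons, which does not matter here because the gene begins with ATG).

```python
# Codon assignments in TCAG order. Table 11 shares these assignments with the
# standard code; only the allowed start codons differ.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = clean_sequence("ATGGCGTCGG TGCAGATTCA CGACGTGCGG")
print(translate(first_blocks))  # MASVQIHDVR
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field of the record.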