Gene RPB_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2054 
Symbol 
ID3909869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2335842 
End bp2337230 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content62% 
IMG OID637883947 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_485672 
Protein GI86749176 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.568287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACGT TCGACAATCC GTTCGATCCC AACCGTCGCC TCACCGCAGG CGGATGTAGC 
TGCGGCCGGC ACGTCAACGA AGCCGAACAT GCGGCGGATC CGTCGTCGGC GCTGCAGCCG
ACAATGCTGG AGAGCGACGA CAAGAAGTTC GAGGGCGTGG TCGCCTCCGC GGTGATGCGG
GCGATGTTTC CGCAAGACGC CTCGCGGCGC GCCTTTCTGA AATCGGTCGG CGCCGCAACC
GCTCTGGCGG CGATCTCGCA GTTCTTTCCG CTACAGACCG CCACCGAGGC GTTCGCCTCC
GGCGGTCCGC TGGAGAAGAA GGACCTCAAG GTCGGCTTCA TCCCGATCAC CTGCGCCACG
CCGATCATCA TGGCCGCCCC GATGGGGTTC TATTCGAAAT ACAGCCTCAA CGTCGAAGTC
ATCAAGACCG CGGGTTGGGC GGTGATCCGC GACAAGACCA TCAACAAGGA ATACGACGCC
GCGCACATGC TGTCGCCGAT GCCGCTCGCC ATCACCATGG GCGTCGGCTC GAATCCGATC
CCCTACACCA TGCCGGCGGT CGAGAACATC AACGGCCAGG CCATCACTTT GGCGATGAAG
CACAAGGACA AGCGCAATCC GAAGGATTGG AAGGGATTCA AATTCGCGGT CCCGTTCGAC
TATTCGATGC ACAACTATCT GCTGCGCTAT TATCTCGCCG AACACGGCCT CGATCCCGAC
GTCGACGTGC AGATCCGCGC GGTGCCGCCG CCGGAAATGG TCGCCAATCT GCGCGCCGAC
AATATCGACG GCTATCTCGC GCCCGACCCG ATGAACCAGC GCGCGGTGTA TGACGGCGTC
GGCTTTATCC ACATCCTGAC CAAGGAGATC TGGGACGGCC ACCCGTGCTG CGCCTTCGCC
GCGTCGAAGG AATTCGTCAC CACGATGCCC AACACCTACG GCGCGCTCTT GAAATCGATC
ATCGAGGCCA CCGCCTACGC CCACAAGCCG GAGAACCGCA AGGAGATCGC CGCCGCGATC
GCGCCGGCCA ACTACCTGAA CCAGCCCGCG ATCGTGCTGG AGCAGATCCT CACCGGCACC
TATGCGGACG GCCTCGGCAA CATCATCAAG CAGCCGAACC GGATCGACTT CGACCCGTTC
CCCTGGCAGT CCTTCGCGGT CTGGATCATG ACCCAGATGA AGCGCTGGGG ACAGGTCAAG
GGCGACGTCG ACTACAAGGC GATCGCCGAG CAGGTCTATC TGGCGACCGA CACCGCGAAA
CTGATGAAGG AAGCGGGCCT CACCCCGCCG ACCACGACCT CGCGGTCGTT CTCGGTGATG
GGCAAGTCGT TCGACGGCTC GAATCCGGAA GAATATCTCG CGAGCTTCAA GATCAAGAAG
GCCTCGTGA
 
Protein sequence
MSTFDNPFDP NRRLTAGGCS CGRHVNEAEH AADPSSALQP TMLESDDKKF EGVVASAVMR 
AMFPQDASRR AFLKSVGAAT ALAAISQFFP LQTATEAFAS GGPLEKKDLK VGFIPITCAT
PIIMAAPMGF YSKYSLNVEV IKTAGWAVIR DKTINKEYDA AHMLSPMPLA ITMGVGSNPI
PYTMPAVENI NGQAITLAMK HKDKRNPKDW KGFKFAVPFD YSMHNYLLRY YLAEHGLDPD
VDVQIRAVPP PEMVANLRAD NIDGYLAPDP MNQRAVYDGV GFIHILTKEI WDGHPCCAFA
ASKEFVTTMP NTYGALLKSI IEATAYAHKP ENRKEIAAAI APANYLNQPA IVLEQILTGT
YADGLGNIIK QPNRIDFDPF PWQSFAVWIM TQMKRWGQVK GDVDYKAIAE QVYLATDTAK
LMKEAGLTPP TTTSRSFSVM GKSFDGSNPE EYLASFKIKK AS