Gene RPB_3082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3082 
Symbol 
ID3910883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3513218 
End bp3514825 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content62% 
IMG OID637884987 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_486692 
Protein GI86750196 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.49552 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAGCC ACATCAACAG ACGAGTGTTT CTGGGTACGT CCGCCCTTGC GGCGATCGCC 
GCCGGAACGT CACTTTGCTT TCCACCTGCT GCACTCGCGC AGGAGAAGCC GAAGAAGGGC
GGCGTGCTGG TAGCGACCTG GGGTGGCTTC GAGCCGCAGG CCGTGTTCGT TCCCGGCGGC
GGTGGCTCCA GCCCGCTGAT CAGCTCGACC AAGATCCTGG AGCCGCTGCT GCGCCAGGAC
AGCCAGGCCG GCTTCCTTCC CGTGCTGGCC ACCGAGGTCA AGCCTTCGGC CGATTTCAAG
TCGTACGACA TCGTGCTCCG TAAAGGCGTG ACGTGGCACG ACGGCAAGCC GTTCACCGCA
GACGACGTGG TCTTCAGCAT CGAAAAATAC TGGCTGCAGA CGATCGCCAA GGCGGCGCTG
AAGAATTTCT CCGGGGCCGA GGTGACCGAA GGCGGGGTGC GCGTGAGCTT CAAAGAGCCG
ACGCCGGAGT TCTTCTTCAA GTCGGTGCTG GCGACCTCGC TGGTCATTCC GAAGCACGTC
TACGACGGCT CGGAGATCGT TACCAACCCC GCCAATAACG CGCCGATCGG CACCGGTCCG
TTCAAGTTCA AGCAGTGGGT GCGCGGCAGC CATATTGAAT ATGCCGCCAA CGACAAATAT
TGGGATGCGG GCAAGCCCTA TCTCAACGGC CTGGTGATGC GCTACTGGCG CGACGCCGCA
TCGCGCACCG CGGCCTTGGA AGCCGACGAA CTGCAGCTCG GCATCTTCAA CCCGATCCCG
ACCCCGGACA TCGACCGGCT GTCCAAGTCC GGCAAGTTCG TCGCATCGAA CGACGGCTAT
CTCGGCGCGG CCTGGGCTTC GACCATCGAA TTCAACAGCC GCCGTGATAT CGTCAAGGAC
CCGGCCGTGC GCCGCGCACT ACTCACCGCC ATCGACCGGG CGACCATCTC GGACGTTGTG
TATTTCGGCC GCGCCAAGCC CGGCACGTCC TTCGTCAGCT CGACAAACCC GAAATTCTAC
AATCCGAACC TGCCGCGATA CGAGTTCGAC GCGAAGAAAG CCGCCAAGAT GCTCGACGAT
GCCGGCTACC CGAAGAAGGG CAAGTCGCGC TTCAAGGTTC ACTTGCTTGC GGCCGGCTGG
TTCGAGGAGA ACGGCAAGGT CGGCCAGTTC GTGAAGCAGA ATCTGGAAGA CATCGGCGTC
GAAGTGACGC TCACCGTGCC CGACCGAGCG ACTTCGCTGA AGCAGATCTA CGGCGATTAC
GACTACGACA TCGCGCTGTC GAACTACGCG GCGCGGGTCG AGCTGGTTCC CCAGCAGACC
GACTACCTCA GCACGAGCGG GATTGTGAAG GGCGCGGCCT TCCGCAACGC CACCGGCTAT
TCCAATCCCG AAGTCGACGA CATCGTCGCC AAGATGTCGG TGGAGTCCGA CGAAGCCAAG
CGGAAGGACT TGGCCTTCAA ATTGCAAGAG ATTGCAGCGC GAGATCTGCC GATCACCGTT
CTGGTCGAAC TGATCCCGAC GACGATGATG TCGAAGAAGG TCAAGGGCGT CGGCAATCGC
GCCGATATTT CGGCGGACAG CCTGTCGGAC GTTTGGCTCG ACGTCTGA
 
Protein sequence
MTSHINRRVF LGTSALAAIA AGTSLCFPPA ALAQEKPKKG GVLVATWGGF EPQAVFVPGG 
GGSSPLISST KILEPLLRQD SQAGFLPVLA TEVKPSADFK SYDIVLRKGV TWHDGKPFTA
DDVVFSIEKY WLQTIAKAAL KNFSGAEVTE GGVRVSFKEP TPEFFFKSVL ATSLVIPKHV
YDGSEIVTNP ANNAPIGTGP FKFKQWVRGS HIEYAANDKY WDAGKPYLNG LVMRYWRDAA
SRTAALEADE LQLGIFNPIP TPDIDRLSKS GKFVASNDGY LGAAWASTIE FNSRRDIVKD
PAVRRALLTA IDRATISDVV YFGRAKPGTS FVSSTNPKFY NPNLPRYEFD AKKAAKMLDD
AGYPKKGKSR FKVHLLAAGW FEENGKVGQF VKQNLEDIGV EVTLTVPDRA TSLKQIYGDY
DYDIALSNYA ARVELVPQQT DYLSTSGIVK GAAFRNATGY SNPEVDDIVA KMSVESDEAK
RKDLAFKLQE IAARDLPITV LVELIPTTMM SKKVKGVGNR ADISADSLSD VWLDV