Gene RPB_0942 details


Gene Information

Locus tag: RPB_0942
Symbol:
ID: 3909796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1089488
End bp: 1090711
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 64%
IMG OID: 637882835
Product: twin-arginine translocation pathway signal
Protein accession: YP_484563
Protein GI: 86748067
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
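
The coordinate and length fields of this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are made up for this example, and it assumes that the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 1089488
end_bp = 1090711
reported_gene_length = 1224      # bp
reported_protein_length = 407    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length:", gene_length, "bp")
print("protein length:", protein_length, "aa")
print("consistent with record:",
      gene_length == reported_gene_length
      and remainder == 0
      and protein_length == reported_protein_length)
```

Under these assumptions the reported gene length and protein length agree with the start and end coordinates.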


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAGCG ACAACAAACA CGCGTTCACC CGCCGCCGTT TCCTCTCCAA CTTCGCCTTC 
GCCGGCACCG CGATCGCCAC CGGCGTCGGC AGCTGGGTCG TCCCGGCCGG CTGGGCCAAC
GCCGCCGCCG GCCCGATCAA GGTCGGCATC GGCACCGACC TCACCGGCCC GATGGGTTAT
GCCGGGAACG CCGATGCCAA CGTCGCCAAG ATGGTGATCA AGCAGATCAA CGACGCCGGC
GGCCTGCTCG GCAGGCCGAT CGAACTCTTC ATCGAGGACA CCGCGTCCAA CGAGGCCGTC
GCGGTCGGCA ACGTCCGCAA GCTGATCCAG CGCGACAAGG TCGACCTCGT GGTCGGCGGC
ATCACCTCGT CGATGCGCAA CGCCATCAAG GACGTCATCG TGTCGCGCGG CAAGACACTC
TACATCTATC CGCAACTCTA CGAAGGCAAG GAATGCACGC CGAACCTGTT CTGCACCGGC
CCGACCCCGG CGCAGCAGTG CGACGAGTTC ATCCCGTGGC TGATCAAGAA CGGCGGCAAG
AAATTCGCGC TGCCCTCCGC CAATTATGTC TGGCCGCACA CGCTCAACGT CTATGCCCGC
AAGGTGATCG AGGCCAATGG CGGCGAAGTG GTGCTCGAGG AGTACTACCC GCTTGATCAG
ATCGACTTCT CGTCGACCGT CAACCGCATC ATCTCCAACA AGGTCGATGT GGTGTTCAAC
ACCGTGATCC CGCCCGGTGT CGGCCCGTTC TTCAAGCAGC TTTATGAAGC GGGGTTCCTC
AAGAACGGCG GCCGGCTCGC CTGCGTGTAT TATGACGAGA ACACGCTCGG CATCAATCAG
CCGGCGGAGA TCGAAGGGCT GGCGAGCTGC CTCGACTACT TCAAGGCGCT CACCAAGGAC
GAGCCGTTCT CCGCCAAGCT GCAGGCCGAC TACGAAAAGG CGTTCCCGGG CAACTTCCTG
TTCGCGGCCG GCAGCGCCGC CACCGGCACC TATCGGGCCC TCAAGCTGTG GGAAGCCGCG
GTGAAGGAAG CCGGCAAGAT CGACCGCGAC GGCGTCGCCG CGGCGCTCGA TCACGCCAAG
ATCGCCGAAG GCCCGGGAGG CCCCGCCGAG ATGGTCCCCG GCAAACGCCA CTGCAAGATG
AACATGTACA CCGCCGTCGC CAAGAACGGC AGCTACGAGA TCGTCGAGCG CAGCAAGGGG
CTGGTGGATC CGAAGGAATG CTGA
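
A GC content figure like the one reported for this gene is the fraction of G and C bases in the nucleotide sequence. The sketch below shows one way to compute it from sequence text laid out in space-separated blocks of ten, as above. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and this is not necessarily how the database derives its value. The example input is only the first line of the gene sequence, so its result will differ from the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record (60 bases), used as sample input.
first_line = "ATGTCCAGCG ACAACAAACA CGCGTTCACC CGCCGCCGTT TCCTCTCCAA CTTCGCCTTC"

print("bases:", len(clean_sequence(first_line)))
print("GC content: {:.0%}".format(gc_fraction(first_line)))
```

To check the reported whole-gene value, pass the complete gene sequence text to `gc_fraction` instead of the single sample line.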
 
Protein sequence
MSSDNKHAFT RRRFLSNFAF AGTAIATGVG SWVVPAGWAN AAAGPIKVGI GTDLTGPMGY 
AGNADANVAK MVIKQINDAG GLLGRPIELF IEDTASNEAV AVGNVRKLIQ RDKVDLVVGG
ITSSMRNAIK DVIVSRGKTL YIYPQLYEGK ECTPNLFCTG PTPAQQCDEF IPWLIKNGGK
KFALPSANYV WPHTLNVYAR KVIEANGGEV VLEEYYPLDQ IDFSSTVNRI ISNKVDVVFN
TVIPPGVGPF FKQLYEAGFL KNGGRLACVY YDENTLGINQ PAEIEGLASC LDYFKALTKD
EPFSAKLQAD YEKAFPGNFL FAAGSAATGT YRALKLWEAA VKEAGKIDRD GVAAALDHAK
IAEGPGGPAE MVPGKRHCKM NMYTAVAKNG SYEIVERSKG LVDPKEC
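
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. As a minimal sketch, the code below builds a codon table from the standard amino acid ordering (table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted) and translates the first three and the last three blocks of the gene sequence. The function name `translate` and the choice to stop at the first stop codon are assumptions made for this illustration.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate block-formatted DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last three blocks of the gene sequence in this record.
gene_start = "ATGTCCAGCG ACAACAAACA CGCGTTCACC"
gene_end = "CTGGTGGATC CGAAGGAATG CTGA"

print(translate(gene_start))  # MSSDNKHAFT
print(translate(gene_end))    # LVDPKEC
```

The two printed fragments match the first block and the final residues of the protein sequence above. The sample fragments are in frame only because the preceding part of the gene has a length divisible by three.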