Gene RPD_1045 details

Gene Information

Locus tag: RPD_1045
Symbol:
ID: 4021521
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1198041
End bp: 1199264
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 63%
IMG OID: 637961237
Product: twin-arginine translocation pathway signal
Protein accession: YP_568184
Protein GI: 91975525
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
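
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the record itself does not state either convention, and the function names `span_length` and `expected_protein_length` are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1198041
END_BP = 1199264
GENE_LENGTH_BP = 1224
PROTEIN_LENGTH_AA = 407


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1224
    print(expected_protein_length(GENE_LENGTH_BP))  # 407
```

Under these assumptions both derived values agree with the listed Gene Length (1224 bp) and Protein Length (407 aa).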


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.457214
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAGCG ACTCACCCCA CAATCTCACC CGCCGCCGCT TCCTCTCCAA CTTCGCCTTC 
GCGAGCACAG GGCTGGCGAC CGGCGTCGGC AGCTGGGTGG TGCGGCCCGA TTGGGCCAAC
GCCGCCGCCG GCGCGATCAA GGTCGGCATC GCCACCGACC TGACCGGCCC GATGGGTTAC
GCCGGCAACG CCGACGCCAA CGTCGCCAAG ATGGTGTTGA AGCAGATCAA CGACGCCGGC
GGCCTGCTCG GCCGTCCTTT GGAGCTCTAC ATCGAGGACA CCGCCTCCAA CGAAGCGGTC
GCGGTCGGCA ACGTCCGCAA GCTGATCCAG CGCGACAAGG TCGATCTCGT GCTCGGCGGC
ATCACCTCGT CGATGCGCAA TGCGATCAAG GACGTCATCG TCGCACGCGG AAAGACGCTG
TACATTTATC CACAGCTTTA CGAAGGCAAG GAATGCACGC CCAACCTGTT CTGCACCGGA
CCGACCCCGG CGCAGCAGTG CGATGAATTC ATCCCGTGGC TGATCAAGAA CGGCGGCAAG
AAATTCGCGC TGCCGAGCGC CAATTACGTC TGGCCGCACA CGCTCAATGT CTATGCCCGC
AAGGTGATCG AGGCCAATGG CGGCGAGGTC GTGCTGGAGG AATACTACCC GCTCGACCAG
ATCGACTTCT CATCGACGGT CAACCGAATC ATCTCCAACA AGGTCGACGT CGTATTCAAT
ACCGTGATCC CGCCGGGCGT CGGTCCGTTC TTCAAGCAAC TCTATGAGGC CGGCTTCCTC
AAGAACGGCG GCAGGCTGGC CTGCGTCTAC TACGACGAGA ATACGCTCGG CATCAATCAG
CCTGCGGAGA TCGAGGGCCT CGCGAGCTGC CTCGACTACT TCAAGGCCGT CGCCAAGACC
GATCCGGTCA GCGCTAAAAT CCAGGCGGAA TACGACAAGG CCTACCCTGG CAACTTCCTG
TTTGCCGCGG GCAGCGCCGC CACCGGTACC TATCGCGGCC TGAAGCTCTG GGAGGCTGCG
GTGAAGGAAG CCGGCAAGAT CGACCGCGAC GGCGTCGCCA CGGCGATGGA TCACGCCAAG
ATCACCGACG GCCCGGGCGG GCCGGCTGAG ATGGTGCCGG GCAAACGGCA TTGCAAGATG
AACATGTACA CCGCGGTGGC CAAGAACGGC AGCTACGAGA TCATCGCCCG CAGTAACGGC
CTGGTCGATC CGAAGGAATG CTGA
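
The GC content field in the Gene Information section can be recomputed from the gene sequence. The sketch below is one way to do this, assuming the sequence is pasted as text with the same whitespace grouping into 10-base blocks shown above; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for grouping, and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C in a whitespace-grouped sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # Paste the full gene sequence here to compare against the listed value;
    # only the final line of the sequence is used in this illustration.
    fragment = "CTGGTCGATC CGAAGGAATG CTGA"
    print(f"{gc_fraction(fragment):.1%}")
```

Rounding the result for the full 1224 bp sequence to a whole percentage gives the figure to compare with the listed GC content.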
 
Protein sequence
MSSDSPHNLT RRRFLSNFAF ASTGLATGVG SWVVRPDWAN AAAGAIKVGI ATDLTGPMGY 
AGNADANVAK MVLKQINDAG GLLGRPLELY IEDTASNEAV AVGNVRKLIQ RDKVDLVLGG
ITSSMRNAIK DVIVARGKTL YIYPQLYEGK ECTPNLFCTG PTPAQQCDEF IPWLIKNGGK
KFALPSANYV WPHTLNVYAR KVIEANGGEV VLEEYYPLDQ IDFSSTVNRI ISNKVDVVFN
TVIPPGVGPF FKQLYEAGFL KNGGRLACVY YDENTLGINQ PAEIEGLASC LDYFKAVAKT
DPVSAKIQAE YDKAYPGNFL FAAGSAATGT YRGLKLWEAA VKEAGKIDRD GVATAMDHAK
ITDGPGGPAE MVPGKRHCKM NMYTAVAKNG SYEIIARSNG LVDPKEC
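
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch that uses the amino acid assignments of the standard genetic code, which translation table 11 shares; table 11 differs in its set of permitted start codons, and that is ignored here because this gene starts with ATG. The name `translate` and the choice to drop one trailing stop codon are assumptions made for the illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a whitespace-grouped, in-frame DNA sequence to protein."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    # Drop a single terminal stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


if __name__ == "__main__":
    first_line = "ATGTCCAGCG ACTCACCCCA CAATCTCACC CGCCGCCGCT TCCTCTCCAA CTTCGCCTTC"
    last_line = "CTGGTCGATC CGAAGGAATG CTGA"
    print(translate(first_line))  # MSSDSPHNLTRRRFLSNFAF
    print(translate(last_line))   # LVDPKEC
```

Each full line of the gene sequence holds 60 bases, so every line starts in frame and can be translated on its own, as done for the first and last lines here.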