Gene RPD_1586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1586 
Symbol 
ID4022066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1780187 
End bp1781365 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content63% 
IMG OID637961781 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_568724 
Protein GI91976065 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.764817 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATTG GCAGACGAAC ACTGCTGCAG GCCGCCTGTA TCGCCGTCGC AACCGGCGCC 
CCGATCGCGG CGGCGCAAGC CGAAGACACC TTCAAGGTCG GCCTGATCGT GCCGATGACC
GGCGGCCAAG CCTCGACCGG CAAGCAGATC GACAACGCGA TCAAGCTGTA CATCAAGAAG
CATGGCGACA CCGTCGCCGG CAAGAAGATC GAGGTGATCC TCAAGGACGA CGCCGCGATT
CCGGACAACA CCAAGCGCCT CGCCCAGGAA CTGATCGTCA ACGACAAGGT CAATGTGATC
GCCGGCTTCG GCATCACGCC GGCCGCGCTC GCCGCGGCGC CGCTCGCCAC CCAGGCGAAG
GTGCCGGAAA TCGTGATGGC CGCCGGCACC TCGATCATCA CCGAACGCTC GCCCTATATC
GTCCGCACTT CGTTCACGCT GGCGCAGTCC TCGATCATCA TCGGCGACTG GGCGGCGAAG
AACGGCATCA AGAAAGTCGC GACGCTGACC TCGGACTATG CCCCGGGCAA CGACGCGCTG
GCCTTCTTCA AGGAGCGCTT CACCGCCGGC GGCGGCGAGA TCGTCGAAGA GATCAAGGTG
CCGCTCGCCA ATCCGGACTT CGCACCGTTC CTGCAGCGCA TGAAGGACGC CAAGCCGGAT
GCGATGTTCG TGTTCGTGCC GGCCGGCCAG GGCGGCAACT TCATGAAGCA GTTTGCCGAG
CGCGGCCTCG ACAAGAGCGG CATCAAGGTG ATCGGTCCCG GTGACGTGAT GGACGACGAT
CTCCTGAACA GCATGGGTGA CGCCGCGCTC GGCGTGGTCA CCGCCCATAT GTACTCGGCG
GCGCATCCCT CGGCGATGAA CAAAGAGTTC GTCGCCGCCT ACAAGAAGGA GTTCGGGCAG
CGGCCGGGCT TCATGGCGGT CGGCGGCTAT GACGGCATCC ATCTCGTCTT TGAGGCGCTG
AAGAAGACCG GCGGCAAGGC CGATGGCGAT TCTCTGATCG CGGCGATGAA AGGCATGAAG
TGGGAAAGCC CGCGCGGTCC GATCTCGATC GATCCGGAAA CCCGCGACAT CGTCCAGAAC
ATCTACATCC GCAAGGTCGA AAAGGTCGAC GGCGAACTCT ACAATATCGA ATTCGCCAAG
TTCGACGCCG TCAAGGACCC CGGCAAGACG AAGAAGTAG
 
Protein sequence
MLIGRRTLLQ AACIAVATGA PIAAAQAEDT FKVGLIVPMT GGQASTGKQI DNAIKLYIKK 
HGDTVAGKKI EVILKDDAAI PDNTKRLAQE LIVNDKVNVI AGFGITPAAL AAAPLATQAK
VPEIVMAAGT SIITERSPYI VRTSFTLAQS SIIIGDWAAK NGIKKVATLT SDYAPGNDAL
AFFKERFTAG GGEIVEEIKV PLANPDFAPF LQRMKDAKPD AMFVFVPAGQ GGNFMKQFAE
RGLDKSGIKV IGPGDVMDDD LLNSMGDAAL GVVTAHMYSA AHPSAMNKEF VAAYKKEFGQ
RPGFMAVGGY DGIHLVFEAL KKTGGKADGD SLIAAMKGMK WESPRGPISI DPETRDIVQN
IYIRKVEKVD GELYNIEFAK FDAVKDPGKT KK