Gene Rpal_1660 details

Gene Information

Locus tag: Rpal_1660
Symbol:
ID: 6409317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 1779123
End bp: 1780742
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 62%
IMG OID: 642711548
Product: extracellular solute-binding protein family 5
Protein accession: YP_001990663
Protein GI: 192290058
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
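
The start, end, gene length and protein length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that encodes no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1779123
end_bp = 1780742

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")    # 1620 bp
print(protein_length, "aa")  # 539 aa
```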


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.40005
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCGTC GTGATTTCCT CAAGTCCGCT ACAGCCCTCG CCGCCGGCGC GATGGTGCCA 
GCACCGGCGA TTTGGTCGGC CGCCAAGGCC GACGCCCGGT CCGAAACCCT GCTGGTTGTC
TCCGAGAGCG GCCCGAACAA CCTCGACATC CACGGCGTCG GCACCAACGT GCCCGGCTAC
GAGGTGTCGT GGAATTGCTA CGACCGGCTG ATCACCCACG AAATGAAGGA AGGCCCCGGC
GGCGTTCCCT ACTACGATAA GGACAAGTTC AAGGGCGAGC TCGCCGACGA CATGGTCATC
GGCGACATGT CTGCGACCTT CAAGCTGAAG AAGAACGCCA CCTTCCAGGA CGGCACCCCG
GTCACCGCCA AGGACGTGAA GTGGTCGCTC GACCGCTCGG TCAGCGTCGG CGGCTTTCCG
ACCTTCCAGA TGAGCGCCGG CTCGCTGACC AAGCCCGAAC AGTTCGTGGT GGTCGACGAT
CACACCGTGC GGGTCGACTT CCTGAAGAAG GACAAGCTCA CGATCCCGGA TCTCGCGGTG
ATTGTGCCCT GCGTCGTCAA TTCCGAACTG GTGAAGAAGA ACGCGACCGA AAAAGACCCG
TGGGGTCTCG AATACACCAA GCAGAACACA GCCGGTTCCG GCGCCTATCG GGTGGTGAAG
TGGACCGCCG GCACCGAAGT GATCATGGAG CGCAACGACA AGTGGGTCGG CGGCCCGCTG
CCGAAAATCA AGCGCGTGAT CTGGCGCATG GTGCCGCAGG CCGGCAACCG CCGGGCACTG
CTGGAGCGCG GTGACGCCGA CATCTCCTAT GAGCTGCCGA ACCAGGACTT CGCCGAGATG
AAGCGCGACG GCAAAGTCAA CGTGGTGTCG TTGCCGATCT CCAACGGCAT CCAGTATCTC
GGCATGAACG TCACCCAGCC GCCGTTCAAC AACCCGAAGG TGCGTGAGGC GGTCGCCTAC
GCGGTGCCAT ATCAGAAGAT CATCGACGCG GTGATGTTCG GCCTCGCCAA CCCGATGTTC
GGCGCGGCGG CCGACAAGGC GACCGAAGTG AAGTGGCCGC AGCCGACCAA GTACAATACC
GACATGGCGA AGGCCAAGGC GCTGATGGCA GAAGCCGGCT ACGCGAACGG CTTCGACACC
ACGCTGTCGT TCGACCTCGG CTTCGCCGGC GTCAACGAGC CGATGTGCAT CCTGATCCAG
GAAAGCCTGG CGCAGATCGG CATCCGCTGC ACCATCAACA AGATCCCCGG CGCCAACTGG
CGCACCGAGC TGAACAAGAA GGTGATGCCG CTCTACGTCA ACATCTTCTC GGGCTGGCTC
GATTATCCGG AGTACTTCTT CTACTGGTGC TACCACTCCG GCAAGTCGAT CTTCAACACC
ATGGGCTACG ACTCGCCCGA GATGGACAGG CTGATCGACA GCTCCCGCAT CGCCGCAGCA
ACCGGCGAAA CCGCGACCTA CGACAGCGAC GTCAAGGGCT TCGTCGACCT CGCCTTCAAG
GACATCCCGC GCGTCCCACT GTACCAGCCC TACCTCAACG TCGCGATGCA GAAGAACGTC
TCCGGCTTCG CCTACTGGTT CCACCGCCGG CTCGACTACC GGACGATGGT GAAGGGCTGA
 
Protein sequence
MKRRDFLKSA TALAAGAMVP APAIWSAAKA DARSETLLVV SESGPNNLDI HGVGTNVPGY 
EVSWNCYDRL ITHEMKEGPG GVPYYDKDKF KGELADDMVI GDMSATFKLK KNATFQDGTP
VTAKDVKWSL DRSVSVGGFP TFQMSAGSLT KPEQFVVVDD HTVRVDFLKK DKLTIPDLAV
IVPCVVNSEL VKKNATEKDP WGLEYTKQNT AGSGAYRVVK WTAGTEVIME RNDKWVGGPL
PKIKRVIWRM VPQAGNRRAL LERGDADISY ELPNQDFAEM KRDGKVNVVS LPISNGIQYL
GMNVTQPPFN NPKVREAVAY AVPYQKIIDA VMFGLANPMF GAAADKATEV KWPQPTKYNT
DMAKAKALMA EAGYANGFDT TLSFDLGFAG VNEPMCILIQ ESLAQIGIRC TINKIPGANW
RTELNKKVMP LYVNIFSGWL DYPEYFFYWC YHSGKSIFNT MGYDSPEMDR LIDSSRIAAA
TGETATYDSD VKGFVDLAFK DIPRVPLYQP YLNVAMQKNV SGFAYWFHRR LDYRTMVKG
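
The gene and protein sequences above are printed in blocks of ten characters, so they need whitespace removed before any computation. The following is a minimal sketch, not the method used by the source database, of how the GC fraction and the translation could be recomputed from the gene sequence. It assumes that translation table 11 assigns codons to amino acids exactly as the standard genetic code does (the two differ in which start codons are permitted), and it does not model alternative start codons. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional T, C, A, G order.
# Assumption: table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence above, used here as a short excerpt.
first_line = "ATGAAGCGTC GTGATTTCCT CAAGTCCGCT ACAGCCCTCG CCGCCGGCGC GATGGTGCCA"
print(translate(first_line))  # MKRRDFLKSATALAAGAMVP
print(round(gc_fraction(first_line), 3))
```

Passing the full gene sequence to `translate` and dropping the trailing `*` gives a string that can be compared with the protein sequence listed above, and `gc_fraction` on the full sequence can be compared with the reported GC content.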