Gene Rpal_0106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0106 
Symbol 
ID6407749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp115283 
End bp116884 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content63% 
IMG OID642710015 
Productextracellular solute-binding protein family 5 
Protein accessionYP_001989144 
Protein GI192288539 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0909337 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGTTGA CGAAGCGATC CTTTGTAGTG GGAGCCCTCG GCGGCATTGC GCTGATGGGT 
CTGCCGCTCG AAACCAAAGC GGAAGGTGCG GGGAGCGGCG GAACGCTGGT GATCGGTTCG
ACGCAGGTGC CGCGGCATTT CAACGGCGCC GTCCAGTCCG GTATCGCCAC CGCGCTCCCG
AGCACGCAGA TCTTCGCCAG TCCGCTGCGC TACGACGACA ACTGGAACCC GCAGCCCTAT
CTCGCCGAGT CCTGGGACGT CTCGAAGGAC GGGCTGACGG TGACGCTGAA GCTGGTCAAG
AACGCCACCT TCCACGACGG CAAGCCGGTG ACTTCCGAGG ACGTCGCGTT CTCGATCATG
ACCATCAAGG CCAATCATCC GTTCAAGACG ATGCTGGCCG CGGTCGAGAG CGTCGATACG
CCGGATCCGC ATACGGCGGT GATCAAGCTG GCGCATCCGC ACCCGGCGCT GCTGCTGGCG
ATGTCGCCTG CATTGATGCC GATCCTGCCG AAACACGTCT ATGGCGACGG CCAGGACGTC
AAATCGCATC CGGCCAACCT GAAGCCGATC GGCTCCGGTC CCTACAAGCT CACCGAGTAC
AAGCAGGGCG ACTACTACAC GCTGGAGAAG TTCGACAAAT TCTTCATCCC GGGGCGGCCG
AAGCTCGACA AGATCGTGGT GCGGCTGATT TCGGATCCGA ATGCCCTGAT GGTGTCGGCC
GAGCGCGGCG ATGTTCATAT GGTGCCCTAC TTCACCGGCG TGCGCGACAT TGAGCGGCTG
GAAAAGGCGC CGAACGTCGT CGTCACCGAC AAGGGCTTTG CCGGCATCGG TGCGCTGAAC
TGGCTCGCCT TCAACACCAA GAAGAAGCCG CTCGACGACG TTCGCGTCCG TCAGGCGATC
GGCTATGCGG CGAACCGCGA GTTCATCGTC AAGAAGCTGA TGGGCGGCAA AGCCTTGCCC
TCGACCGGAC CGATCGCGCC GGGCTCGCCG CTGGAAGAGA AGAACGTCGA GCAGTACAAA
TTCGACATCG CCAAGGCCAA CAAGCTGCTC GACGAGGCTG GGCTCAAGCC GGACGGCTCC
GGCGTGCGCA CCACGCTGAC GATCGACTAC ATCCCCGGCA ACGACGAGCA GCAGCGCAAC
GTCGCCGAAT ATCTGCGTTC GGCGCTGAAG CGGGTCGGGA TCAATCTCGA GGTTCGCGCC
GCTCCCGACT TCCCGACCTG GGCCCAGCGT GTCGCCAGTT TCGACTTCGA CATGACGATG
GACACCGTGT TCAACTGGGG CGACCCGGTG ATCGGCGTCG ACCGGACTTA TCTGAGCTCG
AACATCCGCA AGGGCATCAT CTGGTCGAAC ACCCAGCAGT ACGCCAATCC GAAGGTCGAC
GAGATCCTCG GCCAGGCCGC CCAGGAGAGC TCGCCGGACA AGCGCAAGGC GCTGTATTCG
GAGTTCCAGA AGATCGTCGT CGAAGATGCG CCGATCTTCT ACATCAACGC CACTCCGTAC
CACACCTCGT TCGCCAAGGG GCTCGGCAAC CTGCCGACCA CGGTGTGGGG CGTCGCCTCG
CCGCTCGACG AGCTGTACTG GGTGACTCCG CCGAAGAACT GA
 
Protein sequence
MMLTKRSFVV GALGGIALMG LPLETKAEGA GSGGTLVIGS TQVPRHFNGA VQSGIATALP 
STQIFASPLR YDDNWNPQPY LAESWDVSKD GLTVTLKLVK NATFHDGKPV TSEDVAFSIM
TIKANHPFKT MLAAVESVDT PDPHTAVIKL AHPHPALLLA MSPALMPILP KHVYGDGQDV
KSHPANLKPI GSGPYKLTEY KQGDYYTLEK FDKFFIPGRP KLDKIVVRLI SDPNALMVSA
ERGDVHMVPY FTGVRDIERL EKAPNVVVTD KGFAGIGALN WLAFNTKKKP LDDVRVRQAI
GYAANREFIV KKLMGGKALP STGPIAPGSP LEEKNVEQYK FDIAKANKLL DEAGLKPDGS
GVRTTLTIDY IPGNDEQQRN VAEYLRSALK RVGINLEVRA APDFPTWAQR VASFDFDMTM
DTVFNWGDPV IGVDRTYLSS NIRKGIIWSN TQQYANPKVD EILGQAAQES SPDKRKALYS
EFQKIVVEDA PIFYINATPY HTSFAKGLGN LPTTVWGVAS PLDELYWVTP PKN