Gene Rpal_3960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3960 
Symbol 
ID6411641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4249865 
End bp4251190 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content63% 
IMG OID642713841 
Productextracellular solute-binding protein family 1 
Protein accessionYP_001992931 
Protein GI192292326 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGACT TTACCCTCGA TCGCCGCACG TTGTTGAAGG GTGGCGCGAT CACGTTGGCC 
ACGGCAGCGA CGATGTCCGC CGAGCAGTTG CTTGGTTACG CCAAGGCCTG GGCGCAGGCC
TCGCCGTGGA AGCCCGAACC GGGCGCCAAG ATCAATCTGT TGCGCTGGAA GCGGTTCGTC
GAAGCCGAAG ACGTCGCCTT CATGAAGATC GTCGATGCCT TCCAGAAGGC CAACAACGTC
ACCATCAACG TCTCCAACGA GTCCTACGAC GACATCCAGC CGAAGGCGTC GGTGGCTGCC
AACACCGGGC AGGGGCTCGA CATGGTGTGG GGCCTGTACT CGCTGCCGTT CCTGTTCCCG
AACAAATGTA CCGACGTCAG CGACGTCGCC GATTATCTCG CCAAGAAGTG CGGCGGCTGG
AGCGACTCCG GCAAGGCCTA CGGCATGTAT AACGGCAAGT GGATCGGCAT TCCGGTGGCG
GCTACCGGCG GCCTCGTCAA CTACCGGATC AGCGCGGCCG AGAAGGCCGG CCACAAGGAG
TTTCCGAAGG ATCTCGGCGG CTTCTCCGAT CTGGTGAAGG GCCTGAACAA GAACGGCACG
CCGGCCGGCA TGGCGCTGGG CCACGCCTCG GGCGACGCCA ACGGCTGGCT GCACTGGGCG
CTGTGGGCGC ACGGCGGCGC GCTGATCGAC AAGGACAGCA AGGTCGTCGT CAACTCACCA
GAGACCGCCA AGGCGCTCGA ATACGTCAAG GGGCTGTACG AGAACTTCAT TCCCGGCACC
GCGTCGTGGA ACGATGCTTC CAACAACAAG GCGTTTCTCG CCGGCCAGCT TTATCTGACC
ACCAACGGCA TCTCGATCTA CGTCACGGCG AAGAAAGACA ACAAGGAGAT GGCGGCGGAT
ATCAACCACG CGCATCTGCC CGCCGGCCTC AACGGCAAGA CCCGCGAGCT GCATCTCGGC
TTCCCGATCC TGATCTACAA CTTCACCAAG TTCCCCCAGA CCTGCAAGGC ATTCACCGCT
TTCATGATGG AGCCGGAGCA GTTCAACCCG TGGGTCGAGG CCGCGCAGGG CTATCTGTCA
CCGTTCCTAC TCGACTACGA GAAGAACCCG ATGTGGACCG CGGACCCGAA AAACACGCCG
TATCGCGACG TCGCCCGCAC CGCCTCGACG CCGGCCGGCG ATGCCCAGAT GGGCGAGAAC
GCCGCCGCGG CGATCGCCGA CTTCGTCGTG GTCGACATGT TCGCCAACTA CTGCACCGGC
CGCGAAGACG TGAAGACCGC CATGAGCAGC GCCGAACGCG CGGCGAAGCG GATCTTCCGG
GCGTAA
 
Protein sequence
MTDFTLDRRT LLKGGAITLA TAATMSAEQL LGYAKAWAQA SPWKPEPGAK INLLRWKRFV 
EAEDVAFMKI VDAFQKANNV TINVSNESYD DIQPKASVAA NTGQGLDMVW GLYSLPFLFP
NKCTDVSDVA DYLAKKCGGW SDSGKAYGMY NGKWIGIPVA ATGGLVNYRI SAAEKAGHKE
FPKDLGGFSD LVKGLNKNGT PAGMALGHAS GDANGWLHWA LWAHGGALID KDSKVVVNSP
ETAKALEYVK GLYENFIPGT ASWNDASNNK AFLAGQLYLT TNGISIYVTA KKDNKEMAAD
INHAHLPAGL NGKTRELHLG FPILIYNFTK FPQTCKAFTA FMMEPEQFNP WVEAAQGYLS
PFLLDYEKNP MWTADPKNTP YRDVARTAST PAGDAQMGEN AAAAIADFVV VDMFANYCTG
REDVKTAMSS AERAAKRIFR A