Gene Rpal_0129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0129 
Symbol 
ID6407772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp142414 
End bp143694 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content64% 
IMG OID642710038 
Productextracellular solute-binding protein family 1 
Protein accessionYP_001989167 
Protein GI192288562 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.57996 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAAGA GATGGATCGC CGCAGCCGCG ATGACGCTGG TTGCGGGCAT CGCCCACGCC 
CAGCAGCAGA CCGAAGTGGT GCTGCAGTTC CCGTATCCGG AACTGTTCAG CGAGACCCAC
AAGCGGATCG CCGAAGAGTT CGCCAAGGTG CATCCGGAGA TCAAGGTCAC GTTCCGGGCG
CCGTACGAGT CGTACGAAGA GGCCACCCAG AAGGTGCTGC GCGAGGCGGT GACCAATCAG
CTGCCGGACG TCACCTTCCA GGGCCTGAAC CGCATCCGCG TGCTGGTCGA CAAGACCATC
CCGGCGCCGC TCGATGGCTA CATCGCGGCC GAGAAGGATT TCGACAAGCA GGGCTTCCAT
CAGGCGATGT ACGACATCGG CACCGCCAGC GGCAAAGTCT ATGCGCTGCC GTTCGCGATC
TCGCTGCCGA TCGTCTACGT CAATCTCGAT CTGGTGAAAC AGGCCGGCGG CGATGTGAAC
AACCTGCCGA CCACCTGGGA CGGCCTGTTG GATCTGGCCA AGAAGGTCAA GGCGCTCGGC
CCCGACACCA ACGGCATCAC GTATGCCTGG GACATCACTG GCAACTGGCT GTGGCAGGCG
CCGGTGTTCG CACGCGGCGG CACCATGCTC AACGCCGATG AAACCAAGGT GGCGTTCGAC
GGCCCCGAAG GCCAGTTCGC GATGCGCACC ATCGCCCGCC TGGTCACCGA AGGCGGCATG
CCGAATCTCG ACCAGCCGTC GATGCGCGCC ACCTTCGCGG CCGGCAAGAC CGGAATCCAC
ATCACCTCGA CCTCGGACCT GAAGAAGACC ACCGACATGA TCGGCGGCAA GTTCGCGCTG
AAGACCATCG CGTTCCCGGA CGTCGTCAAG CCGAACGGCC GGTTGCCGGC GGGCGGCAAT
GTCGTGCTGA TCACCGCCAA GGACAAAGCC AAGCGCGATG CCGCCTGGGA AGTGGTGAAG
TTCTGGACCG GTCCGAAGGG CGCGGCGATC ATGGCTGAGA CCACCGGCTA CATGCCGCCC
AACAAGGTCG CCAACGACGT CTATCTGAAG GATTTCTACG CCAAGAACCC GAACAACTAC
ACCGCGGTCA GCCAGCTCGC GCTGCTGACC AAGTGGTATG CGTTCCCGGG CGACAACGGC
CTGAAGATCA CCGACGTGAT CAAGGATCAT CTCAACTCGA TCGTCTCCGG CGCCCGCGCC
AAGGAGCCGG ATGCGGTGCT GGCCGACATG ACCCGCGACG TCCAGAACCT GCTGCCGAAG
ACCGTCGGCG CCGCCAAGTA A
 
Protein sequence
MLKRWIAAAA MTLVAGIAHA QQQTEVVLQF PYPELFSETH KRIAEEFAKV HPEIKVTFRA 
PYESYEEATQ KVLREAVTNQ LPDVTFQGLN RIRVLVDKTI PAPLDGYIAA EKDFDKQGFH
QAMYDIGTAS GKVYALPFAI SLPIVYVNLD LVKQAGGDVN NLPTTWDGLL DLAKKVKALG
PDTNGITYAW DITGNWLWQA PVFARGGTML NADETKVAFD GPEGQFAMRT IARLVTEGGM
PNLDQPSMRA TFAAGKTGIH ITSTSDLKKT TDMIGGKFAL KTIAFPDVVK PNGRLPAGGN
VVLITAKDKA KRDAAWEVVK FWTGPKGAAI MAETTGYMPP NKVANDVYLK DFYAKNPNNY
TAVSQLALLT KWYAFPGDNG LKITDVIKDH LNSIVSGARA KEPDAVLADM TRDVQNLLPK
TVGAAK