Gene Rpal_2571 details


Gene Information

Locus tag: Rpal_2571
Symbol:
ID: 6410233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 2778422
End bp: 2780011
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 62%
IMG OID: 642712449
Product: extracellular solute-binding protein family 5
Protein accession: YP_001991559
Protein GI: 192290954
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
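
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2778422
end_bp = 2780011

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1590 529
```

Under these assumptions the result agrees with the listed gene length (1590 bp) and protein length (529 aa).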


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.13754
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAGAC GTGAATTCGC CAAACTGGGC CTGATGGCCG GCGCGCTCGG CATCGGCGGC 
ATCCCGCTCG GCATCACGCG GGCCGCGGGA CAAACCCGCG GCGGCACGCT CAACACCATC
ATTCAGCCGG AGCCGCCGAT CCTGGTCACC GCGCTCAACC AGCAGCAGCC GACGCTGACG
CTGGGCGGCA AGATCTACGA GAGCCTGCTG CGCTATGATT TCGATCTCAA GCCGCTGCCG
GGCCTCGCCC AGTCCTGGGA AGTGTCGCCG GACAAGCTGA CCTACACGTT CAAGCTGTTC
CCCAACATCA CCTTCCACGA CGGCACGCCG CTGACCTCGG AAGACGTGGT GTTCTCGATC
ACGAAGATTC TGATGGAGAA CCACGCGCGC GCCCGCAACA CGTTCTCGCG CATCGACAAG
GCCGAGGCGC CGGATCCGCT CACGGTGGTC TTCACCTTGA AGAAGCCGTT CGCGCCGTTC
CTCACCGCGT TCGATTGCAC CACGGCGCCG ATCGTGCCGA AGCACATCTA CGACGGCACC
GACTATCGCA AGAACCCGGC CAACGCCAAG GCGATCGGCT CCGGCCCGTT CAAGTTGAAG
GAATGGGTGC GCGGTTCGCA CGTCCATCTG GTCAAGCACG AGGGTTACTA TCGTCCGGGT
GAGCCTGTCC TCGACGAGAT CATTTATCGC GTCATCCCGG ATTCCGCGTC GCGTTCGGTG
GCACTGGAGC AGGGGACCGT TCAGCTCACG CAGTGGACCG ACGTTGAGCT GTTCGAGGTG
CCGCGGCTGT CGAAGCTGCC GCATCTGACG ATGACCACCA AGGGCTACGA GTTCTTTGCG
CCGCATACCT GGCTGGAGAT CAACAACCGC ATCGCGCCGA TGAACGACAA GCGGTTCCGG
CAGGCGGTGA TGTATGCGAT CGACCGCAAG GCGCTGCTGA ACCGGATCTA TTTCGGCCTC
GGCAAGGTTG CGACCGGCCC CGTGTCGTCG AAGACCAAGT TCTACGAAAA GGACGTCAAG
AAGTACGACT TCTCGCCCGA GAAGGCGAAG GCGTTGCTCG ACGAGATGGG GCTGAAGCCG
GGCCCCGACG GCAAGCGCGT GACGATTCCC TTCCTGGTGC CGCCCTACGG CGAAACCCAT
CAGCGGACCG CCGAATTCCT GCGACAGTCG CTCGCCCGCG TCGGCATCGA TCTGCAACTG
CAGGGCATCG ATGTCGCGGG ATGGGCCGAG AAATTCAGCA ACTGGGATTT CTCGATGACT
ACGACCACGG TCTATCAGTT CGGCGATCCG GCGCTCGGCG TGTCGCGGAG CTACGTCTCC
TCCAACATCC GCAAGGGCAT CCTGTTCTCC AACACCTGCG GCTATTCCAA TCCGGAAGTC
GATCGGCTGT TCGAGGAGGC CGCGACCGCG ACGTCGGACG ACAAGCGTCA GGAGCACTAC
AGCGCGCTGC AGAAGATCAT GGTCGATGAG GTGCCGGTCA TCTGGCTGCT CGAGATCGAC
TATCCGAACC TCATGGACAA GCGGCTGAAG AACGTGGTGA CGTCGGCGAT CGGCGTGCAC
GACACCTTCG GGACGGTTTC GTTCGGATGA
 
Protein sequence
MNRREFAKLG LMAGALGIGG IPLGITRAAG QTRGGTLNTI IQPEPPILVT ALNQQQPTLT 
LGGKIYESLL RYDFDLKPLP GLAQSWEVSP DKLTYTFKLF PNITFHDGTP LTSEDVVFSI
TKILMENHAR ARNTFSRIDK AEAPDPLTVV FTLKKPFAPF LTAFDCTTAP IVPKHIYDGT
DYRKNPANAK AIGSGPFKLK EWVRGSHVHL VKHEGYYRPG EPVLDEIIYR VIPDSASRSV
ALEQGTVQLT QWTDVELFEV PRLSKLPHLT MTTKGYEFFA PHTWLEINNR IAPMNDKRFR
QAVMYAIDRK ALLNRIYFGL GKVATGPVSS KTKFYEKDVK KYDFSPEKAK ALLDEMGLKP
GPDGKRVTIP FLVPPYGETH QRTAEFLRQS LARVGIDLQL QGIDVAGWAE KFSNWDFSMT
TTTVYQFGDP ALGVSRSYVS SNIRKGILFS NTCGYSNPEV DRLFEEAATA TSDDKRQEHY
SALQKIMVDE VPVIWLLEID YPNLMDKRLK NVVTSAIGVH DTFGTVSFG
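
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation (the two tables differ in which start codons they allow). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments of the standard code,
# which translation table 11 also uses for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display in the record."""
    return "".join(seq.split()).upper()

def translate(seq: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = clean(seq)
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

# First and last display lines of the gene sequence above.
first_line = "ATGAACAGAC GTGAATTCGC CAAACTGGGC CTGATGGCCG GCGCGCTCGG CATCGGCGGC"
last_line = "GACACCTTCG GGACGGTTTC GTTCGGATGA"

print(translate(first_line))  # MNRREFAKLGLMAGALGIGG
print(translate(last_line))   # DTFGTVSFG*
```

The two outputs match the start and end of the listed protein sequence, with the trailing `*` corresponding to the terminal TGA stop codon. Applying `gc_fraction` to the full gene sequence would give a value to compare with the listed GC content.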