Gene Rpal_4631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4631 
Symbol 
ID6412317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4991082 
End bp4992092 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content66% 
IMG OID642714510 
Productextracellular solute-binding protein family 1 
Protein accessionYP_001993597 
Protein GI192292992 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.464324 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGTT TCCGTGCCGC TGCCCTCGCA CTTGCCGCCG TTTTCGTGCC GTTCGCCGCC 
GGGGCCGCCG AGCAGGTCAA CGTCTACACC TACCGCGAGA CCAAGCTGGT GCAGCCGCTG
TTCGATGCCT TCACCAAGGA CACCGGGATC GCCGTCAACG TCATTTCGGC GAGTTCGGGC
CTGGAACAGC GGATCAAGGC CGAGGGCGCT GACAGCCCGG CCGACGTGCT GCTGACGGTC
GACATCGGCC GGATCGACGA CGCGGTCGCG GCCGGCATCA GCCAGCCGAT CAACTCGGCC
GTGATCGACG AGATCGTGCC GGCACAGTTC CGCGATCCCA ACGGCCAGTG GGCCGGCATC
TCGATGCGGG CGCGGGTGAT CTACGCCTCG AAAGATCGCG TCAAGCAGGA AGCGATCACC
TATGAGGAAC TGGCCGACCC GAAGTGGAAG GGCAAGATCT GCATCCGCTC CGGCCAGCAC
ATCTACAACA ACGCGCTGTT CGCCGCTTAC GTCGCCAAGC ACGGCGAGGC CAAGGCCGAG
GAATGGCTGC GCGGCCTGAA GGCCAATCTG GCGCAGAAGC CGTCGGGCGG CGACCGCGAG
ACCGCGCGCG ACGTCGCGGC CGGCAAATGC GATCTCGGCA TCGGCAACAC CTACTACTGG
GCGCTGATGC TGAACGATCC CGACAAGAAG GCCTGGGCGG ATGCAACCCG CGTGGTGCTG
CCGACCTTCG AAGGCGGCGG CACCCACGTC AACCTGTCGG GCGTGGTGCT CGCCAAGCAC
GCGCCCAACA AGGCCAACGC GGTGAAGCTG ATCGAATGGC TCGTCGGTGA GAAGGCGCAG
CAGATCTACG CCGACGCCAA CTACGAATAT CCGATCCGCG CCGGCGTGCC GCTCAATCCG
ATCATCGCCG GCTACGGCAA GCTGAAGCCG GATCCGCTGC CGATCGCCAA GATCGCCGCC
AACCGCAAGG CCGCCTCGAC GCTGGTCGAC AAGGTCGGAT TCGACAACTG A
 
Protein sequence
MSRFRAAALA LAAVFVPFAA GAAEQVNVYT YRETKLVQPL FDAFTKDTGI AVNVISASSG 
LEQRIKAEGA DSPADVLLTV DIGRIDDAVA AGISQPINSA VIDEIVPAQF RDPNGQWAGI
SMRARVIYAS KDRVKQEAIT YEELADPKWK GKICIRSGQH IYNNALFAAY VAKHGEAKAE
EWLRGLKANL AQKPSGGDRE TARDVAAGKC DLGIGNTYYW ALMLNDPDKK AWADATRVVL
PTFEGGGTHV NLSGVVLAKH APNKANAVKL IEWLVGEKAQ QIYADANYEY PIRAGVPLNP
IIAGYGKLKP DPLPIAKIAA NRKAASTLVD KVGFDN