Gene Rpal_0833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0833 
Symbol 
ID6408486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp879448 
End bp881046 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content62% 
IMG OID642710746 
Productextracellular solute-binding protein family 5 
Protein accessionYP_001989866 
Protein GI192289261 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.54369 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACTTA AGAGTTCATG GGTGCTGGGC GCGTTGTTTG CTGCGGCGAG CGCAATCGGT 
GCGGCGCGTG CGGAAACAGT GGTGCGTTAC GGGATCTCGA TGGCCGATAT TCCGCTGACC
ACGGGGCAAC CGGATCGCGG CGCCGGCGCG TATCAGTTCA CCGGCTACAC GATCTATGAT
CCGCTGGTGG CGTGGGAGAT GAACGTCGGC GATCGGCCCG GCAAGCTGGT GCCGGGGCTC
GCCACCGAAT GGAAGGTCGA TGATGCCGAC AAGACCAAGT GGCGCTTCAC CTTGCGCAAG
GGCGTCAAGT TTCACGACGG CAGCGACTTC AACGCCGATG CGGTGATCTG GAATCTCGAC
AAGGTTCTCA ACGACAAGGC GCCGCAATTC GACAAGCGCC AGAGCGCGCA GGTGAAGACC
CGGCTGCCGT CGGTGAAGAG CTACGCCAAG ATCGACGATT CCACCGTCGA GATCACCACC
AAGACGGTCG ACTCGTTCTT TCCCTATCAG ATGCTGTGGT TCCTGGTGTC GAGCCCGGCG
CAGTATGAGA AGGTCGGAAA GGACTGGGAT AAGTTCGCCG CCAATCCGTC GGGCACCGGT
CCCTTCAAGC TCACCAAGCT GGTGCCGCGC GAGCTCGCCG AGCTCACCAA GAACGATGAG
TATTGGGACA AGTCGCGGCT GCCGAAGACC GACAAGCTGG TGCTGATCCC GATGCCTGAA
GCGTTGACCC GCACCAATGC ACTGCTGGCC GGGCAGGTCG ATCTGATCGA GACGCCTGCG
CCCGACGCGG TGCCGCAGCT CAAGGCGGCC GGCATGAAGA TCGTCGACAA CGTCACACCG
CACGTCTGGA ACTATCACCT CAGCGTGCTG CCCGGCTCGC CCTGGACCGA CGTGCGCCTG
CGCAAGGCGC TCAATCTCGC GATCGATCGC GAGGCGGTGG TCGGACTGAT GAACGGCCTC
GCCAAGCCGG CGGTCGGACA GGTCGATCCG TCGAGCCCGT GGTTCGGCAA TCCGTCGTTC
AAGATCAAAT ACGATCTGGC GGAAGCCAAG AAGCTGGTGA AGGAAGCCGG CTATTCGCCG
GAGAAGCCGC TGAAGACCAC CTTCATCATT GCCAATGGCG GCACCGGCCA GATGCTGTCG
CTGCCGATGA ACGAGTTCCT GCAGCAGAGC TTCAAGGAGA TCGGCATCGA CGTCGAGTTC
AAGGTGGTCG AACTCGAAGT GCTGTACACC GCGTGGCGCA AGGGGGCGGC CGATGAATCC
AACGCCGGCA TCACCGCCAA CAACATCGCC TACGTCACCT CCGATCCGCT GTATGCGATC
GTGCGGTTCT TCCATTCCGG GCAGGTGGCG CCGGTCGGCG TCAACTGGGG CGGCTACAAG
AATCCGAAGG TGGATGCGCT GATCGACGAC GCCAAGACCA CGTTCGAGCC GAAGAAGCAG
GACGAACTGC TGGCGCAGGC GCACTCGCTG ATCGTCGACG ACGCAGCGCT GGTGTGGGTG
GTGCACGATA CCAATCCGCA CGCGCTGTCG CCGAAGGTGA AGAGCTTCGT GCAGGCCCAG
CACTGGTTTC AGGACCTGAC CACGATTGGG CTGCAGTAA
 
Protein sequence
MRLKSSWVLG ALFAAASAIG AARAETVVRY GISMADIPLT TGQPDRGAGA YQFTGYTIYD 
PLVAWEMNVG DRPGKLVPGL ATEWKVDDAD KTKWRFTLRK GVKFHDGSDF NADAVIWNLD
KVLNDKAPQF DKRQSAQVKT RLPSVKSYAK IDDSTVEITT KTVDSFFPYQ MLWFLVSSPA
QYEKVGKDWD KFAANPSGTG PFKLTKLVPR ELAELTKNDE YWDKSRLPKT DKLVLIPMPE
ALTRTNALLA GQVDLIETPA PDAVPQLKAA GMKIVDNVTP HVWNYHLSVL PGSPWTDVRL
RKALNLAIDR EAVVGLMNGL AKPAVGQVDP SSPWFGNPSF KIKYDLAEAK KLVKEAGYSP
EKPLKTTFII ANGGTGQMLS LPMNEFLQQS FKEIGIDVEF KVVELEVLYT AWRKGAADES
NAGITANNIA YVTSDPLYAI VRFFHSGQVA PVGVNWGGYK NPKVDALIDD AKTTFEPKKQ
DELLAQAHSL IVDDAALVWV VHDTNPHALS PKVKSFVQAQ HWFQDLTTIG LQ