Gene Rpal_5167 details

Gene Information

Locus tag: Rpal_5167
Symbol:
ID: 6412867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5570293
End bp: 5571249
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 66%
IMG OID: 642715057
Product: extracellular solute-binding protein family 3
Protein accession: YP_001994130
Protein GI: 192293525
COG category: [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms
COG ID: [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
TIGRFAM ID:
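
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. Both assumptions are consistent with the numbers listed here, but the record itself does not state them. The variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5570293
end_bp = 5571249
gene_length_bp = 957
protein_length_aa = 318

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
expected_cds_bp = 3 * (protein_length_aa + 1)

print(span_bp, expected_cds_bp, gene_length_bp)
```

Under these assumptions all three values agree at 957 bp.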


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCTATA AGACCGCCGG TGCGGAACCC CTAATTATGC AACCACAGAT TGCGATGTTG 
ATCAATGGCC GACCGGCCGG CTGGCGCGCA GCCGTGGCGG GCCTGGCGCT CGCCGTGACG
ACGCTGCTGA CGCCTGGCGC CGCGCGGGCT GCGGAGACCG CGCCCGAGGC GAAGGCGATC
GCGGATGCGA CGGCGCATGC AGTGCCGGGG TTTTGGGATC CGCGGCGGCG TCCGGAGCGG
CCGGACATGT CGCGCCTGAC GATGATCCGG TTCCTGACCG AGATCGATTA TCCGCCGTTC
AACTTCACCG GGGCCGACGG CAATCCGGCG GGGTTCAACG TCGATCTGGC GCGCGCGCTG
TGCGACGAGA TCAAGATCAC CTGCACGGTG CAGATGCGGA AGTTCGAGAC CCTGCTCGAC
GCGCTCGCCG GCAATCGCGG CGATGCCATC ATCGCGTCGC TGGCGGTGAC GCCGCAGACC
CGCACCAAGC TGGACTTCAC CGATCCCTAT TACCGCACGC CGGCGCGCTT CGTCGCCCGC
AAGGATGCGG TGATGCCGGA GATGCGCCCC GAGTTTCTCG AAGGCCGCAA GGTCGGCGCG
GTCGCAGGTT CGGCGCATGA GGCCTATCTC AAGGCGATGT TCACGGACGC CGAGCTGCAT
TCCTATCCGA ATGCCGAGGC GCTGCGTGCC GCGCTGAAGC GCGGCGAGGT GGACTTCATC
TTCGGCGACG CGATCTCGCT GGCGTTCTGG ATCAACGGCA CCGACTCGGA GAATTGCTGC
GCGTTCTCCG GCGCCCCGTT CCTGGAGAGC CGCTATTTCG GCGAGGGCGT CGGCATCGCG
GTGCGCAAGG GCAACGACAC GTTGCGCCAG GCGCTGAATT GGGCGCTGTT CCGGGTTTGG
GAAAAGGGCC AGTACACCGA CTTGTGGCTC CGGTATTTTT CCGTCAGCCC GTTTTGA
 
Protein sequence
MVYKTAGAEP LIMQPQIAML INGRPAGWRA AVAGLALAVT TLLTPGAARA AETAPEAKAI 
ADATAHAVPG FWDPRRRPER PDMSRLTMIR FLTEIDYPPF NFTGADGNPA GFNVDLARAL
CDEIKITCTV QMRKFETLLD ALAGNRGDAI IASLAVTPQT RTKLDFTDPY YRTPARFVAR
KDAVMPEMRP EFLEGRKVGA VAGSAHEAYL KAMFTDAELH SYPNAEALRA ALKRGEVDFI
FGDAISLAFW INGTDSENCC AFSGAPFLES RYFGEGVGIA VRKGNDTLRQ ALNWALFRVW
EKGQYTDLWL RYFSVSPF
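
The gene and protein sequences can be checked against each other by translating the nucleotide blocks with translation table 11. The sketch below is illustrative and not part of the original record. It builds the codon table from the compact NCBI-style amino-acid string, with codons ordered T, C, A, G at each position. Table 11 uses the same amino-acid assignments as the standard code for elongation codons. The helper names `clean` and `translate` are hypothetical. Only the first and last sequence lines are used, to keep the example short. The last line can be translated on its own because the 900 bases before it are a multiple of three.

```python
from itertools import product

# Amino-acid assignments for translation table 11, codons ordered with
# bases T, C, A, G at each of the three positions ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from this record.
first_dna_line = "ATGGTCTATA AGACCGCCGG TGCGGAACCC CTAATTATGC AACCACAGAT TGCGATGTTG"
last_dna_line = "GAAAAGGGCC AGTACACCGA CTTGTGGCTC CGGTATTTTT CCGTCAGCCC GTTTTGA"

# Matching stretches of the protein sequence, copied from this record.
first_protein_blocks = "MVYKTAGAEP LIMQPQIAML"
last_protein_blocks = "EKGQYTDLWL RYFSVSPF"

print(translate(first_dna_line) == clean(first_protein_blocks))
print(translate(last_dna_line) == clean(last_protein_blocks))
```

Both comparisons print True for the lines shown. Running `translate` on the full gene sequence and comparing it with the full protein sequence works the same way.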