Gene Rpal_1571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1571 
Symbol 
ID6409228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1679823 
End bp1680887 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content63% 
IMG OID642711463 
Productputative sulfate ester transporter, periplasmic binding component 
Protein accessionYP_001990578 
Protein GI192289973 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTGCG CGTTGCGGCT GTCCGCGGTT GTGCTGTTTG TTGGTGCTCT GTGGTCTTCG 
GCGTTTGCGG CGGAGCCAGT GACGATCCGC ATCGGCACGC CGGATCAGAG CGCCGGGCCG
ACGCCGTCCG GCGGCATGGG GATCGTCACC TACATCGCCG GCAAGCAGTT ACTCGAGCAG
GAGTTCGCCA AAGACGGCAT CAAAGTCGAA TGGACGTTCT TCAAGGGTGC CGGTCCCGCG
GTCAACGAAG CACTGGCGAA TAAGCAGCTC GATGTCGTCT ATCTTGGCGA CCTCGCCGCG
ATTATCGGCC GCGCCAACGG TCTGGCGACG CGCTTCCTGG TGCCGGTGCG CGGCAACAAT
GCTTACCTCG CGGTACCACT CGACTCCGAC GTCAAGAAGG TCGAGGACCT CAAGGGTAAG
CGTGTCACCG TGTTTAAGGG GACTGCCTAT CAGCTCGTGC TCGATCGGGC GCTGGCCAAA
GCGGGGCTGA GCGAGCGTGA TCTGCAGGTC GTCAACCTGG ATTGGAGCGC GGCGTCGGCT
GCGCTAGCCG CCAAGCAGCT CGACGGCAAC TGGGCCGGCT TGCAGGCGGT GACGCTGCAG
GAAAAGGGGC TGGCGCGGAT CGCGCTGAGC GCTCGCGATC TAGGCCGCGA GTTCACGGTT
CAGAGCGGAT TCCTCGCCCG GGAGGAATTC ATCGCGGCAC ATCCCGATCT CGTCCAACGG
CTCGTCACTG TGGTGGTCAA GGCACAGCGC GATCTGTCGC AGTCGGACCA CCTCGAGGAT
TTCATCATCT TTGCGTCGCA GCGCTCCGGC ATTCCGGCCT CGCTCGGCCG CACCGAATAC
GGCGGAGAGG ATCTGAAGTT TCGGTTCTCG CCGTTGATCG ACGAGTTCGT CATCGACGGG
CTTCGCGTTG GCGTCGAGCA GGCGAAGGAA CTGAACCTGG TCCGAAAGAC TTTCGACGTC
GGCCCGTGGT TTGAGCCGAG GTTCGTCGAC AAGGCGGTCG AAGACCTCGG GCTGAAGAGC
TACTGGCCGC GTTACGACAA ATCCGGGCAG CCGCTGGGGC AATGA
 
Protein sequence
MFCALRLSAV VLFVGALWSS AFAAEPVTIR IGTPDQSAGP TPSGGMGIVT YIAGKQLLEQ 
EFAKDGIKVE WTFFKGAGPA VNEALANKQL DVVYLGDLAA IIGRANGLAT RFLVPVRGNN
AYLAVPLDSD VKKVEDLKGK RVTVFKGTAY QLVLDRALAK AGLSERDLQV VNLDWSAASA
ALAAKQLDGN WAGLQAVTLQ EKGLARIALS ARDLGREFTV QSGFLAREEF IAAHPDLVQR
LVTVVVKAQR DLSQSDHLED FIIFASQRSG IPASLGRTEY GGEDLKFRFS PLIDEFVIDG
LRVGVEQAKE LNLVRKTFDV GPWFEPRFVD KAVEDLGLKS YWPRYDKSGQ PLGQ