Gene Rpal_2879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2879 
Symbol 
ID6410548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3141012 
End bp3141965 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content64% 
IMG OID642712759 
Productaliphatic sulfonates family ABC transporter, periplsmic ligand-binding protein 
Protein accessionYP_001991862 
Protein GI192291257 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGTCGT CGCGTAGATC GTTGTTCGCC CTTGCCTTCG CCGCGCTCGC GGCATCGTCC 
ATCGGCTTCG CCTCTGCGGC GGACCTGAAG GAAGTCCGCA TCGGCTTCCA GAAAGCTGGC
ATTCAGCCTG CCGTGAAGGA ACGCGGCGTC CTCGAAGCGG CGCTGAAGGA AAAAGGCCTT
TCCGTGAAGT GGGTCGAGTT CGCCTTCGGG CCGCCGCTGC TCGAAGCGCT CAACACCGGC
AATATCGATT TCGGTTACAC CGGCGACACC CCGCCGATCT TTGCGCAGGC CGCAGCGGCC
AACCTGCTGT ACGTTGCCGC CCTGCCCGGC TCCGGCAAGA ACGAAGGAAT CGTCGTTCCC
GCGAACTCGC CGATCAAGTC GGTCGCCGAT CTCAAAGGCA AGCGGCTCGC TATTCCGAAA
GGATCGAGCG CGCATAACAC CGCAGTCGCC ATTCTCGAAA AGGCAGGCCT GCAGTTCACC
GACGTGACCG CGGTGTACCT GCCGCCTGCC GATGGCACCG CGGCTTTCGC CGGCGGAACG
GTAGACGCGT GGGCGATCTG GGATCCGTAC CTCGCACTGG CCGAGAAGAG CGGCGCCCGC
GTGCTGAGCT TCGCCGGCGA CGCCCACGAC TCGATCGGCT TCTTCCTTGC CAACCGCGAA
TTCACCAATG CCCATGGCAA CCTCGTCGCC TTGTTGAACC AGACTTTCGC CAAGGAAGCG
CAGTGGGCAA ACGGCCATCG CGACGAGATC ACCAAATCCC TCGCCGCCTC GACCGGCGTC
GATCCCGCCG TGGTCAGCAC TCTGGTCGGG CGATCGGTGT TTGAGGTCAC GCCGGTCACC
GACAAGATTC TGGCGGAGCA GCAACAGACA GCGGACCGGT TCCACAAGCT CGGCCTGATC
CCGAAACCGA TCAACGTCCG CGACATCGTC TGGAAGTGGT CGCCGGCGTC CTGA
 
Protein sequence
MMSSRRSLFA LAFAALAASS IGFASAADLK EVRIGFQKAG IQPAVKERGV LEAALKEKGL 
SVKWVEFAFG PPLLEALNTG NIDFGYTGDT PPIFAQAAAA NLLYVAALPG SGKNEGIVVP
ANSPIKSVAD LKGKRLAIPK GSSAHNTAVA ILEKAGLQFT DVTAVYLPPA DGTAAFAGGT
VDAWAIWDPY LALAEKSGAR VLSFAGDAHD SIGFFLANRE FTNAHGNLVA LLNQTFAKEA
QWANGHRDEI TKSLAASTGV DPAVVSTLVG RSVFEVTPVT DKILAEQQQT ADRFHKLGLI
PKPINVRDIV WKWSPAS