Gene Rpal_2884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2884 
Symbol 
ID6410553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3145580 
End bp3146527 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content62% 
IMG OID642712764 
Productaliphatic sulfonates family ABC transporter, periplsmic ligand-binding protein 
Protein accessionYP_001991867 
Protein GI192291262 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.877922 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGCC GACAATTCCT CCAATTATCT GCCGGAACCG CTCTACTGCC GATTCTGTCA 
CGCTCAGCCG CAGCTGAGCC GCTGACCGAG ATCCGCATCG GCTATCAGAA GAACGGCGTG
TTGGTAATCG CGCGGCAACG GCGGACGCTG GAGGATCACT TCGCCGCACA GAATATCGGC
ATCAAATGGC TGGAATTTTC GTCCGGTCCC CCGATGCTCG AGGCGATGAA CGTCGGCAGC
ATTCACTATG GCGCCGTCGG TGACGCGCCG CCGATCTTTG CCCAGGCTGC TGGCGCTGCA
ATCGTCTACG CAGCAGGCCA GCCGATCACC AACGGTCAGG GTATCCTGGT TCCGAAAGAT
TCGCCGATCC GCGGTCTCGC CGATCTCAAG GGCAAGCGCA TCGGCTTCAC AAAGGGGTCG
AGCGCCCACA ATGTGGTCTT GCTGGCGCTC AAGAAGGCCG GCCTGACCTA TGGCGACATC
ACGCCGGTCT ACCTGTCGCC GCCGGACGCC GGCCCCGCAT TCGCGCAAGG CGGCATCGAT
GCCTGGTCGA TCTGGGACCC ATATTTCGCG ATCGGCGAAT TGAAGCAGAA TGGGCGCGTG
CTGATCAATG CATCCGAGGT CGGCCGGACC AACTCGTTCT ACATCGCCAA CCGTGAATTC
GCTCAACGAA ATGCGTTGAT CCTCAAGCAG ATCATCGACG TCACCAGCGC GACCGCGCGA
TGGGCAGAAG ATCATCGCGG CGACGTCGCT CAGTCCCTCA GTGCCGTCAC AGGCATCCCG
CTCGACATCC AGACGATCGC TGCCAACCGG TCGTCATTTG TGGTTGGGCC GGTGACCGAC
GAGATCGTCT CGACCCAGCA GGACGTCGCC GACCGCTTCC ATCAGCTAGG CCTGATTCCC
CGCCCGATCG TGGTGCGCGA TGCAGTGTGG CGGCCGCCAC AGGCTTGA
 
Protein sequence
MQRRQFLQLS AGTALLPILS RSAAAEPLTE IRIGYQKNGV LVIARQRRTL EDHFAAQNIG 
IKWLEFSSGP PMLEAMNVGS IHYGAVGDAP PIFAQAAGAA IVYAAGQPIT NGQGILVPKD
SPIRGLADLK GKRIGFTKGS SAHNVVLLAL KKAGLTYGDI TPVYLSPPDA GPAFAQGGID
AWSIWDPYFA IGELKQNGRV LINASEVGRT NSFYIANREF AQRNALILKQ IIDVTSATAR
WAEDHRGDVA QSLSAVTGIP LDIQTIAANR SSFVVGPVTD EIVSTQQDVA DRFHQLGLIP
RPIVVRDAVW RPPQA