Gene Rpal_0952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0952 
Symbol 
ID6408606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1013011 
End bp1014009 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content66% 
IMG OID642710866 
Productputative substrate-binding component of ABC transporter 
Protein accessionYP_001989985 
Protein GI192289380 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0791916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAGAG TTGTCGCCGC CCTGTTCGCC GTCCTGCTGA CCGCCGTGCC GGCGGCTGCG 
CAGACGCCGC TGAAGATCAT GGTGGGCGGC ATCGACAAGC AGATCTATCT GCCGGCCAAG
CTCGCCGCCC AGCTCGGTTT CTTCAAGGAG GAAGGGCTCG ATGTTGAGCT GTTCAACTCC
ACCTCAGGCT CGCAGGCCGC CACCGCGCTG TTGGCGCGCG AAGTCCAGGG CGTCGTCGGA
TTCTACGATC ACACCATCGA CCTGCAGGCC AAGGGCAAGT TCATCACCGA CGTGGTGCAG
TTCTCGGTCG CGCCCGGCGA GGTCGTGCTG GTGAAGGCGG CCGAAGCTGA CAAGCTGAAG
CAGCCCGCGA ACTGGAAGGG CTTGGCGCTC GGCGTGACCG GCCTCGGCTC GGCCACCGAC
TTTCTCACCC GCGCGCTCGC CGCCAAGGCC GGCCTGAAGA TGCAGGACTA CACGCTGGTG
CCGGTCGGCG CCGGCGACAC CTTCCTGGCG GCGATGCAGC AGGGCAAGAT CAGCTCGGGC
ATGACCACCG AGCCGACCGT GCAGCGCGCG CTGAGTTCAG GCACCGCCAA GATCGGCATC
GACCTGCGCT CGCCGGAGCA GACCCGCAAG GCGCTCGGCG GCGACTATCC GGCCGCTTGC
CTGTACATGG ACCGCGGCTG GATGGAGGCA AACAAGCCCA CCGTGCAGAA GCTGGTGAAC
GCCTTCGTCA AGACGCTGAA ATGGATCCAG GCGCATTCGG CCGAAGAGAT CGCCGACAAG
ATGCCGAAGG ATTACTACGC CGGCGACCGC GCCCTCTACG TCCAAGGCCT GCAGGACGGC
AAGGTGCAGT ACTCGCCCGA CGGCATGATG CCGGCCGGCG CTCCGGAATC CGTCGCCAAG
ATCCTTGCCA GCTTCTCGCC CAACCTGCAG GGCAAGACGA TCGATCTGGC CAAGACCTAC
ACGACGGAGT TCGTGGTGAA GGCGAATGCG GGCCAGTGA
 
Protein sequence
MRRVVAALFA VLLTAVPAAA QTPLKIMVGG IDKQIYLPAK LAAQLGFFKE EGLDVELFNS 
TSGSQAATAL LAREVQGVVG FYDHTIDLQA KGKFITDVVQ FSVAPGEVVL VKAAEADKLK
QPANWKGLAL GVTGLGSATD FLTRALAAKA GLKMQDYTLV PVGAGDTFLA AMQQGKISSG
MTTEPTVQRA LSSGTAKIGI DLRSPEQTRK ALGGDYPAAC LYMDRGWMEA NKPTVQKLVN
AFVKTLKWIQ AHSAEEIADK MPKDYYAGDR ALYVQGLQDG KVQYSPDGMM PAGAPESVAK
ILASFSPNLQ GKTIDLAKTY TTEFVVKANA GQ