Gene RPC_1367 details


Gene Information

Locus tag: RPC_1367
Symbol:
ID: 3972463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 1491378
End bp: 1493165
Gene Length: 1788 bp
Protein Length: 595 aa
Translation table: 11
GC content: 62%
IMG OID: 637924482
Product: extracellular solute-binding protein
Protein accession: YP_531248
Protein GI: 90422878
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
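
The coordinate and length fields above are consistent with each other, which a short calculation can confirm. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the CDS ends in a single stop codon. The function names `span_length` and `coding_codons` are illustrative and do not come from the source database.

```python
# Values copied from the gene record above.
START_BP = 1491378
END_BP = 1493165
GENE_LENGTH_BP = 1788
PROTEIN_LENGTH_AA = 595


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, excluding one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under those assumptions, 1493165 - 1491378 + 1 gives 1788 bp, and 1788 / 3 - 1 gives 595 residues, matching the reported gene and protein lengths.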


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.15999
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACACG CAATCGGACA GCGCCGTCGA TCGACGCGCG CGATTTTTCT CGGCATGGCG 
AGCGCCGCGG CGCTGATCGC GGTCTCGACG GCGCCGGCGC TTGCCGACGA CGCCACCGCG
CAGAAATGGA TCGACGAGGA ATTCCAGCCC TCGACGCTGT CGAAGGAAGA TCAGCTCAAG
GAGCTGCAAT GGTTCGCCAA GGCGGCCGAG CCGTTCAAGG GCATGGACAT CAACGTCGTC
TCCGAAACCA TCACCACCCA CGAATACGAA GCCAAAACGC TGGCCAAGGC GTTTTCGGAA
ATCACCGGCA TCAAGCTCAA GCACGATTTG ATCCAGGAAG GCGACGTGGT CGAGAAGCTG
CAGACCCAGA TGCAGTCCGG CAAGAACGTC TATGACGGCT GGATCAACGA CAGCGATCTG
ATCGGCACGC ATTTCCGCTA CGGCCAGACC ATCGCGCTGT CCGACTACAT GACCGGCGAG
GGCAAGGACG TCACCGACCC GATGCTGGAC ATCGACGACT TTATCGGCCG TTCGTTCACC
ACCGCGCCCG ACAAGAAAAT GTATCAGCTG CCGGACCAGC AGTTCGCCAA CCTGTATTGG
TTCAGGTACG ACTGGTTCAC CAATCCGGAC TACAAATCGA AGTTCAAGGC GAAATACGGC
TACGACCTCG GCGTGCCGGT GAATTGGTCG GCCTATGAGG ACATCGCCGA GTTCTTCACC
AACGACGTCA AGGAGATCAA CGGCGTCAAG GTCTATGGCC ACATGGATTA TGGCAAGAAG
GATCCGTCGC TCGGCTGGCG CTTCACCGAC GCCTGGCTGT CGATGGCCGG CAACGGCGAT
CGCGGCATTC CCAACGGCCT GCCGGTCGAC GAATGGGGCG TCCGCATGGA AGGCTGCCGT
CCGGTCGGCT CCTCGATCGA GCGCGGCGGC GACACCAACG GCCCTGCGGC GGTGTATTCG
ATCGTCAAAT ATCTCGACTG GATGAAGAAA TACGCGCCGC CGCAGGCCCA GGGCATGACC
TTCTCGGAAT CGGGTCCGGT GCCGGCGCAG GGCAACGTCG CCCAGCAGAT GTTCTGGTAC
ACCGCCTTCA CCGCCGACAT GGTGAAGCCG GGTCTTGCGG TGATGAACGC CGACGGCACG
CCGAAGTGGC GGATGGCGCC GTCGCCGCAT GGCGCGTATT GGAAAGACGG CATGAAGCTC
GGCTATCAGG ACGTCGGCTC CGGCACGCTG TTGAAGTCGA CGCCGCCGGA TCGCCGCAAG
GCGGCGTGGC TGTATCTGCA GTTCATCACC TCGAAGACTG TCAGCTTGAA GAAGAGCCAC
GTCGGTCTCA CCTTCATCCG CGAGTCGGAT ATCTGGGATA AGTCCTTTAC GGAACGGGCG
CCAAAACTCG GCGGCCTGAT CGAGTTCTAT CGCTCGCCGG CCCGTACGCA ATGGTCGCCG
ACCGGCAACA ACATCCCGGA CTATCCGAAG CTGGCGCAAT TGTGGTGGCA GAACATCGGC
GACGCGGCCT CGGGCGCCAA GACCGCGCAG GCGGCGATGG ACTCGCTGGC GGCGGCGCAG
GATTCGGTGC TGGAGCGGCT CGAGCGCTCC AAGGTGCAGG GTGACTGCGG CCCGAAGCTG
AACAAGAAGG AGACCGCCGA GTTCTGGTAC AAGAAGTCGG AAAAGGACGG CAACATCGCG
CCGCAACGCA AGCTCGCCAA CGAGAAGCCG AAGGGCGAAA CCATCGACTA CGACACGCTG
ATCAAGTCGT GGCCGGCGTC GCCGCCGAAG CGCGCCTCGC TGAACTGA
 
Protein sequence
MQHAIGQRRR STRAIFLGMA SAAALIAVST APALADDATA QKWIDEEFQP STLSKEDQLK 
ELQWFAKAAE PFKGMDINVV SETITTHEYE AKTLAKAFSE ITGIKLKHDL IQEGDVVEKL
QTQMQSGKNV YDGWINDSDL IGTHFRYGQT IALSDYMTGE GKDVTDPMLD IDDFIGRSFT
TAPDKKMYQL PDQQFANLYW FRYDWFTNPD YKSKFKAKYG YDLGVPVNWS AYEDIAEFFT
NDVKEINGVK VYGHMDYGKK DPSLGWRFTD AWLSMAGNGD RGIPNGLPVD EWGVRMEGCR
PVGSSIERGG DTNGPAAVYS IVKYLDWMKK YAPPQAQGMT FSESGPVPAQ GNVAQQMFWY
TAFTADMVKP GLAVMNADGT PKWRMAPSPH GAYWKDGMKL GYQDVGSGTL LKSTPPDRRK
AAWLYLQFIT SKTVSLKKSH VGLTFIRESD IWDKSFTERA PKLGGLIEFY RSPARTQWSP
TGNNIPDYPK LAQLWWQNIG DAASGAKTAQ AAMDSLAAAQ DSVLERLERS KVQGDCGPKL
NKKETAEFWY KKSEKDGNIA PQRKLANEKP KGETIDYDTL IKSWPASPPK RASLN