Gene RPC_3647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3647 
Symbol 
ID3972018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4056490 
End bp4057557 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content65% 
IMG OID637926756 
ProductABC transporter related 
Protein accessionYP_533501 
Protein GI90425131 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.265226 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTCGG TGCAGATCCA CGACGTGCGG AAATCTTTCG GCGGCTTCGA AGTGTTGCAC 
GGCGTCAGCG TCCCGATCGA GGACGGCGCC TTTGTGGTGC TGGTCGGCCC CTCCGGCTGC
GGCAAGTCCA CCTTACTACG GATGCTGGCG GGCCTGGAAA AGATCACCTC CGGCACCATC
TCGATCGGCG ACCGCGTGGT CAACGACGTG CAACCGAAAG AGCGCGACAT CGCCATGGTG
TTCCAGAACT ACGCGCTGTA TCCGCATATG ACGGTGGCGC AGAACATGGG GTTTTCGCTG
AAGCTGCGCG GCACCGAGCA GGCGGTGATC GACGAGAAGG TCAACCGCGC CGCCGACATT
CTCGATCTGC GCAAGCTGCT CGACCGCTAT CCGCGGCAAC TCTCCGGCGG CCAGCGCCAG
CGCGTCGCAA TGGGCCGCGC CATCGTCCGC GATCCGCAGG TGTTTCTGTT CGACGAGCCG
TTGTCGAACC TCGATGCCAA GCTGCGGGTG GCGATGCGTA CCGAAATCAA GGAACTGCAC
CAGCGGCTGA AGACCACCAC GGTCTACGTC ACCCACGACC AGATCGAGGC GATGACCATG
GCCGACAAGA TCGTGGTGAT GCAGGACGGC ATCGTCGAGC AGATGGGCTC GCCGCTCGAC
CTCTACGACC GCCCCGACAA CAAATTCGTC GCCGGCTTCA TCGGCTCGCC GGCGATGAAT
TTCCTCGCCG GCGAACTCAA GGTCAATGGC GGCCAGCCCT GGGTGGAGAC CGCGAGCGGC
GCCAGGCTGC CGATCGAAGC GGCGCCGGCC TCGGCCAACG GCAAGGCGGT GACCTATGGT
ATCAGGCCCG AGCATCTGGA ATTTTCCGAC GACGGCATCG AGGCCGAAGT GGTGGTGGTG
GAGCCGACCG GATCGGAAAC CCAGATCGTG GCGCGGGTCG GCGCCCAGGA GCTGATCGCC
ATTTTCCGCG ACCGCCGCAA CGTGCAGCCC GGCGACCGGA TTTATCTGAA GCCGCGCGCT
AGCGCCGCCC ATCTGTTCGA CGACGCCACC GGCAAGCGAC TGTCCTGA
 
Protein sequence
MASVQIHDVR KSFGGFEVLH GVSVPIEDGA FVVLVGPSGC GKSTLLRMLA GLEKITSGTI 
SIGDRVVNDV QPKERDIAMV FQNYALYPHM TVAQNMGFSL KLRGTEQAVI DEKVNRAADI
LDLRKLLDRY PRQLSGGQRQ RVAMGRAIVR DPQVFLFDEP LSNLDAKLRV AMRTEIKELH
QRLKTTTVYV THDQIEAMTM ADKIVVMQDG IVEQMGSPLD LYDRPDNKFV AGFIGSPAMN
FLAGELKVNG GQPWVETASG ARLPIEAAPA SANGKAVTYG IRPEHLEFSD DGIEAEVVVV
EPTGSETQIV ARVGAQELIA IFRDRRNVQP GDRIYLKPRA SAAHLFDDAT GKRLS