Gene RPB_3329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3329 
Symbol 
ID3911131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3808239 
End bp3809237 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content60% 
IMG OID637885232 
ProductTRAP dicarboxylate transporter DctP subunit 
Protein accessionYP_486936 
Protein GI86750440 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.330686 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAAT CGATATTCGT AGTTGCATCG ATCGCAGCGC TCGCGCTGGT CGGCCCGGCC 
GCGGCGCAGC AGCCGATCGT CGTCAAATTC AGCCACGTGG TGGCGGACAA TACGCCGAAG
GGTCAGGCCG CGATCAAGTT CAAGGAACTG GCGGAGAAGT ACACCAACGG CAAGGTGAAG
GTCGAAGTCT ATCCGAACTC GCAACTGTTC GGCGACGCCA AGGAAATGGA AGCGGTCGCG
CTCGGCGACG TGCAGTTCAT CGCGCCGTCG CTGTCGAAGT TCGACAAGTT CACCAAGCAG
ATTCAGGTGT TCGATCTGCC GTTCCTGTTC AACGACATCG CCGCGGTCGA TCGTTTCCAG
GCCGGAAAGC AGGGGCAGGC TCTGCTGCGC TCGATGGAAT CGAAGAACTT CCTGGGCCTC
GCCTACTGGC ACAACGGCAT GAAGCAGATC TCGGCCAATA GGCCGCTGCT GAAGCCGGAA
GACGCCAAGG GTCTGAAGTT CCGCATCCAG GCGTCGGACA TTCTCGCCGC GCAGTTCCAG
GGCTTGAACG CCACCCCGCA GAAGCTCGCC TTCTCGGAAG TCTATCAGGC GCTGCAGGTC
GGCACCGTCG ACGGCCAGGA GAACACCTGG TCGAACATCT TCTCGCAGAA ATTCTACGAA
GTGCAGAAGG ACATCACCGA GTCTGATCAC GGCGTGATCG ACTACATGGT CGTGGTCAAC
GCCAAGTGGT GGAACGGCCT GTCGAAGGAT CTGCAGGACG CGATGAAGAA GGCGATGGAC
GAGGCCACCA AGGTCAACAA CGACGTCGCC GGCAAGCTCA ACGACGAGGC CAAGCAGAAG
ATCGCGTCCT CCGGCGCCAG CAAGATCCAT CAGCTGACGC CCGAGCAGCG CAAGCAGTGG
GTCGAAGCGA TGAAGCCGGT CTGGGCCAAG TTCGAAAGCG CGATCGGCAA GGACCTGATC
GACGCGGCAG TGGCGTCGAA CGACACGAAG ACCAACTGA
 
Protein sequence
MRKSIFVVAS IAALALVGPA AAQQPIVVKF SHVVADNTPK GQAAIKFKEL AEKYTNGKVK 
VEVYPNSQLF GDAKEMEAVA LGDVQFIAPS LSKFDKFTKQ IQVFDLPFLF NDIAAVDRFQ
AGKQGQALLR SMESKNFLGL AYWHNGMKQI SANRPLLKPE DAKGLKFRIQ ASDILAAQFQ
GLNATPQKLA FSEVYQALQV GTVDGQENTW SNIFSQKFYE VQKDITESDH GVIDYMVVVN
AKWWNGLSKD LQDAMKKAMD EATKVNNDVA GKLNDEAKQK IASSGASKIH QLTPEQRKQW
VEAMKPVWAK FESAIGKDLI DAAVASNDTK TN