Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3329 |
Symbol | |
ID | 3911131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3808239 |
End bp | 3809237 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637885232 |
Product | TRAP dicarboxylate transporter DctP subunit |
Protein accession | YP_486936 |
Protein GI | 86750440 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.330686 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAAAT CGATATTCGT AGTTGCATCG ATCGCAGCGC TCGCGCTGGT CGGCCCGGCC GCGGCGCAGC AGCCGATCGT CGTCAAATTC AGCCACGTGG TGGCGGACAA TACGCCGAAG GGTCAGGCCG CGATCAAGTT CAAGGAACTG GCGGAGAAGT ACACCAACGG CAAGGTGAAG GTCGAAGTCT ATCCGAACTC GCAACTGTTC GGCGACGCCA AGGAAATGGA AGCGGTCGCG CTCGGCGACG TGCAGTTCAT CGCGCCGTCG CTGTCGAAGT TCGACAAGTT CACCAAGCAG ATTCAGGTGT TCGATCTGCC GTTCCTGTTC AACGACATCG CCGCGGTCGA TCGTTTCCAG GCCGGAAAGC AGGGGCAGGC TCTGCTGCGC TCGATGGAAT CGAAGAACTT CCTGGGCCTC GCCTACTGGC ACAACGGCAT GAAGCAGATC TCGGCCAATA GGCCGCTGCT GAAGCCGGAA GACGCCAAGG GTCTGAAGTT CCGCATCCAG GCGTCGGACA TTCTCGCCGC GCAGTTCCAG GGCTTGAACG CCACCCCGCA GAAGCTCGCC TTCTCGGAAG TCTATCAGGC GCTGCAGGTC GGCACCGTCG ACGGCCAGGA GAACACCTGG TCGAACATCT TCTCGCAGAA ATTCTACGAA GTGCAGAAGG ACATCACCGA GTCTGATCAC GGCGTGATCG ACTACATGGT CGTGGTCAAC GCCAAGTGGT GGAACGGCCT GTCGAAGGAT CTGCAGGACG CGATGAAGAA GGCGATGGAC GAGGCCACCA AGGTCAACAA CGACGTCGCC GGCAAGCTCA ACGACGAGGC CAAGCAGAAG ATCGCGTCCT CCGGCGCCAG CAAGATCCAT CAGCTGACGC CCGAGCAGCG CAAGCAGTGG GTCGAAGCGA TGAAGCCGGT CTGGGCCAAG TTCGAAAGCG CGATCGGCAA GGACCTGATC GACGCGGCAG TGGCGTCGAA CGACACGAAG ACCAACTGA
|
Protein sequence | MRKSIFVVAS IAALALVGPA AAQQPIVVKF SHVVADNTPK GQAAIKFKEL AEKYTNGKVK VEVYPNSQLF GDAKEMEAVA LGDVQFIAPS LSKFDKFTKQ IQVFDLPFLF NDIAAVDRFQ AGKQGQALLR SMESKNFLGL AYWHNGMKQI SANRPLLKPE DAKGLKFRIQ ASDILAAQFQ GLNATPQKLA FSEVYQALQV GTVDGQENTW SNIFSQKFYE VQKDITESDH GVIDYMVVVN AKWWNGLSKD LQDAMKKAMD EATKVNNDVA GKLNDEAKQK IASSGASKIH QLTPEQRKQW VEAMKPVWAK FESAIGKDLI DAAVASNDTK TN
|
| |