Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3234 |
Symbol | |
ID | 3971900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 3579665 |
End bp | 3580663 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637926345 |
Product | TRAP dicarboxylate transporter, DctP subunit |
Protein accession | YP_533095 |
Protein GI | 90424725 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.230361 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAAT TGATTTTGGC CGCGGCGTCG GCCGCGATGT TCGCTGTTGT TGGGCCGGCT GCGGCACAGT CGCCGATCGT CATCAAGTTC AGCCACGTGG TGGCGCCGAA CACCCCGAAG GGTCTGGCCG CCGAAAAATT CAAAGAACTC GCCGAGAAGT ACACCGCCGG CAAGGTCAAG GTCGAGGTCT ACCCGAACTC GCAGCTCTAC AAGGACAAGG AAGAGCTCGA AGCCTTGCAA CTCGGCGCGG TGCAGATGCT GGCGCCGTCG AATTCCAAGT TCGGCCCGAT CGGCGTGCGC GAATTCGAAG TGTTCGATCT GCCCTATATC CTGTCCGATC TGCCGACGCT GCGCAAAGTC ACCGACGGCC CGCTCGGCAC CAAGCTGTTG AAGCTGCTCG ATGGCAAGGG CATGACCGGG CTGGCCTATT GGGACAACGG CTTCAAGATC ATGAGTGCTA ACAAGAAATT GGTGGCGCCG GCGGACTACA AGGGGCTGAA ATTCCGCATC CAGTCCTCCA AGGTGCTCGA CGCCCAGTTC CGCGCGCTCG GCGCGATCCC GCAGGTGATG GCGTTCTCCG AAGTCTATCA GGCGCTGCAG ACCGGCGTGG TCGACGGCCA GGAAAATACG CCCTCCAACA TGTACACCCA GAAAATGCAC GAGGTGCAGA AGTTCACCAC GCTCACCAAT CACGGCTATC TCGGCTACGT GGTGATCGTG AACAAGAAGT TCTGGGATGA CTTGCCGGCC AACCTGCGCA CCGAACTCGA CAAGGCGATG AAAGAGGCCT CGGCCTACGG CAACAGCCTG TCCGCCAAGG AGAACGAAGA CGCGTTGGCC GAGATGCAGA AGTCCGGCAA GACCGAACTC GTCAAGTTGA CGCCGGAGCA GGACGCCGCG ATGCGCAAGG CGATGGAGTC GGTCTACGGC GACGTCGCCA ACCGAGTCGG CAAGCCGCTG ATCGAAGAGT TCCTGAAGGA AACCAAGGCC ACCAATTAA
|
Protein sequence | MRKLILAAAS AAMFAVVGPA AAQSPIVIKF SHVVAPNTPK GLAAEKFKEL AEKYTAGKVK VEVYPNSQLY KDKEELEALQ LGAVQMLAPS NSKFGPIGVR EFEVFDLPYI LSDLPTLRKV TDGPLGTKLL KLLDGKGMTG LAYWDNGFKI MSANKKLVAP ADYKGLKFRI QSSKVLDAQF RALGAIPQVM AFSEVYQALQ TGVVDGQENT PSNMYTQKMH EVQKFTTLTN HGYLGYVVIV NKKFWDDLPA NLRTELDKAM KEASAYGNSL SAKENEDALA EMQKSGKTEL VKLTPEQDAA MRKAMESVYG DVANRVGKPL IEEFLKETKA TN
|
| |