Gene Information |
Locus tag | RPB_0100 |
Symbol | |
ID | 3909686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 107004 |
End bp | 108446 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637881981 |
Product | TRAP transporter solute receptor TAXI family protein |
Protein accession | YP_483723 |
Protein GI | 86747227 |
COG category | [R] General function prediction only |
COG ID | [COG2358] TRAP-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | [TIGR02122] TRAP transporter solute receptor, TAXI family |
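The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon encodes one amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values copied from the table above (Start bp, End bp).
gene_len, protein_len = cds_lengths(107004, 108446)
print(gene_len)     # 1443
print(protein_len)  # 480
```

Both results agree with the Gene Length (1443 bp) and Protein Length (480 aa) rows.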
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTCA AGCGACTATT CGCCCGGTCG ATGGACGAGC CCGAGCCGGT GGACCCTTTG CTGCATATGT CTGCGCGCCC GGTGCGACGC AAGACGACGC TGATTTCGCT GGCCGCGATC CTGGCGCTGG TCGGCGCCAT CAGTGCTGCG TATTATTTCG CGATGCGGCC GACGACGCTG AAGATCGCCG TCGGTCCGCA GGCCAGCGAC GATCTGCGGC TGATCCAGGC GCTGGCGCAG GCGTTCTCGC GCGAGCGCAA CATCGTCCGG ATGCGGCCCA TCGTCACCGA CGGCCCGGCC GCCAGCGCCG CGGCGCTGAA ATCAGACACC ACCGATCTTG CGGTGATCCG CGGCGATCTG CCGGTGCCGC GCAACGCCCG CTCGGTCGCG GTGCTGCACA AGAACGTCGC CGTGCTGTGG GCGCCCGGCC GGCCCGCGAG CGGCAAACGC AGGAAAGCCG CTGTCGCCGG GGTCACTACC ATCACCCAGC TGGTCGGCAA GCGCGTCGGC GTGATCGGCC GCACCGAAGC CAATGCGGGG CTGCTCGCGG TGATCCTGCG CCAATACGGC ATCGACCCCG CCAAGGTCGA GAGCGTGCAG CTCACCGCGG CCGACGTCGC AGAGGCGGTG CGGACCGGCA AGGCCGACGC GTTTCTCGCG GCCGGACCGC TCAACAGCAA GGTGATCGGC GAGGCTTTGG CGGCCACTGC GAGTTCCGGC CGGGAGCCGG TGTTCCTCGG CATCGATTCG TCCGAAGCGC TGGCCGCCAA CCATCCGTCG TATGAATCGG CGTCGATCCC CGCCGGCGCC TTCGGCGGCG CCCCGGCGCG GCCGGGCGAC GACGTCAAGA CCATCAGCTT CTCGCATTAC ATCGTGGCGC GCGACGGCGT CTCCGACGCC ACCATCGCGA GTTTCACCCA ACAATTGTTC ACGGCGCGCC AGACCGTGAT GACGGAGAAT CCGCTGGCCG CGAAGATCGA GACGCCCGAC ACCGACAAGG ACGCGGTGAT CCCGGTGCAG GCGGGTGCCG CTGCCTATGT CGACGGCGAG CAGCGCAGCT TTCTCGACCG CTACAGTGAC CTGATCTGGT TTTCGCTGAT GGGGCTGTCG GCGACCGGCT CGCTCGGCGC CTGGTTCGCG AGCTATCTGC GGAAAGACGA ACGCAACACC AACGCCTCGC AGCGCGACCG GCTGCTCGAC ATGCTGGCGG CGGCGCGGCG CTGCGACGCG CAGGACGAAC TCGACGCGAT GCAGACCGAG GCCGATGCGA TCCTGCGCGA CGCGCTGAAC TGCTACGAGA ACGGCGCGAT CGACAGCGCG GCGCTGACCG CCTTCAGCAT CGCGCTGGAG CAGTTCCACA ACGCCGTGGT CGATCGCAAG ATGCTGCTGG CGGCGATCCC GCCGGCGCCA CCGATCCGCC CGGCGCGGCC GCAGGTGGTG TGA
|
Protein sequence | MDFKRLFARS MDEPEPVDPL LHMSARPVRR KTTLISLAAI LALVGAISAA YYFAMRPTTL KIAVGPQASD DLRLIQALAQ AFSRERNIVR MRPIVTDGPA ASAAALKSDT TDLAVIRGDL PVPRNARSVA VLHKNVAVLW APGRPASGKR RKAAVAGVTT ITQLVGKRVG VIGRTEANAG LLAVILRQYG IDPAKVESVQ LTAADVAEAV RTGKADAFLA AGPLNSKVIG EALAATASSG REPVFLGIDS SEALAANHPS YESASIPAGA FGGAPARPGD DVKTISFSHY IVARDGVSDA TIASFTQQLF TARQTVMTEN PLAAKIETPD TDKDAVIPVQ AGAAAYVDGE QRSFLDRYSD LIWFSLMGLS ATGSLGAWFA SYLRKDERNT NASQRDRLLD MLAAARRCDA QDELDAMQTE ADAILRDALN CYENGAIDSA ALTAFSIALE QFHNAVVDRK MLLAAIPPAP PIRPARPQVV
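The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The sketch below is illustrative only and is not the database's own method. It strips the whitespace, computes a GC fraction, and translates a nucleotide sequence codon by codon. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons), and it does not handle alternative start codons, which is sufficient here because this gene starts with ATG. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
# Codon table built from the compact standard layout (base order T, C, A, G).
# Assumption: table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
fragment = "ATGGATTTCA AGCGACTATT CGCCCGGTCG"
print(translate(fragment))  # MDFKRLFARS
```

The translated fragment matches the first block of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content row.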
|
| |