Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2941 |
Symbol | |
ID | 6410611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 3213862 |
End bp | 3215742 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642712822 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001991924 |
Protein GI | 192291319 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGACGCG CCGAATCAGA TCATCGCTTC ATCCGCCGGA TCAAGGCCGT TGCCGCCGCG CTCGCGATCG GTACCGGCTG CTTCACCGGC GTAACGACGG CCGGCGCCGA GTCGGTCGTG ACTCCGGCAT CCGGATTCGT CAGCCACGCC ATCGCGATGC ACGGCAAGCC CGCGCAGCCG GCCGACTTTA CCGCGATGCC CTACGTCAAT CCGGATGCCC CCAAGGGCGG CCGGATGATT GCCAGCGTCG TCGGCGCCTT CGACAGCCTC AATCCGCTGA TCGTCAAAGG CATCGCAGTC CAGCAGATGC GCGGCTACGT GATCGAGAGC CTGCTGGCGC GCGGCCACGA CGAGCCGTTC ACCTTGTACG GCCTGCTCGC GCGCTCGGTC GAAACCGACG ACATTCGCAG CTACGTCACA TTTCGGATCG ATCCGCGCGC CCGGTTTGCC GACGGCAAGC CGGTCACCGC CGAGGACGTG CTGTTCTCCT GGAAGCTACT CCGCGACAAG GGCCGGCCGA ACCTGCGCTT GTATTACCGC AAGGTCGCCA GCGCCGAAGC GCTCGATCCC CTCACCGTCC GGTTCGACTT CGGCGGCGCG CCGGACCGCG AGCTGCCACT AATCCTGGGC CTGATGCCGG TGCTGCCGCA CCACGCGGTC GAGCCCGACA GCTTCGAAGA AACCTCCCTC AAGCCGCCGC TCGGCTCCGG CCCGTACCGT GTCACCCACG TTCAGGCCGG TACCAGCGTG ACCCTGACGC GCGATCCGAA CTATTGGGGC CGTGATCTGC CGATCAACCG CGGCCTGTGG AATTTCGACG AAGTGCGGAT CGATTACTTC CGGGAGTCCA ATGCGCAGTT CGAAGCATTC AAACGCGGCC TCTACGACTT CCGCGTCGAG ACCGATCCGC TACGCTGGAG CGAAGGTTAC GACTTCCCCG CCGCCCGTCG CGGTGACGTG ATCCGGGACG CAATCAAACC GGGCACGCCG GAGCCGACCG ACACGCTGGT GTTCAACACC CGCCGGCCGG TGTTTGCCGA CGTCCGGGTG CGTGAGGCGC TGCTGAACCT GTTCGACTTC GAATGGATCA ATCGCAACTA CTTCTTCGGC CTCTACACCC GCACCGCAGG CTTCTTCGGA GGCTCGGAGC TGTCGGCCTA TGGCCGCGCG GCTGACGAAG GCGAGCGCAA GCTGCTGCAG CCGTATCTGC CTCGGCTCCG CGCCGACATC CTCGACGGCA GTTTCCGGCT CCCGGAGAGT GACGGTTCAG GCCGCGACCG CGCCGGCCTG CGCCGGGCCC TCAATCTGCT GGAGCAGGCC GGCTACCAGC TCGACGGCAC GGTCCTGCGC AAACGCGACA CCCGCCAGCC CCTCACCTTC GAGCTGCTGG TGACGACGCG GGAGCAGGAG CGGATCGCGC TGGCGTTCGC CCGTGACCTC AAGCGCGCCG GCATCACCGT GTCGGTCCGC ACCGTGGATG CCGTGCAGTT CGATCAGCGC CGCCTCGCCT ATGACTTCGA TATGATCCCG AACCGCTGGG ACCAGTCGCT GTCGCCCGGC AACGAGCAGT CGTTTTACTG GGGCAGCGAC GCCGCCGAGA CCCCGGGCAC CCGCAACTAC ATGGGCGCCA AGGATCCGGC GATCGACGCG ATGATTGCCG CGCTGCTACG CGCCCGCGGC CGCACCGATT TCGTCGACGC CGTGCGCGCG CTCGATCGCG TCCTGATCTC GGGGTTTTAC GTGGTCCCTG TGTACAGCGT GCGTGAGCAA TGGATCGCGC GCTGGAATCG TTTAGAACGT CCAAAGGCCA CTGCCCTGAC GGGTTATCTT CCCGAAACAT GGTGGTATCG GCCGCCGCCG CAGCAACCGC CGCAGAGGTG A
|
Protein sequence | MGRAESDHRF IRRIKAVAAA LAIGTGCFTG VTTAGAESVV TPASGFVSHA IAMHGKPAQP ADFTAMPYVN PDAPKGGRMI ASVVGAFDSL NPLIVKGIAV QQMRGYVIES LLARGHDEPF TLYGLLARSV ETDDIRSYVT FRIDPRARFA DGKPVTAEDV LFSWKLLRDK GRPNLRLYYR KVASAEALDP LTVRFDFGGA PDRELPLILG LMPVLPHHAV EPDSFEETSL KPPLGSGPYR VTHVQAGTSV TLTRDPNYWG RDLPINRGLW NFDEVRIDYF RESNAQFEAF KRGLYDFRVE TDPLRWSEGY DFPAARRGDV IRDAIKPGTP EPTDTLVFNT RRPVFADVRV REALLNLFDF EWINRNYFFG LYTRTAGFFG GSELSAYGRA ADEGERKLLQ PYLPRLRADI LDGSFRLPES DGSGRDRAGL RRALNLLEQA GYQLDGTVLR KRDTRQPLTF ELLVTTREQE RIALAFARDL KRAGITVSVR TVDAVQFDQR RLAYDFDMIP NRWDQSLSPG NEQSFYWGSD AAETPGTRNY MGAKDPAIDA MIAALLRARG RTDFVDAVRA LDRVLISGFY VVPVYSVREQ WIARWNRLER PKATALTGYL PETWWYRPPP QQPPQR
|
| |