Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3490 |
Symbol | |
ID | 3972857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 3876026 |
End bp | 3877615 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637926602 |
Product | extracellular solute-binding protein |
Protein accession | YP_533349 |
Protein GI | 90424979 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAC GTGAATTTGC CAAATTGGGT CTGGCCGCGG GCATGGTCAG CATCGGCGGT TTTCCGATCG GCGTGACCCG TGCGGCGGGA CAGAGCCGCG GCGGCACGCT CAACACCATC ATCCAGCCGG AGCCGCCGAT CCTGGTCACC GCGCTCAATC AGCAGCAGCC GACGCTGACG CTCGGCGGCA AGATCTATGA GAGCCTGCTG CGCTACGATT TCGATCTGAA GCCGATGCCC GGCCTGGCGC AGTCCTGGGA GATCTCGCCG GACCGGCTGA CCTACACCTT CAAGCTGTAT CCGAACATCA CCTTCCACGA CGGCACGCCG CTGACCTCCG AGGACGTGGT GTTCTCGGTG ATGAAGATTC TGATCGAGAA CCACTCCCGC GCCCGCAGCA CCTTCTCGCG GATCGACAAG GCGGAGGCGC CGGACCCGGT CACCGTGGTG TTCAAGCTGA AAGCGCCGTT CTCGCCGTTC CTCACCGCGT TCGATTGCAC CACCGCGCCG ATCGTGCCGA AGCACATCTA TGAAGGCACC GACTTCCGCA AGAACCCGGC CAATGCGCGA GCAATCGGCT GCGGTCCGTT CAAGCTGAAG GAATGGGTGC GCGGCTCGCA CGTGCATCTG GTCAGGCACG AGGGCTACTA TCGTCCCGGC GAACCCTATC TCGACGAGAT CATCTATCGG GTGATCCCGG ATTCGGCGTC GCGCTCGGTG GCGCTGGAGC AGGGCACCGT GCAGCTGACG CAGTGGTCGG ACGTGGAGTC GTTCGAGGTG CAGCGGCTGT CGAAGCTGCC CAATCTCGCG ATGACCACCA AAGGCTATGA ATTCTTCGCG CCGCATCAGT GGCTGGAATT CAACAACCGC ATCGCGCCGA TGAACGACAA GCGCTTCCGC CAGGCGGTGC TGTTCGCGAT CGATCGCAAG GCGCTGCTCA ATCGGGTGTA TTTCGGCCTC GGCAAGGTCG CCACCGGGCC GGTGTCGTCG AAGACCAAAT TCTACGAGAA GGACGTCAAG CCCTACGACT ACTCGCTCGA CAAGGCTAAG GCGCTGCTCG ACGAGATGGG CCTGAAGCCC GGTGCCGACG GCAAGCGCGT CAGCATTCCC TATCTGGTGC CGCCTTACGG CGAATCGCAT CAGCGCACCG CGGAGTTCAT TCGCCAGTCG CTGGCGCGCG TCGGCATCGA CCTGCAGCTG CAGGGCATCG ATCTCGCCGG CTGGGCCGAC AAGTACAGCA ACTGGGATTT CTCGATGACC GCGACCGTGG TCTATCAGTT CGGCGATCCG GCGCTCGGCG TGGCGCGGAC CTACGTCTCC TCCAACATCC ACAAGGGCAT TCTGTTCTCC AACACCGCCG GCTATTCCAA TCCGGAGGTC GACCGGCTGT TCGAGGAAGC CGCGGTAGCG GGCGACGATG CCAAGCGCCA GCAATGCTAC AGCGACGCGC AGAAGCTGAT CGTCGAGGAC GTCCCGGTGG CCTGGCTGCT GGAGATGGAT TATCCGAATT TCATGGATAA GCGGCTGAAG AACGTCATCA CCACCGCGAT CGGCGTGCAC GACACCTTCG GCACGGTCTC GTTCGCATGA
|
Protein sequence | MNRREFAKLG LAAGMVSIGG FPIGVTRAAG QSRGGTLNTI IQPEPPILVT ALNQQQPTLT LGGKIYESLL RYDFDLKPMP GLAQSWEISP DRLTYTFKLY PNITFHDGTP LTSEDVVFSV MKILIENHSR ARSTFSRIDK AEAPDPVTVV FKLKAPFSPF LTAFDCTTAP IVPKHIYEGT DFRKNPANAR AIGCGPFKLK EWVRGSHVHL VRHEGYYRPG EPYLDEIIYR VIPDSASRSV ALEQGTVQLT QWSDVESFEV QRLSKLPNLA MTTKGYEFFA PHQWLEFNNR IAPMNDKRFR QAVLFAIDRK ALLNRVYFGL GKVATGPVSS KTKFYEKDVK PYDYSLDKAK ALLDEMGLKP GADGKRVSIP YLVPPYGESH QRTAEFIRQS LARVGIDLQL QGIDLAGWAD KYSNWDFSMT ATVVYQFGDP ALGVARTYVS SNIHKGILFS NTAGYSNPEV DRLFEEAAVA GDDAKRQQCY SDAQKLIVED VPVAWLLEMD YPNFMDKRLK NVITTAIGVH DTFGTVSFA
|
| |