Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A1917 |
Symbol | |
ID | 3835341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 2221272 |
End bp | 2222828 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637826016 |
Product | extracellular solute-binding protein |
Protein accession | YP_427004 |
Protein GI | 83593252 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAGT TCCTCAAGGC CAGCGTATTC ACCCTGTCCA TCGCCCTGAC GCCCTTCGCC ACCGTTCACG CGGCCACGCC GAAGGATACC CTGGTGATGG CCTGGGTCTT CGACGACATC GTCACCCTCG ACCCCGCCGA GATCTACGAA GTCTCGGGCT CCGAGTTCAT GGCCAATGTC TACGATCGTC TGGTCACCCT CGACGAAAAA GACCCGTCGA AGCTGCGCAA TGTCATCGCC GAAAGCTGGT CGGTCTCCGA AGACGGCAAG ACCTTCACCT TCAAGATCCG CCCCAACAAC ACCTTCTTTT CAGGCAATCC GGTGACGGGC GAGGACGTGG AGTTCTCGGT CGAGCGCTAT GTGCTGCTTG ATAAGAACCC GGCCTTCATC ATGAACCAGT TCGGCCTGAC CAAGGAGAAC GTCCGCGACA AGGTGCGGCT GGTCGATCCG ATGACGGTGG AAATCGAGCT TGATCAGGCC TATGCGCCGT CGTTCTTCCT GAACTGCGTC AGCTATACCA CGGCGGTTGT CGACAAGAAG GAAGTGCTCT CCCACGAGGA GAACGGCGAT TTCGGCAATG GCTGGCTCAA GCGCAATTAC GCCGGTTCGG GCCCCTTCTT CCTGAAGGCC TGGAAGCCCA ATGAATCGCT GGTGCTGGAA GCCAATCCCA AGTACTTCGA TGGCGCGCCG ACGATCAAGC GCGTGCTGGC CAAGAACGTC GTGGAAAGCG CCACCCAGCA GCTTCTGCTC GAAAAGGGCG ATGTCGACGT GGCGCGCAAT CTGACCGGCG ATCAGCTCGC CGCCGTGCGC AAGAACGCCG ATATCACCAT CGACGCCGAG ACCAAGGCGA CGCTGTGGTA CATGGGCCTC AACGTGAAAA ACCCGATCCT GGCCAAGCCC GAAGTGCGTC AGGCGATGAA GTATCTGGTC GATTACAAGG GGATGGCCGA CAGCATCTTC GCCGGCACCG GCACCATCCA CCAGACCTTC GTTCCCAGCC GCCAGTTGGG CGCCCTTGAT GAGACGCCGT TTTCCTTCGA TCTCGCCAAG GCCAAGGAGC TGCTGGCCAA GGCCGGGCTG CCCGATGGCT TCTCGGTGAC CATGGACACC ACCAACAAGT CCGAGACCCG CAATCTCGCC GACGCCATCC AGTCGTCGAT GAGCAAGGCC GGCATCAAGA TCGAACTCAA GGTCGCCGAC AACAAGACCA CCCTGACCCG CTACCGCGCC AGCGAGCATG ATATCTACAT CGGCCAGTGG GGCTCCGATT ACTGGGATCC CCATTCCAAC ACCGACGGCT ATCTCAACGC GCCCTTGGCC AAGCGCAATC AGTGGGAGGT CCCGGGGCTG GTCGACAAGG TCTTCGCCGC CCGCGACGAG AAGGACCCGG TCAAGCGCGC CCAGATGTAC AAGGACCTGC AAAGCATGGC CCTTGAGGAA AGCCCCTATG TGATTATCCT CCAGCAGGTG GAAAACGCCG CCGTTCGCAA GGAGGTCAAG GGCTTCGTCC TCGGCCCGAC GTTCGACCTC AACCTTTATC GTCACGTCAC GAAGTAA
|
Protein sequence | MKQFLKASVF TLSIALTPFA TVHAATPKDT LVMAWVFDDI VTLDPAEIYE VSGSEFMANV YDRLVTLDEK DPSKLRNVIA ESWSVSEDGK TFTFKIRPNN TFFSGNPVTG EDVEFSVERY VLLDKNPAFI MNQFGLTKEN VRDKVRLVDP MTVEIELDQA YAPSFFLNCV SYTTAVVDKK EVLSHEENGD FGNGWLKRNY AGSGPFFLKA WKPNESLVLE ANPKYFDGAP TIKRVLAKNV VESATQQLLL EKGDVDVARN LTGDQLAAVR KNADITIDAE TKATLWYMGL NVKNPILAKP EVRQAMKYLV DYKGMADSIF AGTGTIHQTF VPSRQLGALD ETPFSFDLAK AKELLAKAGL PDGFSVTMDT TNKSETRNLA DAIQSSMSKA GIKIELKVAD NKTTLTRYRA SEHDIYIGQW GSDYWDPHSN TDGYLNAPLA KRNQWEVPGL VDKVFAARDE KDPVKRAQMY KDLQSMALEE SPYVIILQQV ENAAVRKEVK GFVLGPTFDL NLYRHVTK
|
| |