Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A0799 |
Symbol | |
ID | 3834432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 953763 |
End bp | 955358 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637824890 |
Product | extracellular solute-binding protein |
Protein accession | YP_425890 |
Protein GI | 83592138 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.162067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAACCA TGCGGTGGGT GTTGCTCGCG GCTTCGGCCG CTTTGTTGAT GGCGCCGGCG TCTTTCGCCG TCGCCGGTAC GCCAAAAGAC ACCTTGGTTT TCGCCCGTAA CATCGACGAG ATCATCTCGC TCGATCCGGC GGAAAACTTC GAAAGCGTCG GCGGAGAGAC CCTTAACAAT CTTTACACCC GCCTTGTGAC CTTCGCGCCG GGGGATTTCG GCACGCTGGT CGGCGGAGCG GCCGAAAGCT GGACGGTCAG CGAGGATGGC AAGGTCTTCA CCTTCAAGAT CCGTCCGGGT CTGACCTTCC CCTCGGGCCG TCCGCTGCGC GCCCAGGACG CCGCCTTTTC CTTGCAACGC GCCGTTATCT TGGAAAAGAC CCCGGCGTTC ATCCTGACCC AGTTCGGCTG GACCAAGGAC AACGTCAAAG ACCTGATCCA GGCTCCCTCC GACGATACGC TGCGGCTGAC CCTTCCCGAG GAACGGGCGC CCAGTCTGGT GATCAATTCC CTGACCGCCA CCGTCGCCTC GGTGGTCGAC GCCGAGGAAG CCTTGGCCCA CGCCAAAGAT AGTGATCTGG GGTCGGAGTG GCTGAAGACC ACCGCGGCGG GGACGGGGGC CTATCGCCTG ATCGAGTGGA AGCCCAAGCA GGCGCTAAGC TTTGAGGCCA ACCCCAAGTA TTACCTTGGC GAGGTCCCCT TGAAGCGGGT GATCGTCCGC CATATTCCCG ATCCATCGAG CCAGCGGTTG CTGCTTGAAA AGGGCGATGT CGATATCGCC CGCGACCTCA CGCCCGATCA GATCGCCACC TTGGAAGGCA AAAAAAACTT CCATATCTGG ACCGATCCGA AGCAGACGGT CTATTACCTG AACCTCAACC TGAAAAACCC CGAGCTGGCC AAACCCAAGG TGCGCGAGGC GATCCGCTGG CTGGTCGATT ACAAGGGTCT GGCCGAGACG GTTTTGAAGG GACGGGTCGA GGTTCATCAG GCGATCATCG GCAAGGGAAC CTTCGGCTCG CTCGACGAAA CCCCCTTCCG CCTCGACGTG GCCAAGGCCA AGGCCCTGCT CGCCGAAGCC GGGGTGGCCG ACGGCTTCAA GATCACCATC GACAGCGCCA ATTCCCCACC GGCCTCCGAT ATCGCCCAGT CCCTGCAATC GACCTTCGCC CAGGCTGGCA TCACCCTTGA GATCATCCAG AGCGATCGCA AGCAGTTGCT GACCAAATAC CGGGCGAGGG CGCACGATAT CGTGATCATC ACCTGGGAAC CCGATTACCT CGACCCCCAT TCCTCGACGG ATTATTTCAC GCGCAATACG GATAATTCCG ATAGCGCCAA GTCGAAGACC CTGGCGTGGC GGGCGTCGTG GGATATCCCG GAGCTGACCC GCGAGACGGA CAAGGCGGTG CTGGAGATCG ATACCGAGAA GCGCAAGGCC GCCTATCTCG CCTTGCAACG CGAAATCCAG ACCGACTCGC CGATCGTCGT GCTCTTCCAG AAGGTCGACC AGACCGCCGG ACGCCAAGAG GTCACCGGCT TCGACTCGGG ACCAACCGGC GATACGCTGG TTTATGCCCG CATTCGGAAA GATTAG
|
Protein sequence | MGTMRWVLLA ASAALLMAPA SFAVAGTPKD TLVFARNIDE IISLDPAENF ESVGGETLNN LYTRLVTFAP GDFGTLVGGA AESWTVSEDG KVFTFKIRPG LTFPSGRPLR AQDAAFSLQR AVILEKTPAF ILTQFGWTKD NVKDLIQAPS DDTLRLTLPE ERAPSLVINS LTATVASVVD AEEALAHAKD SDLGSEWLKT TAAGTGAYRL IEWKPKQALS FEANPKYYLG EVPLKRVIVR HIPDPSSQRL LLEKGDVDIA RDLTPDQIAT LEGKKNFHIW TDPKQTVYYL NLNLKNPELA KPKVREAIRW LVDYKGLAET VLKGRVEVHQ AIIGKGTFGS LDETPFRLDV AKAKALLAEA GVADGFKITI DSANSPPASD IAQSLQSTFA QAGITLEIIQ SDRKQLLTKY RARAHDIVII TWEPDYLDPH SSTDYFTRNT DNSDSAKSKT LAWRASWDIP ELTRETDKAV LEIDTEKRKA AYLALQREIQ TDSPIVVLFQ KVDQTAGRQE VTGFDSGPTG DTLVYARIRK D
|
| |