Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A0979 |
Symbol | |
ID | 3833460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 1164259 |
End bp | 1165851 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637825068 |
Product | extracellular solute-binding protein |
Protein accession | YP_426067 |
Protein GI | 83592315 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.297745 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCTTA AGCGTATGAG CCGCTGGGCG GCGGCGGTTC TGGTGGCCTC GGCGGCCGCC TTCACCGCCG CCCCCGCCAC CCGGGCCGCC GAAGGACCCA AGGTCTTGCG GATCGTTCCC CAGGCCGATC TGAAGATCCT CGATCCGATC TGGACGACCG CCTTCGTCAC CCGCAACCAC GGCTATATGA TTTACGACAC GCTGTTTGGC GTCGATGCCG AAGGCGTCAT CCATCCCCAA ATGGTCGATC GCTATGAGGC TTCGGCCGAC GCTAAAAGCT TCCGCTTCAC CCTGCGCGAG GGGTTGGCCT TCCACGATGG CCAGCCGGTG ACCGCCACCG ATGTTATCGC CTCGCTCAAA CGCTGGGGGG CGCGCGACAA TCTTGGCCAG AAGATGCTGG CCAGCCTTGA GTCCATCGAG GCGGTCGACG CCAAAACCGT CGCCATGACC TTCAAGACGC CCTTTGGCAT GGTGCTGGAC GCGCTGAGCA AGCCGTCCTC GGTTCCGGCC TTCATCATGC CCGCCCGCGT CGCCGCCACG CCGCCCGACC AGCAGATCAC CGATACCACC GGCTCGGGCC CCTATATGTT CGTCAAGGAT GAATTCCGTC CGGGCGAACG GGTGGTCTAT GCCAAGAACC CGGCCTATGT GCCGCGCGCC GAACCGCCCT CGGGCACGGC CGGCGGCAAG GTGGTTTACG TCGACCGCGC CGAATGGATC ATCCTCAAGG ACGCCCAGAC CCAGGCCAAC GCCCTGGTCA ATGGTGAGGT CGATCTGATC GAATGGCTTC CCGCCGAGCA GTATGCCGGG CTGAAAAGCA AGCCCGACAT CAAGATGGAA GCCCAGGTGG TGAAGATGTC GGTGATGCTT CATCTCAATC ACCTGATCGC GCCCTTCAAC AATCCCAAGA TCGCCCAGGC GGCGTTGATG GCGATCAATC AGCAGGCCTT GATGCGCGCC CAGCTGGTTC ACAAGGAGCT CTATAACGGC TGCACCTCGA TCTATCCCTG CGGCACGACC TATGCCTCGG AGAACACCGC GTTCTTCACC GGCAAGCCGC AGTTCGACAA GGCCAAGGCC CTGCTCAAGG AAGCCGGCTA CGACGGCACC CCGGTGGTCC TGATGTATCC GGCCGATTTC GCCGTCATCA ACAAATACCC GCCGGTGATG GCCGAATTGC TCAAGCAGGC CGGGTTCGTC GTCGATATGC AGTCGATGGA TTGGCCGACC CTGGTGACGC GACGCACCAA GAAGGACCCG GTGGCGGCGG GCGGCTGGAA CGCCTTCATC ACCTCGTGGG GCATGGCCGA TACCATGAAC CCGATGTTCT TTGCCCCGCT GACCGGCAGT GGCGAGAAGG GTTGGTTCGG CTGGACCACC GATGACCAGC TTGAAGCCCT GAAAAGCGAG TTCCTGGTTA CCACCGATGG GGCGACGCGC AAGAAACTGG CCGAGGGCAT TCAGCTCCGC GCCATCGAAG CCGCCGTCTT CGGGCCGATC GGCGAGTTCA AGCCGCTGAC CGCCTACCGG ACATCGGTCA GCGGGCTGGT AACCGCCCCC GTTCCGGTGT TCTGGAACCT CAAGAAGGAC TGA
|
Protein sequence | MSLKRMSRWA AAVLVASAAA FTAAPATRAA EGPKVLRIVP QADLKILDPI WTTAFVTRNH GYMIYDTLFG VDAEGVIHPQ MVDRYEASAD AKSFRFTLRE GLAFHDGQPV TATDVIASLK RWGARDNLGQ KMLASLESIE AVDAKTVAMT FKTPFGMVLD ALSKPSSVPA FIMPARVAAT PPDQQITDTT GSGPYMFVKD EFRPGERVVY AKNPAYVPRA EPPSGTAGGK VVYVDRAEWI ILKDAQTQAN ALVNGEVDLI EWLPAEQYAG LKSKPDIKME AQVVKMSVML HLNHLIAPFN NPKIAQAALM AINQQALMRA QLVHKELYNG CTSIYPCGTT YASENTAFFT GKPQFDKAKA LLKEAGYDGT PVVLMYPADF AVINKYPPVM AELLKQAGFV VDMQSMDWPT LVTRRTKKDP VAAGGWNAFI TSWGMADTMN PMFFAPLTGS GEKGWFGWTT DDQLEALKSE FLVTTDGATR KKLAEGIQLR AIEAAVFGPI GEFKPLTAYR TSVSGLVTAP VPVFWNLKKD
|
| |