Gene Information |
Locus tag | Rru_A2474 |
Symbol | |
ID | 3835908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 2873323 |
End bp | 2874900 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637826582 |
Product | extracellular solute-binding protein |
Protein accession | YP_427561 |
Protein GI | 83593809 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCCT GGATAGTCCT GACGGCCCCC GCCGCTTCGA CCGCCGCCGA GTTCCGCTGG ACGGCGGCGC AGGGCCCGGC GACGCTGGAG CCCCAATCCC CGGAAGCCCT GGCCGACCCC GGGGTTCTGG GCGACGTTTA TGAAACGCTG ATCGGCCGCG ACTCCGATCT TGACCTTGAA CCGGCCCTGG CGACCGCTTG GCAGGCGGTG ACCGATTTAC GCTGGCGGGT CACCCTGCGT CGGGGCGTGC GCTTCCATGA CGGCGCCGCC TTCACCGCCG ATGACGTGGT CGCCTCGCTG GAGCGCGCCA GCGCCCGCGG CGGCGGTCTG GCCGAAGCGC TGGCGCCGGT GCGCCGCGTC CGCCGGGTCG ATGATACGAC GGTCGATCTT TACACCCAAA CGCCCACCCC CGATCTGCCC CAACGCCTCG CCCTGATCAG AATCCTCGAC GACGGCTGGA TCGCCCGCGC CGATCCGGCG ATGCCGGCCA ATGGCACCGG CCCCTTCCGA GTGGACCGCT TCACCCCCGG CGGCGCCGTG ACGCTGGTGG CCAATGGCGA CTGGTGGGGA CGGACGATCC CGGCGGACCC GCTGGGGACG ACGGCGCCGC CCGCTCCACG GCTTGAGCGC GCCACCCTGA TTCCCGCCGC CCGCCCGGCC GATCGCTTGC GGCTTTTGCT TGAAGGCAAG GTGGATCTGG CGCTGGATCT GCCGCCCGCC GTGGTGCCTC CCCTGGCCGC CACCCCCGGC CTGCGCGCCC TGGTCATCGG CGGAACGCGC ACGGTCATGC TGGGGATGGA CCAGCGCCCG CCGCCCTTGG GCGCCGCTCG CGGCCAGGGC TCGCCCTTTC GCGACCGGCT GTTGCGCGAG GCGGTGCTGC GCGCCGTTGA TATGACCGAC ATCGACCAGC GCCTTTTCGC CAATCAGGCG ACCCCGGCCG CCCTGATCGC CGGGCCGATG ATCGCCGGGG TGCCCGCCGC CGCCGATATC CGCCCGGCCG CCGACCCAGA CCGGGCCCGC GCCCTGCTTG CCGAAGCCGG AGTCGGCCCC GCCGGAGTCA GCGTCACCCT GGATTGCCCG CTGGGCTTCT TCCTGAACGA CGGCGCCCTT TGCGACGCCA TCGCCACCTC TCTTTCGGAA GTCGGCATTC ATACCGCCGT CGCCACCCGT TTGGCCGAAA ATCATTTCCC GAGGGTTTTA CGCGGGGAAA GCCGGTTCTT CCTAACCGGA TACCAACCGT TGACCTTGGA TATCCTCGAT CCCTTGCGCG CCCTGGCCGC TTGTCCGCCG ACCGATGGCT CGCCGAAGGA CGGCTTTGGC CAAGCCAATG GCGCGGGCTA TTGCGACCCC TCCGTCGATC GGCTTATCCG CCGTCTGGCC GACGAAATGA TCCCGGCCCG CCAAAGCGCC CTTGCCGCCG AAATCGTCCT CAAACTGCGC GATGACGTCG TCTATGTCCC CCTCCACCAG GAACCGGTGA TCTGGGGGGC GCGGGCCGAT ATCGGTCTGC GCCAGCGCGC CGACGGCGTG CTTGATCTCC GCTGGGTCAG CCCGGCGCCC GTCCCGCAAG GGCGCTGA
|
Protein sequence | MTAWIVLTAP AASTAAEFRW TAAQGPATLE PQSPEALADP GVLGDVYETL IGRDSDLDLE PALATAWQAV TDLRWRVTLR RGVRFHDGAA FTADDVVASL ERASARGGGL AEALAPVRRV RRVDDTTVDL YTQTPTPDLP QRLALIRILD DGWIARADPA MPANGTGPFR VDRFTPGGAV TLVANGDWWG RTIPADPLGT TAPPAPRLER ATLIPAARPA DRLRLLLEGK VDLALDLPPA VVPPLAATPG LRALVIGGTR TVMLGMDQRP PPLGAARGQG SPFRDRLLRE AVLRAVDMTD IDQRLFANQA TPAALIAGPM IAGVPAAADI RPAADPDRAR ALLAEAGVGP AGVSVTLDCP LGFFLNDGAL CDAIATSLSE VGIHTAVATR LAENHFPRVL RGESRFFLTG YQPLTLDILD PLRALAACPP TDGSPKDGFG QANGAGYCDP SVDRLIRRLA DEMIPARQSA LAAEIVLKLR DDVVYVPLHQ EPVIWGARAD IGLRQRADGV LDLRWVSPAP VPQGR
|
| |
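The length fields in this record are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values above, but neither is stated by the record. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces from the grouped display are ignored."""
    bases = seq.replace(" ", "").upper()
    return (bases.count("G") + bases.count("C")) / len(bases)


# Coordinates taken from the record above.
length = gene_length(2873323, 2874900)
print(length, protein_length(length))  # prints "1578 525"
```

Passing the Gene sequence field (with its space-separated groups of ten) to `gc_fraction` gives a value that can be compared against the reported GC content.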