Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1226 |
Symbol | |
ID | 3910161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1403317 |
End bp | 1404927 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637883120 |
Product | extracellular solute-binding protein |
Protein accession | YP_484847 |
Protein GI | 86748351 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.560776 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGAAA CGTCGCATTG GCTGCGTTCA GTAGCTGCGT CGAAATTCGC GCTGCCGGCG CTGGCGCTCG CAGCTTCGCT GACACTGCCC GCGTTTGCCG ATGCCAAGAC CATTCAAGCG GTGATGCATT CCGATCTGCG CGTCACCGAT CCCGGTCTGA CGACCGCCTA CATCACGCGT GACCATGGCT ACATGGTTTA CGACACGTTG CTGGCGATGG ACTCCAACTT CAAAGTCCAG CCGCAGATGG CGGAGTGGAA GGTCTCCGAC GACAAGCTGA CCTACACGTT CACGTTGCGC GACGGCCTGA AATGGCACGA CGGCAAGCCG GTCACCGCCG AGGATTGCGT CGCGTCGCTG AAGCGCTGGG GCCAGAAGGA CGGCATGGGC CAGAAGCTGA TGGACTTCAC GGCCAGCCTT GAGGCTACCG ATGCCAAGAC CATCACGCTG AAGCTGAAGG AGCCCTACGG CCTGGTGCTC GAATCGATCG GCAAGCCGTC TTCGCTGGTG CCGTTCATGA TGCCGAAGCG TCTGGCCGAA ACCTCGCCCG ACAAGGCGAT CCCCGAGCAG ATCGGGTCCG GTCCGTTCAA GTTCGTCGCG GCCGAATTCC AGCCGGGCGT CAAGGCGGTG TACGTCAAGA ACACCGACTA CGTGCCGCGC AAGGAGGCGC CGAGCTGGAC CTCGGGCGGC AAGGTCGTGA AGGTCGACCG TGTCGAATGG ATCACCATGC CGGATGCGCA GACCGCGGTG AACGCGCTGC AGTCCGGCGA CATCGATTTC ATCGAGGCCC CGTCCTTCGA CATGCTGCCG GTGCTGAAGG CCGACAAGGA GCTGACGATT CAGACGCTCA GTCCGCTCGG CTTCCAGACG CTCGGCCGGA TGAACTTCCT GTATCCGCCG TTCGACAACC TCAAGGTCCG CCGCGCCGCG TTCCTGGCGT TGAGCCAGAA GCCGGTGCTC GACGCGCTGG TCGGCAATCC CGATTACTAC ATCGTCTGCG GTGCCGTGTT CGGTTGCGGC ACACCCTTGG CCTCCGACGT CGGCTCCGAG TCCCTGGTGA AGGGCAACGG CATGGCCGAG GCCAAGAAGC TGCTCGCCGA GTCCGGCTAT GACGGCACCC CGATCGCGCT GATGGCGCCG GGCGACGTGG TCTCGCTCAA GGCCCAGCCG ATCGTTGCGG CGCAACTCCT GCGCGAGGCC GGCTTCAAGG TCGACGTTCA GGCCACCGAC TGGCAGACAG TGGTGACGCG GCGCGCCAGC CAGAAGCCGC CCGCGGAAGG CGGCTGGAAC ATGTTCTTCA CCAATTGGGC CGGCCCGGAC ATTCTCAACC CGATCGCCAA CGTCTCGACC GGCGGCCGGG GCAAGAACGG CGGCTGGTTC GGCTGGCCGG AGGACGCCAA GATCGAAGAG CTGCGCGACA AGTTCGCCCG CTCGACCTCG CCGGAGGAGC AGAAGAAGCT CGCCGCGGAG ATCCAGAAGG AGGTCTACGA CAAGGTGATC TACATCCCGC TCGGGCAATA CAAGTCGCCG AGCGTGTGGC GGAAGGATCT GACCGGCGTG CTCACCGGCC CGGCGACCCC GGTGTTCTGG AACATCGACA AGAAGGAGTA G
|
Protein sequence | MIETSHWLRS VAASKFALPA LALAASLTLP AFADAKTIQA VMHSDLRVTD PGLTTAYITR DHGYMVYDTL LAMDSNFKVQ PQMAEWKVSD DKLTYTFTLR DGLKWHDGKP VTAEDCVASL KRWGQKDGMG QKLMDFTASL EATDAKTITL KLKEPYGLVL ESIGKPSSLV PFMMPKRLAE TSPDKAIPEQ IGSGPFKFVA AEFQPGVKAV YVKNTDYVPR KEAPSWTSGG KVVKVDRVEW ITMPDAQTAV NALQSGDIDF IEAPSFDMLP VLKADKELTI QTLSPLGFQT LGRMNFLYPP FDNLKVRRAA FLALSQKPVL DALVGNPDYY IVCGAVFGCG TPLASDVGSE SLVKGNGMAE AKKLLAESGY DGTPIALMAP GDVVSLKAQP IVAAQLLREA GFKVDVQATD WQTVVTRRAS QKPPAEGGWN MFFTNWAGPD ILNPIANVST GGRGKNGGWF GWPEDAKIEE LRDKFARSTS PEEQKKLAAE IQKEVYDKVI YIPLGQYKSP SVWRKDLTGV LTGPATPVFW NIDKKE
|
| |