Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1771 |
Symbol | |
ID | 3909758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2030473 |
End bp | 2032350 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637883665 |
Product | extracellular solute-binding protein |
Protein accession | YP_485390 |
Protein GI | 86748894 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.235807 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0344403 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCAGC TCAACCGCCG CGAAGTTCTC GTCCTCGGCG TCGGCGCCCT GGCGGCGGCC CGGATCAGCC CCGCATCCGC GGCCGACGGC GAGACCACAG CCCACGGCAT GTCGGCGTTC GGCGACCTGA AATACAAGGC GGACTTTCCG CATTTCGACT ACGTCGATCC GCAGGCCCCC AAAGGCGGGC TGTTCTCGAC CATCCCGTCC AGCCGCGCCT TCAATCAATC GTTCCAGACC TTCAACTCGC TCAACGCCTA CATCCTCAAG GGCGACGGCG CGCAGGGCAT GGGTCTCACC TTCGCGTCCT TGATGGCCCG GGCCGGCGAC GAGCCCGACG CGATGTACGG CCTCGCGGCC GCGAGGGTCG CGATCTCCGC CGACGGGTTG AGCTATCGCT TCACCATGCG TCCAGAGGCG CGCTTCCACG ACGGCAGCAA GCTCACCGCG CGCGACGCGG CGTTCTCGCT GAACATCCTC AAGGCCAAGG GGCATCCGCT TATCACCCAG CAGATGCGCG ATTTCATCGA GGCGAAGGCA GTCGACGATT CGACGCTGTT GGTGACGTTC AAGCCGAAGC GCGGCCGCGA CGTGCCGCTG TTCGTGGCCG CGCTGCCGTT GTTCTCCGAG GCGTACTACG CCAAGCGGCC GTTCGACGAA TCGACCATGG AGATCCCGCT CGGCAGCGGG CCCTACAAGG TCGGCCCTTT CGAATCCGGC CGCTTCATCG CTTTCGAGCG CGTCAAGGAC TGGTGGGGCG CGGCGCTGCC GGTCAATGTC GGGGCGTTTA ATTTCGACAC CGTCCGGTTC GAGTTCTATC GCGATCGCGA CGTCGCCTTC GAGGGCTTCA CCGGCCGCAA CTATCTGTAT CGCGAGGAAT TCACCTCGCG GATCTGGAAC ACGCGCTACG ACTTTCCGGC GATCCATGAG GGCCGGGTCA AGCGCGAGAC GCTGCCCGAC GAGACGCCGT CCGGCGCGCA GGGCTGGTTC ATCAACACGC GGCGCGACAA GTTCAAGGAT CCGCGGGTGC GCGAGGCGCT GGGCTGCGCG TTCGACTTCG AATGGACCAA CAAGACCATC ATGTACGGCG CCTATGCGCG CACGGTGTCG CCGTTCCAGA ATTCCGACAT GATGGCGGTC GGGCCGCCGT CGGCCGACGA ACTGGCGCTG CTCGAGCCGT TCCGCGGCAA GGTGCCCGAC GAGGTGTTCG GCGCGCCGTT CGTGCCGCCG GCCTCCGACG GCTCGGGGCA GGACCGCGCG CTGCTGCGCC GCGGCGGTCA GTTGCTGACA GAGGCCGGAT TCGTCGTCAA GGACCGCAAG CGCCTGATGC CGAACGGCGA GCCGATGCGC GTCGAGTTTC TGCTCGACGA GCCGGCGTTC CAGGCCCACC ACATGCCGTT CGTCAAGAAC CTCGCCACGC TCGGCATCGA GGCGACGGTG CGGCTGGTCG ACCCGGTTCA GTCGCGGGCG CGCCGCGACG ATTTCGACTT CGACATGGCG ATCGAGCGTT TCAGCTTTTC GACCGTGCCG GGCGAGGCAC TGCGCAATTT CTTCTCGTCG CAATCGGCCG CCATCAAGGG CTCGAACAAT CTCGCCGGCA TCGCCGATCC GGCGATCGAC GCGATGATCG ATCGGGTGAT CGCCGCCGAC AGCCGTGCCG ACCTCGTCGT CGCCGCGCGC GCGCTCGACC GGCTCGTGCG CGCCGGCCGC TATTGGGTGC CGCAGTGGTT TTCGTCGTCG CATCGGCTGG CCTATTGGGA CGTGTTCGGC CATCCGCCGA ACCTGCCGAA ATACACCGGC GTCAGCGCCC CGGACCTGTG GTGGGCGAAA AGCAATCCCG CCGCCGAGCG AAGCGACCCG AAGGGCGAGG GGAAGTAG
|
Protein sequence | MAQLNRREVL VLGVGALAAA RISPASAADG ETTAHGMSAF GDLKYKADFP HFDYVDPQAP KGGLFSTIPS SRAFNQSFQT FNSLNAYILK GDGAQGMGLT FASLMARAGD EPDAMYGLAA ARVAISADGL SYRFTMRPEA RFHDGSKLTA RDAAFSLNIL KAKGHPLITQ QMRDFIEAKA VDDSTLLVTF KPKRGRDVPL FVAALPLFSE AYYAKRPFDE STMEIPLGSG PYKVGPFESG RFIAFERVKD WWGAALPVNV GAFNFDTVRF EFYRDRDVAF EGFTGRNYLY REEFTSRIWN TRYDFPAIHE GRVKRETLPD ETPSGAQGWF INTRRDKFKD PRVREALGCA FDFEWTNKTI MYGAYARTVS PFQNSDMMAV GPPSADELAL LEPFRGKVPD EVFGAPFVPP ASDGSGQDRA LLRRGGQLLT EAGFVVKDRK RLMPNGEPMR VEFLLDEPAF QAHHMPFVKN LATLGIEATV RLVDPVQSRA RRDDFDFDMA IERFSFSTVP GEALRNFFSS QSAAIKGSNN LAGIADPAID AMIDRVIAAD SRADLVVAAR ALDRLVRAGR YWVPQWFSSS HRLAYWDVFG HPPNLPKYTG VSAPDLWWAK SNPAAERSDP KGEGK
|
| |