Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0225 |
Symbol | |
ID | 3909467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 253123 |
End bp | 254784 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637882107 |
Product | extracellular solute-binding protein |
Protein accession | YP_483847 |
Protein GI | 86747351 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCATC TTCATTCCTT CAGCCGCACT GCGGTGCTGC TCGCTCTGAC GGCCGTCGGC TCTGTCGCCG TGCCGCAGAT CGCTTCATCC GAAACCGTGC TGCGGATCGG CATGACGGCC GCGGACATTC CGCGCACGCT CGGCCAGCCG GATCAGGGTT TCGAAGGCAA CCGCTTCACC GGCCTGACGA TGTATGACGG GCTGACGATG TGGGACCTGT CGTCCGCCAC CAAGGCGAGC GTGGTGATCC CGGGGCTCGC GACCGAATGG AAGGTCGAGG ACGCCGACAA GACCAAATGG ATCTTCAAGC TGCGCCCCGG CGTCACCTTC CACGACGGCA CGCCTTTCAA TGCCGACGCC GTGGTCTGGA ATGTCGACAA GGTGCTGAAC AAGGAGGCTG TTCAGTTCGA CGCCAGCCAG GTCGGCGTCA CCGCGTCGCG GATGCCGACG CTGGTCTCGG CCAAGAAGAT CGACGACATG ACGGTGGAGC TCACCGCCAA GGAGCCCGAC AGCTTCCTGC CGATCAACCT CACCAATCTG TTCATCGTCA GCCCGTCGAA ATGGCAGGCG CTGTACGAGA AGGCCGAGGG CGCCGACGCC AAGGCGCGGT CGCAGGCCGC CTGGGCCGCG TTCGCCAAGG ACGCCGCCGG CACCGGGCCG TGGAAGATGG CGAAGTTCAC GCCGCGCGAG CGGCTCGAAC TGGTGAAGAA CGACAATTAC TGGGACAAGG CGCGGGTGCC GAAGACCGAC CGCATGGTGC TGCTGCCGAT GCCCGAAGCC AACGCGCGCA CCGCGGCGCT GCTGTCCGGC CAGGTCGACT GGATCGAGGC GCCTGCGCCC GACGCGGTGA AGGAGATCAC CGCGCGCGGC TTCAAGATCG AGAAGAACGA ACAGCCGCAT GTCTGGCCGT GGCAGTTCTC GCGGATCGAG GGCTCGCCCT GGAACGACAT CCGCGTCCGC CGCGCCGCCA ATCTGTGCAT CGACCGCGAA GGCCTGCGTG ACGGCCTGCT CGCCGGCCTG ATGGTGCCGG CGACCGGCAC CTTCGAGCCC GGCCATCCGT GGCGCGGCAA TCCGTCGTTC CAGATCAAGT ACGATCTGCC GGCGGCGCAG AAGCTGATGA AGGAGGCCGG CTTCGGCCCC GACAAGAAGC TCAGCGTCAA GGTGCAGACC TCGGCGTCCG GCTCCGGCCA GATGCTGCCG CTGCCGATGA ACGAATATCT GCAGCAGGCT CTCGCCGAGT GTTACTTCGA CGTCAAGCTC GATGTCATCG AATGGAACAC GCTGTTCACC AATTGGCGGC GTGGCGTCAA GGATCCCTCG GCCAACGGCA GCAACGCCAC CAATGTCACC TACGCGGCGA TGGATCCGTT CTTCGCGCTG GTCCGCTTCC TGCAATCGTC GATGGCGCCG CCGGTGTCGA ACAATTGGGG CTACATCAAC AACCCCAAGT TCGACGAGTT GGTGAAGAAG GCACGGACCA GCTTCGACGC CGCCGAGCGC GACAAGGCGC TGGCCGAGCT GAACGCCGCC TCGATCGACG ACGCGGCCTT CCTCTACGTC GCCCACGACG TCGGCCCGCG CGCGATGAGT CCGAAGGTCA AGGGCTTCGT GCAGCCCAAG AGCTGGTTCG TCGACTTCTC GCCGGTGTCG ATGGCGCCGT AA
|
Protein sequence | MRHLHSFSRT AVLLALTAVG SVAVPQIASS ETVLRIGMTA ADIPRTLGQP DQGFEGNRFT GLTMYDGLTM WDLSSATKAS VVIPGLATEW KVEDADKTKW IFKLRPGVTF HDGTPFNADA VVWNVDKVLN KEAVQFDASQ VGVTASRMPT LVSAKKIDDM TVELTAKEPD SFLPINLTNL FIVSPSKWQA LYEKAEGADA KARSQAAWAA FAKDAAGTGP WKMAKFTPRE RLELVKNDNY WDKARVPKTD RMVLLPMPEA NARTAALLSG QVDWIEAPAP DAVKEITARG FKIEKNEQPH VWPWQFSRIE GSPWNDIRVR RAANLCIDRE GLRDGLLAGL MVPATGTFEP GHPWRGNPSF QIKYDLPAAQ KLMKEAGFGP DKKLSVKVQT SASGSGQMLP LPMNEYLQQA LAECYFDVKL DVIEWNTLFT NWRRGVKDPS ANGSNATNVT YAAMDPFFAL VRFLQSSMAP PVSNNWGYIN NPKFDELVKK ARTSFDAAER DKALAELNAA SIDDAAFLYV AHDVGPRAMS PKVKGFVQPK SWFVDFSPVS MAP
|
| |