Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2625 |
Symbol | |
ID | 4023122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 2943144 |
End bp | 2944976 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637962823 |
Product | extracellular solute-binding protein |
Protein accession | YP_569755 |
Protein GI | 91977096 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.15211 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000486729 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCCCTGC GCGCGGGCGC GCTGGCGGCG GTGATGGTCG TTGGCGTCGT CGCTTTGTTC AGCGGCGTCG CGCAGGCCGG AGCAGAGGAC GCGGCGAAAC CGTCGCATGC GCTGGCGATG CACGGCGAGC CTGCTCTGCC TGCCGACTTC ACCGCGATGC CCTATGTCAA CCCCGATGCG CCGAAAGGCG GTCGTCTGGT GGAAGGTCTG CTCGGCACCT TCGACAGCCT CAATCCGTTC ATCGTCAGGG GCATCGCGGT GCAGAGGATG CGCGGCTACG TCGTCGAGAG CCTGCTGGCG CGCGGCAACG ACGAAGCGTT CACGCTTTAC GGCCTGCTGG CGCAATCGGT CGAGACCGAC GATGCGCGCA GCTACGTCAC CTTTCGCATC GATCCGCGCG CGCGGTTCTC GGACGGCAAG CCGGTGCTGG CGCAAGACGT GCTGTTCTCC TGGCAATTGC TGCGCGACAA GGGCCGCCCC AATCATCGCA TTTACTACGC CAAGGTCGCG CGCGCCGAGG CGCCCGATCC GCGCACGGTG CGGTTCGATT TCGGCGACGT GAACGATCGG GAACTGCCAC TGATCCTCGG CCTGATGCCG ATCTTTCCCA AACACGCGGT CAACCCCGAC ACCTTCGAGG AGACGACGCT GGCGCCGCCG ATCGGGTCAG GTCCGTACCG TGTCGGCGCG GTGAAGGCCG GCGCCAGCGT CACGCTGATC CGGAACCCCG ACTATTGGGG GCGCGATCTT CCGATCAATC GCGGACTATG GAACTTCGAC GAGATCCGGA TCGATTATTT CCGTGAGGCT AACTCACATT TCGAAGCCTT CAAACGCGAG CTGTATGACT ACCGCGTCGA AAACGAGCCG CTGCGCTGGC ATGACGGCTA TGACTTTCCG GCGGCCCGCA ACGGCGACGT GATCCGCGAC GCCTTCAAAA TCCGCATGCC GCAGCCGACC GAATTTCTGG TGTTCAACAC CCGCCGTCCG GTATTCGCCG ATATCCGGGT CCGCGAAGCG CTGTTGCAAT TGTTCGATTT CGCATGGATC AACCGCAACT ACTTTTTCGG CCTGTATGCG CGCGCAGGTG GCTTCTTCGC GGGTTCGGAG CTCTCCGCCT ACACCCGCCC GGCGGAAGCC GGCGAACTTC AACTGCTGAA GCCCTATCTG GCGCGACTGC GCGCCGACGT CATCGACGGC AGCTACCGCC TGCCCGTCAG CGACGCCTCC GGCCGCGATC GCGCCACGCT CGGCCGGGCG CTGTCGCTGC TGGCGGAGGC CGGCTATCAG CTCGACGGCA CGGTGCTGCG GCGGCGCGAC AATCACCAGC CGCTGACCTT CGAAATCCTG GTCACCACGC GCGATCAGGA GCGCATCGCG CTGGCCTTCG CCCGCGACGT CAAGCGTGTC GGCATCCAAA CCTCGGTCCG CGTGGTGGAC GCGGTGCAGT TCGATCAGCG GCGGATCTCT TACGACTTCG ACATGATCCC CAACCGCTGG GACCATTCGC TGTCGCCGGG CAATGAGCAA TCGTTCTATT GGGGTGCGGA AGCGGCCGAC ACCCAGGGCA CCCGCAACTA CATGGGCGCG AAGGATCCAG CGATCGACGC CATGATCGCG GCCATGATCG CGGCGCGCGG GCATCCGGAA TTCGTCGATG CGGTGCGGGC GCTCGATCGC GTCCTGACCT CGGGCTTCTA CGTGATCCCG CTCTACAACA TCCAGGAACA ATGGATCGCG CGTTGGAATC GGATAGAACG GCCGAAAGCG AACGCACTGA CCGGCTACCT GCCCGAGACC TGGTGGGCCC GGCCACCGAC GCAGCAAAGG TGA
|
Protein sequence | MALRAGALAA VMVVGVVALF SGVAQAGAED AAKPSHALAM HGEPALPADF TAMPYVNPDA PKGGRLVEGL LGTFDSLNPF IVRGIAVQRM RGYVVESLLA RGNDEAFTLY GLLAQSVETD DARSYVTFRI DPRARFSDGK PVLAQDVLFS WQLLRDKGRP NHRIYYAKVA RAEAPDPRTV RFDFGDVNDR ELPLILGLMP IFPKHAVNPD TFEETTLAPP IGSGPYRVGA VKAGASVTLI RNPDYWGRDL PINRGLWNFD EIRIDYFREA NSHFEAFKRE LYDYRVENEP LRWHDGYDFP AARNGDVIRD AFKIRMPQPT EFLVFNTRRP VFADIRVREA LLQLFDFAWI NRNYFFGLYA RAGGFFAGSE LSAYTRPAEA GELQLLKPYL ARLRADVIDG SYRLPVSDAS GRDRATLGRA LSLLAEAGYQ LDGTVLRRRD NHQPLTFEIL VTTRDQERIA LAFARDVKRV GIQTSVRVVD AVQFDQRRIS YDFDMIPNRW DHSLSPGNEQ SFYWGAEAAD TQGTRNYMGA KDPAIDAMIA AMIAARGHPE FVDAVRALDR VLTSGFYVIP LYNIQEQWIA RWNRIERPKA NALTGYLPET WWARPPTQQR
|
| |