Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3534 |
Symbol | |
ID | 4024048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 3923295 |
End bp | 3925175 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637963738 |
Product | extracellular solute-binding protein |
Protein accession | YP_570658 |
Protein GI | 91977999 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.173833 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCAGC TCAACCGCCG CAATGTGCTC GGCCTCGGAA TCGGCGCGCT GGCCGCGGCG CATCTTCGCC CCGCGGCTGC GGCCGAGGGA GAGACGGTCG CCCACGGCAT GTCCGCCTTC GGCGACCTGA AGTACCGGGC CGATTTTCCG CATTTCGACT ACGTCAATCC TCGGGCGCCG AAGGGCGGGC TGTTCTCGAC CATTCCGTCG GTGCGCGCCT TCAACCAGTC GTTTCACACG TTCAATTCGC TCAATGCCTA CGTCCTGAAG GGCGATGGCG CTCAGGGCAT GGGCCTCACT TTCGCGACGC TGATGGCGCG GGCCGGCGAC GAGCCCGACG CGATGTACGG CCTCGCGGCG TCGTCGGTGG CGATCTCTCG CGACGGTCTG ACCTATCGCT TCACCATGCG CCCGGAGGCG CGCTTCCACG ACGGCAGCAA GCTCACCGCG CGAGACGCCG CGTTCTCGCT GAACATCCTG AAGGCCAAGG GCCATCCGCT GATCACCCAG CAGATGCGCG ACTTCATCAA GGCGGAAGCG ACCGACGACG CCACGCTGGT CGTGACGTTC GCGCCGAAGC GCGGCCGCGA CGTACCGCTG TTCACGGCCT CCCTGCCGCT GTTCTCCGAG GCCTACTACG CGAAACGGCC GTTCGACGAA TCGACCATGG AGGTGCCGCT CGGCAGCGGC CCCTACAAGG TGGGCCATTT CGAATCCGGC CGCTCCATCA CCTTCGATCG CGTCAAGGAC TGGTGGGGCG CGAAGCTGCC GGTCAATGTC GGGTCGAACA ATTTCGACAC CGTCCGGTTC GAGTTCTATC GCGATCGCGA CGTCGCTTTC GAGGGCTTCT CCGGCCGCAA TTATCTGTAT CGCGAGGAGT TCACCTCGCG GATCTGGAGT ACGCGCTACG ATTTCCCCGC GGTCCATGAC GGCCGGGTCA AGCGCGAGCA GCTTCCTGAC GGGACCCCGT CCGGCTCGCA GGGCTGGTTC ATCAACACGC GGCGCGACAA GTTCAAGGAC CCGCGGGTGC GCGAGGCGAT CGGCTGCGCG TTCGATTTCG AATGGACCAA CAAGACCATC ATGTACGGCG CCTATCAACG CACGGTGTCG CCGTTCCAGA ATTCCGATCT GATGGCGGTG GGGCCGCCGT CGCCCGACGA ACTGGCGCTG CTCGCACCGT ACCGCGGCAA GGTGCAGGAC GAGGTGTTCG GCGCGCCGTT TCTGCCGCCG GCGTCCGATG GCTCGGGACA GGACCGCGCG CTGCTGCGCA GAGGCGGTCA GCTTCTGACC GAGGCCGGCT TCGCGATCAA GGATCGCCAG CGGCTGACGC CGCAGGGCGA GCCGATGCGG ATCGAGTTTC TGCTCGACGA GCCGTCGTTC CAGCCGCACC ACATGCCGTT CATCAAGAAC CTCGGCACCC TGGGGATCGA GGCGACGTTG CGGCTGGTCG ACCCGGTGCA GTTTCGCGCC CGACGTGACG ATTTTGATTT CGATATGGCG ATCGAGCGCT TCGGCTTCTC GACCGTGCCG GGCGATGCGC TGCGCAGTTT CTTCTCGTCG CAATCGGCCG CGACCAAGGG CTCGAACAAT CTCGCCGGCA TCGCCGATCC CGCGATCGAT GCGATGATGG ATCAGGTGAT CGCTGCCGAC ACCCGGGCGA AGCTGGTTGT CGCCGCGCGG GCGCTCGACC GGCTGATCCG CGCCGGCCGC TATTGGGTGC CGCAATGGTA CTCCGCCTCG CACCGGCTGG CCTATTGGGA CGTGTTCGGC CATCCGCCGA ACCTGCCGAA ATATATCGGC GTCGGCGCGC CGGATCTGTG GTGGTCGGAG CCGAAAGCCG CGGCCGCCGC CGACGGCGAC GTCAAAGGCG AGGGAAAATA G
|
Protein sequence | MVQLNRRNVL GLGIGALAAA HLRPAAAAEG ETVAHGMSAF GDLKYRADFP HFDYVNPRAP KGGLFSTIPS VRAFNQSFHT FNSLNAYVLK GDGAQGMGLT FATLMARAGD EPDAMYGLAA SSVAISRDGL TYRFTMRPEA RFHDGSKLTA RDAAFSLNIL KAKGHPLITQ QMRDFIKAEA TDDATLVVTF APKRGRDVPL FTASLPLFSE AYYAKRPFDE STMEVPLGSG PYKVGHFESG RSITFDRVKD WWGAKLPVNV GSNNFDTVRF EFYRDRDVAF EGFSGRNYLY REEFTSRIWS TRYDFPAVHD GRVKREQLPD GTPSGSQGWF INTRRDKFKD PRVREAIGCA FDFEWTNKTI MYGAYQRTVS PFQNSDLMAV GPPSPDELAL LAPYRGKVQD EVFGAPFLPP ASDGSGQDRA LLRRGGQLLT EAGFAIKDRQ RLTPQGEPMR IEFLLDEPSF QPHHMPFIKN LGTLGIEATL RLVDPVQFRA RRDDFDFDMA IERFGFSTVP GDALRSFFSS QSAATKGSNN LAGIADPAID AMMDQVIAAD TRAKLVVAAR ALDRLIRAGR YWVPQWYSAS HRLAYWDVFG HPPNLPKYIG VGAPDLWWSE PKAAAAADGD VKGEGK
|
| |