Gene Information |
Locus tag | RPD_4022 |
Symbol | |
ID | 4024539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4469508 |
End bp | 4471118 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637964225 |
Product | extracellular solute-binding protein |
Protein accession | YP_571142 |
Protein GI | 91978483 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
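The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the stop codon; both assumptions agree with the values in this record but are not stated by it. The function names are illustrative only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the record for RPD_4022.
gene_len = feature_length(4469508, 4471118)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)  # 1611 536
```

Both results match the Gene Length (1611 bp) and Protein Length (536 aa) fields.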
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00714098 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCACA CGTCGCATTG GCTGCGTTCA GTAGCTGTGT CGAAATTCGC AGTGCCGGCG CTGGCGCTCG CAGCTTCGCT GACGCTCCCT GCGTTTGCTG ATGCCAAGAC CATTCATGCG GTGATGCATT CCGATCTGCG CGTGACCGAT CCCGGACTGA CCACCGCCTA CATCACCCGC GATCATGGCT ACATGGTCTA TGATACGCTG CTCGCGATGG ACTCGAACTT CAAGGTCCAG CCGCAGATGG CGGAGTGGAA GGTCTCCGAG GACAAACTGA CCTACACCTT CACGCTGCGC GACGGCCTGA AGTGGCACGA TGGCGCTCCG GTCACCGCCG AGGATTGCGT CGCCTCGCTG AAGCGCTGGG GCCAGAAAGA CGGCATGGGC CAGAAGCTGA TGGACTTCAC GGCGAGCCTC GAGGCCACCG ATGCGAAGAC TATCACGCTG AAGTTGAAGG AGCCCTACGG GCTGGTGCTG GAGTCGATCG GCAAGCCATC GTCGCTGGTG CCGTTCATGA TGCCGAAGCG GATCGCCGAG ACGCCGCCGG ACAAGGCGAT CCCCGAGCAG ATCGGCTCCG GTCCGTTCAA ATTCGTCGCC GCGGAATTTC AGCCGGGCGT CAAGGCGGTT TACGTCAAGA ACGCCGACTA CGTGCCGCGC AAGGAGGCGC CGAGCTGGAC GTCGGGCGGC AAGGTGGTGA AGGTCGACCG GGTCGAATGG ATCACCATGC CCGACGCGCA GACCGCGGTG AACGCGCTGC AATCCGGCGA CATCGATTTC ATCGAGAATC CGTCCTTCGA CATCCTGCCC GTTCTGAAGC AGGACAAGGA ATTGACGATC CACACGCTGA GCCCGCTCGG CTTTCAGACG CTCGGCCGGA TGAACTTCCT GTATCCGCCG TTCGATAACG TCAAGGTTCG CCGCGCCGCG TTTCTGGCGA TGAGCCAGAA GCCGGTGCTC GATGCGCTGG TCGGCAACCC GCAATACTAC AAGGTCTGCG GCGCCGTGTT CGGCTGCGGC ACGCCGCTGG CGTCGGACGT CGGCTCCGAG ACGTTGGTCA AGGGCAGCGG CATGGCGGAG GCCAAGAAGC TGCTCGCCGA GTCCGGCTAT GACGGCACGC CGATCGCGCT GATGGCGCCG GGCGACGTCG TCACCCTGAA GGCGCAGCCG ATCGTGGCGG CGCAGCTGTT GCGCGAGGCC GGTTTCAAGG TCGACGTCCA GGCCACCGAC TGGCAGACCG TGGTGACGCG GCGCGCCAGC CAGAAACCGC CGAAGGACGG CGGCTGGAAC ATGTTCTTCA CCAATTGGGC GGGTCCGGAC ATTCTCAATC CGGTCGCCAA TGTTTCGACC GGGGGCAAGG GCAAGAACGG CGGCTGGTTC GGCTGGGCGG AGGACGCCAG GGTTGAAGAG CTGCGCGACA AATTCGCCCG CGCGACCTCG CCCGACGAGC AGAAGAAGCT CGCCGAGGAG ATCCAGAAAG AAGTCTACGA CAAGGTGATC TATATTCCGC TCGGCCAGTA CACAGCGCCC AGCGTATGGC GCAACGAACT GACCGGCGTG CTCGACGGCC CGGCGACGCC GGTGTTCTGG AATATCGACA AGAAGGAATA G
|
Protein sequence | MFHTSHWLRS VAVSKFAVPA LALAASLTLP AFADAKTIHA VMHSDLRVTD PGLTTAYITR DHGYMVYDTL LAMDSNFKVQ PQMAEWKVSE DKLTYTFTLR DGLKWHDGAP VTAEDCVASL KRWGQKDGMG QKLMDFTASL EATDAKTITL KLKEPYGLVL ESIGKPSSLV PFMMPKRIAE TPPDKAIPEQ IGSGPFKFVA AEFQPGVKAV YVKNADYVPR KEAPSWTSGG KVVKVDRVEW ITMPDAQTAV NALQSGDIDF IENPSFDILP VLKQDKELTI HTLSPLGFQT LGRMNFLYPP FDNVKVRRAA FLAMSQKPVL DALVGNPQYY KVCGAVFGCG TPLASDVGSE TLVKGSGMAE AKKLLAESGY DGTPIALMAP GDVVTLKAQP IVAAQLLREA GFKVDVQATD WQTVVTRRAS QKPPKDGGWN MFFTNWAGPD ILNPVANVST GGKGKNGGWF GWAEDARVEE LRDKFARATS PDEQKKLAEE IQKEVYDKVI YIPLGQYTAP SVWRNELTGV LDGPATPVFW NIDKKE
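The sequence fields can be related to each other with a short script. The following is a minimal sketch, not the database's own method: it strips the spacing between 10-base blocks, computes a GC fraction, and translates codon by codon using the amino-acid assignments of translation table 11, which are the same as those of the standard code. It ignores the alternative start codons that table 11 allows, which does not matter here because the listed sequence starts with ATG. The listed gene sequence begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene is on the minus strand; the sketch therefore does no reverse complementing. All function names are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (amino-acid assignments
# shared by the standard code and translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, dropping one trailing stop codon."""
    seq = clean(seq)
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGTTCCACA CGTCGCATTG GCTGCGTTCA"
print(translate(prefix))  # MFHTSHWLRS
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.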