Gene Information |
Locus tag | PC1_4109 |
Symbol | |
ID | 8135107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pectobacterium carotovorum subsp. carotovorum PC1 |
Kingdom | Bacteria |
Replicon accession | NC_012917 |
Strand | + |
Start bp | 4643094 |
End bp | 4644635 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644867427 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003019660 |
Protein GI | 253690470 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| |
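The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the Gene Length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4643094
END_BP = 4644635
GENE_LENGTH_BP = 1542
PROTEIN_LENGTH_AA = 513


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions, the computed span agrees with the listed Gene Length, and the codon count minus the stop codon agrees with the listed Protein Length.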
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.344383 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCAT TTGTTCGCCG CTCTGCCGTT GCCCTCGGGC TCTCACTGTG TCTGGCGGCT GTTGCCCAGG CGCAAGACCT TCGCATTTCT ATCTATGCCG ACATCACCGG GCTCGACCCG CACGACACCT CGGACACGCT GAGCTACTCC ATTCAGAGCG GCATCTTCGA GCGTCTGTTC CAGTTCGATA ATAAAATGAA GCTGGTACCG CGTCTGGCGA CGGGCTACAC CAGTAACGAT AACGCCACCG AATTCGTCGT AACGCTGCGT GAAGGCATCA CCTTCCAGGA CGGCGCACCG TTCAACGCCG ACGCCGTTAA AGCCAACCTC GACCGTCTGG CCGATCAGAG CAAAGGTCTG AAGCGCAACA GCCTGTTTAA CATGGTGCAA ACCGTCACCG TGCTGTCGCC GACGCAGGTT AAAATCGAGC TGAACAAATC CTTCGGTGCC TTTGTGAACA CGCTGGCGCA CCCGTCTGCC GTCATGCACA GCCCGGAAGC CCTGAAGAAA TACCCGGACG AAGCACAGCT ACGCGTACAC CCGGTCGGTA CCGGTCCATT CAAGTTCACC GAATGGCAGC AGGGTAAAGA CGTGAAGCTG GTGAAATTCG ACAACTACTG GCAGAAAGGC TGGCCGAAAG TCGACAGCGT GACCTTCTAC CCGACGCCGG AAGACTCCAC TCGTGTGGCG TCGCTGAAAT CCGGTCAGGT TGATGCCGTG TATCCACTGC CTTCCGATCT GATCGCCACC GTACAAAGCG ACAGCAAGCT GGCGATTCAG CGCGACCCGA GCATCTATCA ATTCTGGCTG GCGATGAACA ACCTGCGTCC GCCGTTGAAC GATATCCGCG TGCGTCAGGC GCTTAATTAC GCCATCAACC GCGACATCTG GCTGAAAGTG GGCTTTGCCG GTATGGGCGT TCCTGCCTCC TCGGCAATGG CGCCGGATGT GCAGTTCTTC GCGCGTCAAA CCTCGCCGAA CTACACCTAT AACCCAGAGA AAGCTAAGGC GCTGCTGAAA GAAGCGGGCT ATGCCAACGG CCTGAACCTG AAACTGTGGA CGACGAACCG CACCGACTAC ATCCGCAGCG CGCAGTTCTT CAAACAACAG TTAGAGCAGG TTGGCGTCAA AGTTACCGTC ACACCGATGG ATTCCGGGAT GCGTAATGCC AAACTGTTTG GCGTGAAAGA TCCGAAAGAT GCCGAATTTG ACCTGTTCTA CAACGGCTGG TCTCCATCCA CCGGTGATGC CGACTGGGCG CTGCGTCCGC TGTTCGCCAC CGAGTCTTGG GTGCCGGTCG CGTACAACGT CTCCTACTAC AGCAACCCGG TAGCGGATAA AGCGATTACC GCCGGTCTGG CCACCGCCGA TGCTGACAAA CGTGCCGCCG CTTACGCTGA CGCACAGCGC CAGATTTGGC AGGACGCGCC TGTGGTCTTC CTGGGTACAC CGGACAACAT TGTCGGTAAA ACCAAGAACC TCGACGGCGT GTACATGCTG GCAGACGGCT CGCTGATCTT CGATCAGGCT GAATTTAAGT AA
|
Protein sequence | MKPFVRRSAV ALGLSLCLAA VAQAQDLRIS IYADITGLDP HDTSDTLSYS IQSGIFERLF QFDNKMKLVP RLATGYTSND NATEFVVTLR EGITFQDGAP FNADAVKANL DRLADQSKGL KRNSLFNMVQ TVTVLSPTQV KIELNKSFGA FVNTLAHPSA VMHSPEALKK YPDEAQLRVH PVGTGPFKFT EWQQGKDVKL VKFDNYWQKG WPKVDSVTFY PTPEDSTRVA SLKSGQVDAV YPLPSDLIAT VQSDSKLAIQ RDPSIYQFWL AMNNLRPPLN DIRVRQALNY AINRDIWLKV GFAGMGVPAS SAMAPDVQFF ARQTSPNYTY NPEKAKALLK EAGYANGLNL KLWTTNRTDY IRSAQFFKQQ LEQVGVKVTV TPMDSGMRNA KLFGVKDPKD AEFDLFYNGW SPSTGDADWA LRPLFATESW VPVAYNVSYY SNPVADKAIT AGLATADADK RAAAYADAQR QIWQDAPVVF LGTPDNIVGK TKNLDGVYML ADGSLIFDQA EFK
|
| |
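The sequences above are displayed in space-separated blocks of ten characters, so the spaces have to be removed before the sequence is measured or analysed. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


if __name__ == "__main__":
    # First two ten-base blocks of the Gene sequence field, used as a small example.
    excerpt = clean_sequence("ATGAAGCCAT TTGTTCGCCG")
    print(len(excerpt))
    print(gc_fraction(excerpt))
```

Applying the same two helpers to the full Gene sequence field would give a length and GC percentage that can be compared with the Gene Length and GC content values in the record.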