Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03943 |
Symbol | proP |
ID | 8115759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 4235671 |
End bp | 4237173 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644850096 |
Product | hypothetical protein |
Protein accession | YP_003001669 |
Protein GI | 251787365 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00883] metabolite-proton symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAAAA GGAAAAAAGT AAAACCCATT ACCCTTCGTG ATGTCACCAT TATTGATGAC GGTAAACTGC GTAAAGCCAT TACCGCAGCA TCACTGGGTA ATGCAATGGA ATGGTTTGAT TTTGGTGTTT ATGGTTTTGT TGCTTACGCA TTAGGTAAAG TTTTTTTCCC GGGGGCTGAC CCCAGCGTGC AGATGGTTGC TGCACTTGCC ACTTTCTCCG TTCCCTTTCT GATTCGACCG CTTGGCGGGC TCTTCTTTGG TATGTTGGGC GATAAATATG GTCGCCAGAA GATCCTCGCT ATCACTATTG TGATTATGTC GATCAGTACG TTCTGTATTG GCTTAATACC GTCCTACGAC ACGATTGGTA TTTGGGCACC GATTCTGCTG TTGATCTGTA AGATGGCACA AGGTTTCTCG GTCGGCGGTG AATATACCGG GGCGTCGATA TTTGTTGCGG AATACTCCCC TGACCGTAAA CGTGGCTTTA TGGGCAGCTG GCTGGACTTC GGTTCTATTG CCGGGTTTGT GCTGGGTGCG GGCGTGGTGG TGTTAATTTC GACCATTGTC GGCGAAGCGA ACTTCCTCGA CTGGGGCTGG CGTATTCCGT TCTTTATTGC TCTGCCGTTA GGGATTATCG GGCTTTACCT GCGCCATGCG CTGGAAGAAA CTCCGGCGTT TCAGCAGCAT GTTGATAAAC TGGAACAGGG CGACCGCGAA GGTTTGCAGG ATGGCCCGAA AGTCTCGTTT AAAGAGATTG CCACTAAATA CTGGCGCAGC CTGTTGACAT GTATTGGTCT GGTTATTGCC ACCAACGTGA CTTACTACAT GTTGCTGACC TATATGCCGA GTTATTTGTC GCATAACCTG CATTACTCCG AAGACCACGG GGTGCTGATT ATTATCGCCA TTATGATCGG TATGCTGTTT GTCCAGCCGG TGATGGGCTT GCTGAGTGAC CGTTTTGGCC GTCGTCCGTT TGTGCTACTT GGTAGTGTTG CCCTGTTTGT GTTGGCGATC CCGGCGTTTA TTCTGATTAA CAGTAACGTC ATCGGTCTGA TTTTTGCCGG GTTACTGATG CTGGCGGTGA TCCTTAACTG CTTTACGGGC GTTATGGCTT CTACCTTGCC AGCGATGTTC CCGACGCATA TCCGATACAG CGCGCTGGCG GCGGCATTTA ATATTTCGGT GCTGGTTGCC GGTCTGACGC CAACGCTGGC GGCCTGGCTG GTCGAAAGCT CGCAGAATCT GATGATGCCT GCCTATTACC TGATGGTAGT GGCGGTGGTT GGTTTAATCA CCGGCGTAAC CATGAAAGAG ACGGCAAATC GTCCGTTGAA AGGTGCGACA CCGGCGGCGT CAGATATACA GGAAGCGAAG GAAATTCTCG TCGAGCATTA CGATAATATC GAGCAGAAAA TCGATGATAT TGACCACGAG ATTGCCGATT TGCAGGCGAA ACGTACCCGC CTGGTGCAGC AACATCCGCG AATTGATGAA TAA
|
Protein sequence | MLKRKKVKPI TLRDVTIIDD GKLRKAITAA SLGNAMEWFD FGVYGFVAYA LGKVFFPGAD PSVQMVAALA TFSVPFLIRP LGGLFFGMLG DKYGRQKILA ITIVIMSIST FCIGLIPSYD TIGIWAPILL LICKMAQGFS VGGEYTGASI FVAEYSPDRK RGFMGSWLDF GSIAGFVLGA GVVVLISTIV GEANFLDWGW RIPFFIALPL GIIGLYLRHA LEETPAFQQH VDKLEQGDRE GLQDGPKVSF KEIATKYWRS LLTCIGLVIA TNVTYYMLLT YMPSYLSHNL HYSEDHGVLI IIAIMIGMLF VQPVMGLLSD RFGRRPFVLL GSVALFVLAI PAFILINSNV IGLIFAGLLM LAVILNCFTG VMASTLPAMF PTHIRYSALA AAFNISVLVA GLTPTLAAWL VESSQNLMMP AYYLMVVAVV GLITGVTMKE TANRPLKGAT PAASDIQEAK EILVEHYDNI EQKIDDIDHE IADLQAKRTR LVQQHPRIDE
|
| |