Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_00311 |
Symbol | mhpT |
ID | 8114221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 344694 |
End bp | 345905 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644846597 |
Product | hypothetical protein |
Protein accession | YP_002998170 |
Protein GI | 251783866 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00895] benzoate transport |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGACTC GTACCCCTTC ATCATCTTCA TCCCGCCTGA TGCTGACCAT CGGGCTTTGT TTTTTGGTCG CTCTGATGGA AGGGCTGGAT CTTCAGGCGG CTGGCATTGC GGCGGGTGGC ATCGCCCAGG CTTTCGCACT CGATAAAATG CAAATGGGCT GGATATTTAG CGCCGGAATA CTCGGTTTGC TACCCGGCGC GTTGGTTGGC GGAATGCTGG CGGACCGTTA TGGTCGTAAG CGTATTTTGA TTGGCTCAGT TGCGCTGTTT GGTTTGTTCT CACTGGCAAC GGCGATTGCC TGGGATTTCC CCTCACTGGT CTTTGCGCGG CTGATGACCG GTGTCGGGCT GGGGGCGGCG TTGCCGAATC TTATCGCCCT GACGTCTGAA GCCGCGGGTC CACGTTTTCG TGGGACGGCA GTGAGCCTGA TGTATTGCGG TGTTCCCATT GGCGCGGCGC TGGCGGCGAC ACTGGGTTTC GCGGGGGCAA ACTTAGCATG GCAAACGGTG TTTTGGGTAG GTGGTGTGGT GCCGTTGATT CTGGTGCCGC TATTAATGCG CTGGCTGCCG GAGTCGGCGG TTTTCGCTGG CGAAAAACAG TCTGCGCCAC CACTGCGTGC CTTATTTGCG CCAGAAACGG CAACCGCGAC GCTGCTGCTG TGGTTGTGTT ATTTCTTCAC TCTGCTGGTG GTCTACATGT TGATCAACTG GCTACCGCTA CTTTTGGTGG AGCAAGGATT CCAGCCATCG CAGGCGGCAG GGGTGATGTT TGCCCTGCAA ATGGGGGCGG CAAGCGGGAC GTTAATGTTG GGCGCATTGA TGGATAAGCT GCGTCCAGTA ACCATGTCGC TACTGATTTA TAGCGGCATG TTAGCTTCGC TGCTGGCGCT TGGAACGGTG TCGTCATTTA ACGGTATGTT GCTGGCGGGA TTTGTCGCGG GGTTGTTTGC GACAGGTGGG CAAAGCGTTT TGTATGCCCT GGCACCGTTG TTTTACAGTT CGCAGATCCG CGCAACAGGT GTGGGAACAG CCGTGGCGGT AGGGCGTCTG GGGGCTATGA GCGGTCCGTT ACTGGCCGGG AAAATGCTGG CATTAGGCAC TGGCACGGTC GGCGTAATGG CCGCTTCTGC ACCGGGTATT CTTGTTGCTG GGTTGGCGGT GTTTATTTTG ATGAGCCGGA GATCACGAAT ACAGCCGTGC GCCGATGCCT GA
|
Protein sequence | MSTRTPSSSS SRLMLTIGLC FLVALMEGLD LQAAGIAAGG IAQAFALDKM QMGWIFSAGI LGLLPGALVG GMLADRYGRK RILIGSVALF GLFSLATAIA WDFPSLVFAR LMTGVGLGAA LPNLIALTSE AAGPRFRGTA VSLMYCGVPI GAALAATLGF AGANLAWQTV FWVGGVVPLI LVPLLMRWLP ESAVFAGEKQ SAPPLRALFA PETATATLLL WLCYFFTLLV VYMLINWLPL LLVEQGFQPS QAAGVMFALQ MGAASGTLML GALMDKLRPV TMSLLIYSGM LASLLALGTV SSFNGMLLAG FVAGLFATGG QSVLYALAPL FYSSQIRATG VGTAVAVGRL GAMSGPLLAG KMLALGTGTV GVMAASAPGI LVAGLAVFIL MSRRSRIQPC ADA
|
| |