Gene Information |
Locus tag | B21_02736 |
Symbol | galP |
ID | 8114206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 2918347 |
End bp | 2919741 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644848927 |
Product | hypothetical protein |
Protein accession | YP_003000500 |
Protein GI | 251786196 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00879] MFS transporter, sugar porter (SP) family |
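The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal check, not part of the original record. It assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(2918347, 2919741)
print(length)                     # 1395
print(protein_length_aa(length))  # 464
```

Both results match the Gene Length (1395 bp) and Protein Length (464 aa) rows.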
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.39424 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGACG CTAAAAAACA GGGGCGGTCA AACAAGGCAA TGACGTTTTT CGTCTGCTTC CTTGCCGCTC TGGCGGGATT ACTCTTTGGC CTGGATATCG GTGTAATTGC TGGCGCACTG CCGTTTATTG CAGATGAATT CCAGATTACT TCGCACACGC AAGAATGGGT CGTAAGCTCC ATGATGTTCG GTGCGGCAGT CGGTGCGGTG GGCAGCGGCT GGCTCTCCTT TAAACTCGGG CGCAAAAAGA GCCTGATGAT CGGCGCAATT TTGTTTGTTG CCGGTTCGCT GTTCTCTGCG GCTGCGCCAA ACGTTGAAGT ACTGATTCTT TCCCGCGTTC TGCTGGGGCT GGCGGTGGGT GTGGCCTCTT ATACCGCACC ACTGTACCTC TCTGAAATTG CGCCGGAAAA AATTCGCGGC AGTATGATCT CTATGTATCA GTTGATGATC ACTATCGGAA TCCTCGGTGC TTATCTTTCT GATACCGCCT TCAGCTACAC CGGTGCATGG CGCTGGATGC TGGGTGTGAT TATCATCCCG GCAATTTTGC TGCTGATTGG TGTCTTCTTC CTGCCAGACA GCCCACGTTG GTTTGCCGCC AAACGCCGTT TTGTTGATGC CGAACGCGTG CTGCTACGCC TGCGTGACAC CAGCGCGGAA GCGAAACGCG AACTGGATGA AATCCGTGAA AGTTTGCAGG TTAAACAGAG TGGCTGGGCG CTGTTTAAAG AGAATAGCAA CTTCCGCCGC GCGGTGTTCC TTGGCGTACT GTTACAGGTA ATGCAGCAAT TCACCGGGAT GAACGTCATC ATGTATTACG CGCCGAAAAT CTTCGAACTG GCGGGTTATA CCAACACCAC CGAGCAAATG TGGGGGACAG TGATTGTCGG CCTGACCAAC GTACTTGCCA CCTTTATCGC AATCGGCCTT GTTGACCGCT GGGGACGTAA ACCAACGCTA ACGCTGGGCT TCCTGGTGAT GGCTGCTGGT ATGGGCGTAC TCGGTACAAT GATGCATATC GGTATCCACT CTCCGTCGGC GCAGTATTTC GCCATCGCCA TGCTGCTGAT GTTTATTGTC GGTTTTGCCA TGAGTGCCGG TCCGCTGATT TGGGTACTGT GCTCCGAAAT TCAGCCGCTG AAAGGCCGCG ATTTTGGCAT CACCTGCTCC ACCGCCACCA ACTGGATTGC CAACATGATC GTTGGCGCAA CGTTCCTGAC CATGCTCAAC ACGCTGGGTA ACGCCAACAC CTTCTGGGTG TACGCGGCTC TGAACGTACT GTTTATCCTG CTGACATTGT GGCTGGTACC GGAAACCAAA CACGTTTCGC TGGAACATAT TGAACGTAAT CTGATGAAAG GTCGTAAACT GCGCGAAATC GGCGCTCACG ATTAA
|
Protein sequence | MPDAKKQGRS NKAMTFFVCF LAALAGLLFG LDIGVIAGAL PFIADEFQIT SHTQEWVVSS MMFGAAVGAV GSGWLSFKLG RKKSLMIGAI LFVAGSLFSA AAPNVEVLIL SRVLLGLAVG VASYTAPLYL SEIAPEKIRG SMISMYQLMI TIGILGAYLS DTAFSYTGAW RWMLGVIIIP AILLLIGVFF LPDSPRWFAA KRRFVDAERV LLRLRDTSAE AKRELDEIRE SLQVKQSGWA LFKENSNFRR AVFLGVLLQV MQQFTGMNVI MYYAPKIFEL AGYTNTTEQM WGTVIVGLTN VLATFIAIGL VDRWGRKPTL TLGFLVMAAG MGVLGTMMHI GIHSPSAQYF AIAMLLMFIV GFAMSAGPLI WVLCSEIQPL KGRDFGITCS TATNWIANMI VGATFLTMLN TLGNANTFWV YAALNVLFIL LTLWLVPETK HVSLEHIERN LMKGRKLREI GAHD
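The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It is not part of the original record. It assumes the sequence is given 5' to 3' on the coding strand, and it builds the codon table from the standard NCBI amino-acid string, whose codon assignments translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The helper names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (T, C, A, G); shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter groups."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups and the last 15 bases of the gene sequence above.
print(translate("ATGCCTGACG CTAAAAAACA GGGGCGGTCA"))  # MPDAKKQGRS
print(translate("GGCGCTCACGATTAA"))                   # GAHD
```

Passing the full Gene sequence string to `translate` and `gc_percent` lets the result be compared against the Protein sequence and GC content rows.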