Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03951 |
Symbol | melA |
ID | 8114188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 4247080 |
End bp | 4248435 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644850104 |
Product | hypothetical protein |
Protein accession | YP_003001677 |
Protein GI | 251787373 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGACTG CACCCAAAAT TACATTTATC GGCGCTGGTT CGACGATTTT CGTTAAAAAT ATTCTTGGTG ATGTGTTCCA TCGCGAGGCG CTGAAAACGG CGCATATTGC CCTGATGGAC ATTGATCCCA CCCGCCTGGA AGAGTCGCAT ATTGTGGTGC GTAAGCTGAT GGATTCAGCA GGGGCCAGCG GCAAAATCAC CTGCCACACC CAACAGAAAG AAGCCTTACA GGATGCCGAT TTTGTCGTGG TGGCATTTCA GATTGGCGGT TATGAACCTT GCACGGTGAC TGATTTCGAG GTCTGTAAGC GGCATGGTCT GGAACAAACC ATTGCCGATA CGTTGGGGCC GGGCGGTATT ATGCGCGCGC TACGTACCAT TCCGCATCTG TGGCAAATTT GCGAGGACAT GACGGAAGTC TGCCCCGATG CCACCATGCT CAACTATGTT AACCCAATGG CGATGAATAC CTGGGCGATG TATGCCCGCT ATCCGCATAT CAAACAGGTC GGGCTGTGCC ATTCGGTGCA GGGAACGGCG GAAGAGCTGG CGCGTGACCT CAATATCGAC CCGGCTACGC TGCGTTACCG TTGCGCAGGT ATCAACCATA TGGCGTTTTA CCTGGAGCTG GAGCGCAAAA CCGCCGACGG CAGTTACGTG AATCTCTACC CGGAACTGCT GGCGGCTCAT GACGCAGGGC AGGCACCGAA GCCGAATATT CACGGCAATA CTCGCTGCCA GAATATTGTG CGCTATGAAA TGTTCAAAAA GCTGGGCTAC TTCGTCACGG AATCGTCAGA ACATTTTGCT GAGTACACAC CGTGGTTTAT TAAGCCAGGT CGTGAGGATT TGATTGAGCG TTATAAAGTA CCGCTGGATG AGTACCCGAA ACGCTGCGTC GAGCAGCTGG CGAACTGGCA TAAAGAGCTG GAGGAGTATA AAAACGCCTC CCGGATTGAT ATTAAACCGT CACGGGAATA TGCCAGCACA ATCATGAACG CTATCTGGAC TGGCGAGCCG AGTGTGATTT ACGGCAACGT CCGTAACGAT GGTTTGATTG ATAACCTGCC ACAAGGATGT TGCGTGGAAG TAGCCTGTCT GGTTGATGCT AATGGCATTC AGCCGACCAA AGTCGGTACG CTACCTTCGC ATCTGGCCGC CCTGATGCAA ACCAACATCA ACGTACAGAC GCTGCTGACC GAAGCCATTC TTACGGAAAA TCGCGACCGT GTTTACCACG CCGCGATGAT GGACCCGCAT ACTGCCGCCG TGCTGGGCAT TGATGAAATA TATGCTCTTG TTGACGACCT GATTGCCGCC CACGGCGACT GGCTGCCAGG CTGGTTGCAC CGTTAA
|
Protein sequence | MMTAPKITFI GAGSTIFVKN ILGDVFHREA LKTAHIALMD IDPTRLEESH IVVRKLMDSA GASGKITCHT QQKEALQDAD FVVVAFQIGG YEPCTVTDFE VCKRHGLEQT IADTLGPGGI MRALRTIPHL WQICEDMTEV CPDATMLNYV NPMAMNTWAM YARYPHIKQV GLCHSVQGTA EELARDLNID PATLRYRCAG INHMAFYLEL ERKTADGSYV NLYPELLAAH DAGQAPKPNI HGNTRCQNIV RYEMFKKLGY FVTESSEHFA EYTPWFIKPG REDLIERYKV PLDEYPKRCV EQLANWHKEL EEYKNASRID IKPSREYAST IMNAIWTGEP SVIYGNVRND GLIDNLPQGC CVEVACLVDA NGIQPTKVGT LPSHLAALMQ TNINVQTLLT EAILTENRDR VYHAAMMDPH TAAVLGIDEI YALVDDLIAA HGDWLPGWLH R
|
| |