Gene Information |
Locus tag | EcHS_A4360 |
Symbol | melA |
ID | 5594347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4365932 |
End bp | 4367287 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640923458 |
Product | alpha-galactosidase |
Protein accession | YP_001460903 |
Protein GI | 157163585 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
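The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming 1-based inclusive coordinates (the convention that reproduces the listed gene length) and a single untranslated stop codon at the end of the CDS. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(4365932, 4367287)
print(length, protein_length(length))  # 1356 451
```

Both results match the Gene Length and Protein Length rows.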
Plasmid Coverage information |
Num covering plasmid clones | 52 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGATGACTG CACCCAAAAT TACATTTATC GGCGCTGGTT CGACGATTTT CGTTAAAAAT ATTCTTGGTG ATGTGTTCCA TCGCGAGGCG CTGAAAACGG CGCATATTGC CCTGATGGAC ATTGATCCCA CCCGCCTGGA AGAGTCGCAT ATTGTGGTGC GTAAGCTGAT GGATTCAGCA GGGGCCAGCG GCAAAATCAC CTGCCACACC CAACAGAAAG AAGCCTTAGA GGATGCCGAT TTTGTCGTGG TGGCATTTCA GATTGGCGGT TATGAACCTT GCACGGTGAC TGATTTCGAG GTCTGTAAGC GGCATGGTCT GGAACAAACC ATTGCCGATA CGTTGGGGCC GGGCGGTATT ATGCGCGCGC TACGTACCAT TCCGCATCTG TGGCAAATTT GCGAGGACAT GACGGAAGTC TGCCCCGATG CCACCATGCT CAACTATGTT AACCCAATGG CGATGAATAC CTGGGCGATG TATGCCCGCT ATCCGCATAT CAAACAGGTC GGGCTGTGCC ATTCGGTGCA GGGAACGGCG GAAGAGCTGG CGCGTGACCT CAATATCGAC CCAGCTACGC TGCGTTACCG TTGCGCAGGT ATCAACCATA TGGCGTTTTA CCTGGAGCTG GAGCGCAAAA CCGCCGACGG CAGTTATGTG AATCTCTACC CGGAACTGCT GGCGGCTTAT GAAGCAGGGC AGGCACCGAA GCCGAATATT CATGGCAATA CTCGCTGCCA GAATATTGTG CGCTACGAAA TGTTCAAAAA GCTGGGCTAC TTCGTCACGG AATCGTCAGA ACATTTTGCT GAGTACACAC CGTGGTTTAT TAAGCCAGGT CGTGAGGATT TGATTGAGCG TTATAAAGTA CCGCTGGATG AGTACCCGAA ACGCTGCGTC GAGCAGCTGG CGAACTGGCA TAAAGAGCTG GAGGAGTATA AAAACGCCTC CCGGATTGAT ATTAAACCGT CACGGGAATA TGCCAGCACA ATCATGAACG CTATCTGGAC TGGCGAGCCG AGTGTGATTT ACGGCAACGT CCGTAACGAT GGTTTGATTG ATAACCTGCC ACAAGGATGT TGCGTGGAAG TAGCCTGTCT GGTTGATGCT AATGGCATTC AGCCGACCAA AGTCGGTACG CTACCTTCGC ATCTGGCCGC CCTGATGCAA ACCAACATCA ACGTACAGAC GCTGCTGACC GAAGCTATTC TTACGGAAAA TCGCGACCGT GTTTACCACG CCGCGATGAT GGACCCGCAT ACTGCTGCCG TGCTGGGCAT TGATGAAATA TATGCTCTTG TTGACGACCT GATTGCCGCC CACGGCGACT GGCTGCCAGG CTGGTTGCAC CGTTAA
Protein sequence | MMTAPKITFI GAGSTIFVKN ILGDVFHREA LKTAHIALMD IDPTRLEESH IVVRKLMDSA GASGKITCHT QQKEALEDAD FVVVAFQIGG YEPCTVTDFE VCKRHGLEQT IADTLGPGGI MRALRTIPHL WQICEDMTEV CPDATMLNYV NPMAMNTWAM YARYPHIKQV GLCHSVQGTA EELARDLNID PATLRYRCAG INHMAFYLEL ERKTADGSYV NLYPELLAAY EAGQAPKPNI HGNTRCQNIV RYEMFKKLGY FVTESSEHFA EYTPWFIKPG REDLIERYKV PLDEYPKRCV EQLANWHKEL EEYKNASRID IKPSREYAST IMNAIWTGEP SVIYGNVRND GLIDNLPQGC CVEVACLVDA NGIQPTKVGT LPSHLAALMQ TNINVQTLLT EAILTENRDR VYHAAMMDPH TAAVLGIDEI YALVDDLIAA HGDWLPGWLH R
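The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The sketch below is hedged: it builds the codon table by hand instead of using a bioinformatics library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table (the two differ in which codons may act as start codons). Handling of alternative start codons is omitted because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order for each codon position.
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGATGACTG CACCCAAAAT TACATTTATC"))  # MMTAPKITFI
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` should give the complete protein in the same way.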