Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0166 |
Symbol | hemL |
ID | 6147468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 183797 |
End bp | 185077 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615067 |
Product | glutamate-1-semialdehyde aminotransferase |
Protein accession | YP_001742283 |
Protein GI | 170680366 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0001] Glutamate-1-semialdehyde aminotransferase |
TIGRFAM ID | [TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAAGT CTGAAAATCT TTACAGCGCA GCGCGCGAGC TGATCCCTGG CGGTGTGAAC TCCCCTGTTC GCGCCTTTAC TGGCGTGGGC GGCACTCCAC TGTTTATCGA AAAAGCGGAC GGCGCTTATC TGTACGATGT TGATGGCAAA GCCTATATCG ATTATGTCGG TTCCTGGGGG CCGATGGTGC TGGGCCATAA CCATCCGGCG ATCCGCAATG CCGTGATTGA AGCCGCCGAG CGTGGTTTAA GCTTTGGTGC ACCAACCGAA ATGGAAGTGA AAATGGCGCA ACTGGTGACC GAACTGGTCC CGACCATGGA TATGGTGCGC ATGGTGAACT CCGGCACTGA AGCGACGATG AGCGCCATCC GCCTGGCCCG TGGTTTTACC GGTCGCGACA AAATTATTAA ATTTGAAGGT TGTTACCACG GTCACGCTGA CTGCCTGCTG GTGAAAGCCG GTTCTGGCGC ACTCACGTTA GGCCAGCCAA ATTCGCCGGG CGTTCCGGCA GATTTCGCCA AACATACCTT AACCTGTACT TATAACGATC TGGCCTCTGT ACGCGCCGCG TTTGAGCAAT ACCCGCAAGA GATTGCCTGT ATTATCGTCG AGCCGGTGGC AGGCAATATG AACTGCGTTC CACCGCTGCC AGAGTTCCTG CCAGGTCTGC GCGCGCTGTG CGACGAATTT GGCGCGTTGC TGATCATCGA TGAAGTGATG ACCGGTTTCC GCGTAGCGCT AGCTGGCGCA CAGGATTATT ACGGCGTAGT GCCAGATTTA ACCTGCCTCG GCAAAATCAT CGGCGGTGGA ATGCCGGTAG GCGCATTCGG TGGTCGTCGT GATGTAATGG ATGCGCTGGC CCCGACTGGT CCGGTATATC AGGCGGGTAC GCTTTCCGGT AACCCGATTG CGATGGCAGC CGGTTTCGCC TGTCTGAATG AAGTCGCACA GCCGGGCGTT CACGAAACGC TGGATGAGCT GACAACACGT CTGGCAGAAG GTCTGCTGGA AGCGGCAGAA GAAGCCGGAA TTCCGCTGGT GGTAAACCAC GTTGGCGGCA TGTTCGGTAT TTTCTTTACC GACGCCGAGT CCGTGACGTG CTATCAGGAT GTGATGGCCT GTGACGTGGA ACGCTTTAAG CGTTTCTTCC ATATGATGCT GGACGAAGGT GTTTACCTGG CACCGTCAGC GTTTGAAGCG GGCTTTATGT CCGTGGCGCA TAGCATGGAA GATATCAATA ACACCATCGA TGCTGCACGT CGGGTGTTTG CGAAGTTGTA A
|
Protein sequence | MSKSENLYSA ARELIPGGVN SPVRAFTGVG GTPLFIEKAD GAYLYDVDGK AYIDYVGSWG PMVLGHNHPA IRNAVIEAAE RGLSFGAPTE MEVKMAQLVT ELVPTMDMVR MVNSGTEATM SAIRLARGFT GRDKIIKFEG CYHGHADCLL VKAGSGALTL GQPNSPGVPA DFAKHTLTCT YNDLASVRAA FEQYPQEIAC IIVEPVAGNM NCVPPLPEFL PGLRALCDEF GALLIIDEVM TGFRVALAGA QDYYGVVPDL TCLGKIIGGG MPVGAFGGRR DVMDALAPTG PVYQAGTLSG NPIAMAAGFA CLNEVAQPGV HETLDELTTR LAEGLLEAAE EAGIPLVVNH VGGMFGIFFT DAESVTCYQD VMACDVERFK RFFHMMLDEG VYLAPSAFEA GFMSVAHSME DINNTIDAAR RVFAKL
|
| |