Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3148 |
Symbol | glcB |
ID | 5593708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3159268 |
End bp | 3161439 |
Gene Length | 2172 bp |
Protein Length | 723 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640922268 |
Product | malate synthase G |
Protein accession | YP_001459766 |
Protein GI | 157162448 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01345] malate synthase G |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 62 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAAA CCATAACCCA GAGCCGTTTA CGCATTGACG CCAATTTTAA ACGTTTTGTG GATGAAGAAG TTTTACCGGG AACAGGGCTG GACGCTGCGG CGTTCTGGCG CAATTTTGAT GAGATCGTTC ATGATCTGGC ACCAGAAAAT CGTCAGTTGC TGGCAGAACG CGATCGCATT CAGGCAGCGC TTGATGAGTG GCATCGCAGC AATCCGGGGC CGGTAAAAGA TAAAGCGGCC TATAAATCTT TCCTGCGTGA ACTGGGCTAC CTGGTGCCGC AACCGGAGCG CGTGACGGTG GAAACCACGG GCATTGACAG CGAAATCACC AGCCAGGCGG GGCCGCAGCT GGTGGTTCCG GCAATGAACG CCCGCTACGC GCTGAACGCG GCGAACGCTC GCTGGGGCTC ACTGTACGAT GCGTTATACG GCAGCGACAT CATCCCGCAG GAAGGGGCGA TGGTCAGCGG CTACGATCCG CAACGCGGTG AGCAGGTTAT CGCCTGGGTT CGGCGTTTCC TCGATGAATC TCTACCGCTG GAAAACGGCA GCTATCAGGA TGTGGTGGCG TTTAAGGTGG TTGATAAACA ATTACGCATC CAGTTGAAAA ATGGTAAAGA AACCACGTTA CGTACTCCAG CACAGTTTGT CGGTTACCGT GGCGATGCCG CTGCGCCGAC CTGCATTTTG CTGAAAAATA ACGGCCTGCA TATTGAGCTG CAAATCGATG CCAATGGGCG GATTGGCAAA GACGATCCGG CGCACATCAA CGATGTTATC GTCGAAGCTG CTATCAGTAC CATTCTCGAC TGCGAAGATT CGGTCGCGGC GGTTGATGCG GAAGATAAAA TCCTGCTGTA CCGCAACCTG CTGGGCCTGA TGCAGGGGAC TCTGCAAGAG AAAATGGAGA AAAACGGTCG GCAAATCGTG CGTAAACTGA ATGACGATCG TCATTACACC GCCGCCGATG GCTCTGAAAT TTCTCTGCAC GGACGCTCGC TGCTGTTTAT CCGCAACGTG GGTCATTTGA TGACCATTCC TGTGATTTGG GACAGCGAAG GCAATGAAAT CCCGGAAGGC ATTCTTGATG GCGTCATGAC TGGCGCGATT GCCCTCTATG ATTTAAAAGT GCAGAAAAAC TCGCGCACTG GCAGCGTCTA TATTGTGAAA CCGAAAATGC ACGGTCCGCA GGAAGTGGCG TTCGCCAACA AACTGTTTAC CCGCATTGAG ACAATGCTCG GTATGGCACC GAATACCCTG AAAATGGGCA TTATGGATGA AGAACGTCGG ACCTCGCTGA ACTTGCGTAG CTGTATCGCT CAGGCGCGCA ACCGCGTGGC GTTCATCAAT ACCGGTTTCC TCGACCGTAC CGGCGATGAA ATGCATTCGG TGATGGAAGC TGGCCCGATG CTGCGTAAAA ATCAGATGAA ATCGACGCCT TGGATCAAAG CCTACGAGCG TAATAACGTG CTTTCCGGTC TGTTCTGTGG GCTGCGCGGT AAAGCGCAAA TTGGTAAAGG CATGTGGGCA ATGCCGGACC TGATGGCAGA CATGTACAGC CAGAAGGGCG ACCAACTGCG TGCCGGGGCA AACACAGCCT GGGTTCCGTC ACCAACCGCT GCTACGCTCC ATGCGCTGCA CTACCACCAA ACCAACGTAC AGAGCGTACA AGCCAACATT GCCCAGACCG AGTTCAATGC TGAATTTGAA CCGCTGCTGG ACGATCTGCT GACTATTCCG GTTGCTGAAA ACGCTAACTG GTCGGCGCAA GAGATCCAAC AAGAGCTGGA TAACAACGTG CAGGGGATTC TGGGGTACGT GGTGCGCTGG GTGGAGCAGG GGATTGGTTG TTCAAAAGTG CCGGATATTC ACAATGTGGC GTTGATGGAA GACCGCGCAA CGCTGCGTAT CTCCAGCCAG CATATCGCCA ACTGGTTACG TCACGGTATT CTGACCAAAG AACAGGTGCA GGCGTCGCTG GAGAATATGG CGAAAGTGGT TGATCAGCAA AACGCTGGCG ATCCGGCTTA TCGTCCGATG GCGGGGAATT TCGCTAACTC GTGTGCTTTT AAAGCTGCCA GCGATTTAAT CTTCCTCGGC GTGAAACAGC CAAACGGCTA TACCGAACCG TTATTACACG CCTGGCGTTT ACGCGAAAAA GAAAGTCATT AA
|
Protein sequence | MSQTITQSRL RIDANFKRFV DEEVLPGTGL DAAAFWRNFD EIVHDLAPEN RQLLAERDRI QAALDEWHRS NPGPVKDKAA YKSFLRELGY LVPQPERVTV ETTGIDSEIT SQAGPQLVVP AMNARYALNA ANARWGSLYD ALYGSDIIPQ EGAMVSGYDP QRGEQVIAWV RRFLDESLPL ENGSYQDVVA FKVVDKQLRI QLKNGKETTL RTPAQFVGYR GDAAAPTCIL LKNNGLHIEL QIDANGRIGK DDPAHINDVI VEAAISTILD CEDSVAAVDA EDKILLYRNL LGLMQGTLQE KMEKNGRQIV RKLNDDRHYT AADGSEISLH GRSLLFIRNV GHLMTIPVIW DSEGNEIPEG ILDGVMTGAI ALYDLKVQKN SRTGSVYIVK PKMHGPQEVA FANKLFTRIE TMLGMAPNTL KMGIMDEERR TSLNLRSCIA QARNRVAFIN TGFLDRTGDE MHSVMEAGPM LRKNQMKSTP WIKAYERNNV LSGLFCGLRG KAQIGKGMWA MPDLMADMYS QKGDQLRAGA NTAWVPSPTA ATLHALHYHQ TNVQSVQANI AQTEFNAEFE PLLDDLLTIP VAENANWSAQ EIQQELDNNV QGILGYVVRW VEQGIGCSKV PDIHNVALME DRATLRISSQ HIANWLRHGI LTKEQVQASL ENMAKVVDQQ NAGDPAYRPM AGNFANSCAF KAASDLIFLG VKQPNGYTEP LLHAWRLREK ESH
|
| |