Gene Information |
Locus tag | EcSMS35_3253 |
Symbol | glcB |
ID | 6146304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3329745 |
End bp | 3331916 |
Gene Length | 2172 bp |
Protein Length | 723 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618083 |
Product | malate synthase G |
Protein accession | YP_001745233 |
Protein GI | 170681765 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01345] malate synthase G |
|
|
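The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3329745, 3331916)
print(length)                  # 2172
print(protein_length(length))  # 723
```

Both results match the Gene Length (2172 bp) and Protein Length (723 aa) reported in the table.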
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAAA CCATAACCCA GGGCCGTTTA CGCATTGACG CCAATTTTAA ACGTTTTGTG GATGAAGAAG TTTTGCCTGG CGTGGAACTG GATGCTGCCG CGTTCTGGCA CAATGTTGAT GAGATCGTTC ACGATCTGGC GCCAGAGAAT CGTCAGTTGC TGGCAGAGCG CGATCGCATT CAGGCGGTGC TTGATGAGTG GCATCGCAGC AATCCGGGGC CGGTAAAAGA TAAAGCGGCC TATAAATCTT TCCTGCGTGA ACTGGGCTAC CTGGTGCCGC AACCGGAGCG CGTGACGGTG GAAACCACGG GTATTGACAG CGAAATCACC AGCCAGGCGG GGCCGCAGCT GGTGGTTCCG GCAATGAACG CCCGCTACGC GCTGAACGCG GCGAACGCTC GCTGGGGCTC ACTATACGAT GCGTTATACG GCAGCGACAT CATCCCGCAG GAAGGGGCGA TGGTCAGCGG CTACGATCCG CAACGCGGTG CGCAGGTTAT CGCCTGGGTT CGGCGTTTCC TCGATGAATC TCTACCGCTG GAAAACGGCA GCTATCAGGA TGTGGTGGCG TTTAAGGTGG TCGATAAACA ATTACGCATC CAGTTGAAAA ATGGTAAAGA AACCACGTTA CGTACCCCGG CGCAGTTTGT CGGTTACCGT GGCGATGCCG CTGCGCCGAC CTGCATTTTG CTGAAAAATA ACGGCCTGCA TATTGAACTG CAAATTGATG CCAACGGGCG GATTGGCAAA GACGATCCGG CGCACATCAA CGATGTTATC GTCGAAGCGG CCATCAGTAC CATTCTCGAC TGCGAAGATT CGGTCGCGGC GGTTGATGCG GAAGATAAAA TCCTGCTGTA CCGCAACCTG CTGGGCCTGA TGCAGGGGAC TCTGCAAGAG AAAATGGAGA AAAACGGTCG GCAAATCGTG CGTAAACTGA ATGACGATCG TCATTACACC GCCGCCGATG GCTCTGAAAT TTCTCTGCAC GGACGCTCGC TGCTGTTTAT CCGCAACGTG GGTCATTTGA TGACCATTCC TGTGATTTGG GACAGCGAAG GCAATGAAAT CCCGGAAGGC ATTCTTGATG GCGTCATGAC TGGCGCGATT GCCCTCTATG ATTTAAAAGT GCAGAAAAAC TCGCGCACTG GCAGCGTCTA TATTGTGAAA CCGAAAATGC ACGGTCCGCA GGAAGTGGCG TTCGCCAACA AACTGTTTAC CCGCATTGAG ACAATGCTCG GTATGGCACC GAATACCCTG AAAATGGGCA TTATGGATGA AGAACGTCGG ACCTCGCTGA ACTTGCGTAG CTGTATCGCT CAGGCGCGCA ACCGCGTGGC GTTCATCAAT ACCGGTTTCC TCGACCGTAC CGGCGATGAA ATGCATTCGG TGATGGAAGC TGGCCCGATG CTGCGTAAAA ATCAGATGAA ATCGACGCCT TGGATCAAAG CCTACGAGCG TAATAACGTG CTTTCCGGTC TGTTCTGTGG GCTGCGCGGT AAAGCGCAAA TTGGTAAAGG CATGTGGGCA ATGCCGGACC TGATGGCAGA CATGTACAGC CAGAAGGGCG ACCAACTGCG TGCCGGGGCA AACACAGCCT GGGTTCCGTC ACCAACCGCT GCTACGCTCC ATGCGCTGCA CTACCACCAA ACCAACGTAC AGAGCGTACA AGCCAACATT GCCCAGACCG AGTTCAATGC TGAATTTGAA CCGCTGCTGG ACGATCTGCT GACTATTCCG GTTGCTGAAA ACGCTAACTG GTCGGCACAA GAGATCCAAC AAGAGCTGGA TAACAACGTG 
CAGGGGATTC TGGGGTACGT GGTGCGCTGG GTGGAGCAGG GGATTGGTTG TTCAAAAGTG CCGGATATTC ACAATGTGGC GCTGATGGAA GACCGCGCAA CGCTGCGCAT CTCCAGCCAG CATATCGCCA ACTGGTTACG TCACGGTATT TTGACTAAAG AACAGGTGCA GGCGTCGCTG GAGAATATGG CGAAAGTGGT TGATCAGCAA AACGCTGGCG ATCCGGCTTA TCGTCCGATG GCGGGGAATT TCGCTAACTC GAGTGCTTTT AAAGCTGCCA GCGATTTAAT CTTCCTCGGC GTGAAACAGC CAAACGGTTA TACCGAACCG TTATTACACG CCTGGCGTTT ACGCGAAAAA GAAAGTCATT AA
|
Protein sequence | MSQTITQGRL RIDANFKRFV DEEVLPGVEL DAAAFWHNVD EIVHDLAPEN RQLLAERDRI QAVLDEWHRS NPGPVKDKAA YKSFLRELGY LVPQPERVTV ETTGIDSEIT SQAGPQLVVP AMNARYALNA ANARWGSLYD ALYGSDIIPQ EGAMVSGYDP QRGAQVIAWV RRFLDESLPL ENGSYQDVVA FKVVDKQLRI QLKNGKETTL RTPAQFVGYR GDAAAPTCIL LKNNGLHIEL QIDANGRIGK DDPAHINDVI VEAAISTILD CEDSVAAVDA EDKILLYRNL LGLMQGTLQE KMEKNGRQIV RKLNDDRHYT AADGSEISLH GRSLLFIRNV GHLMTIPVIW DSEGNEIPEG ILDGVMTGAI ALYDLKVQKN SRTGSVYIVK PKMHGPQEVA FANKLFTRIE TMLGMAPNTL KMGIMDEERR TSLNLRSCIA QARNRVAFIN TGFLDRTGDE MHSVMEAGPM LRKNQMKSTP WIKAYERNNV LSGLFCGLRG KAQIGKGMWA MPDLMADMYS QKGDQLRAGA NTAWVPSPTA ATLHALHYHQ TNVQSVQANI AQTEFNAEFE PLLDDLLTIP VAENANWSAQ EIQQELDNNV QGILGYVVRW VEQGIGCSKV PDIHNVALME DRATLRISSQ HIANWLRHGI LTKEQVQASL ENMAKVVDQQ NAGDPAYRPM AGNFANSSAF KAASDLIFLG VKQPNGYTEP LLHAWRLREK ESH
|
| |
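The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation such as recomputing the GC content. The following is a rough sketch with hypothetical helper names. It is demonstrated only on the first three blocks of the gene sequence, not on the full record, so it makes no claim about reproducing the reported 54% figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three 10-base blocks of the gene sequence shown above.
fragment = clean_sequence("ATGAGTCAAA CCATAACCCA GGGCCGTTTA")
print(len(fragment))  # 30
print(round(gc_percent(fragment), 1))
```

Passing the complete gene sequence through `clean_sequence` and then `gc_percent` would give a value to compare against the GC content field.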