Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1680 |
Symbol | gadA |
ID | 6145213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1680092 |
End bp | 1681492 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616556 |
Product | glutamate decarboxylase GadA |
Protein accession | YP_001743734 |
Protein GI | 170680042 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0076] Glutamate decarboxylase and related PLP-dependent proteins |
TIGRFAM ID | [TIGR01788] glutamate decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAGA AGCAAGTAAC GGATTTAAGG TCGGAACTAC TCGATTCACG TTTTGGTGCA AAGTCTATTT CCACTATCGC AGAATCAAAA CGTTTTCCGC TGCACGAAAT GCGCGACGAT GTCGCATTTC AGATTATCAA TGACGAATTA TATCTTGATG GCAACGCTCG TCAGAACCTG GCCACTTTCT GCCAGACCTG GGACGACGAA AACGTCCACA AGTTGATGGA TTTATCCATT AACAAAAACT GGATCGACAA AGAAGAATAT CCGCAATCCG CAGCCATCGA CCTGCGTTGC GTAAACATGG TTGCCGATCT GTGGCATGCG CCTGCGCCGA AAAATGGTCA GGCCGTTGGC ACCAACACCA TTGGTTCTTC CGAGGCCTGT ATGCTTGGCG GGATGGCGAT GAAATGGCGT TGGCGCAAGC GTATGGAAGC TGCAGGCAAA CCAACGGATA AACCAAACCT GGTGTGCGGT CCGGTGCAAA TCTGCTGGCA TAAATTCGCC CGCTACTGGG ATGTGGAGTT GCGTGAGATC CCTATGCGCC CCGGTCAGTT GTTTATGGAC CCGAAACGCA TGATTGAAGC CTGCGACGAA AATACCATCG GCGTGGTGCC GACTTTCGGC GTGACCTACA CCGGTAACTA TGAGTTCCCG CAGCCGCTGC ACGATGCGCT GGATAAATTC CAGGCCGACA CCGGTATCGA CATCGACATG CACATCGACG CCGCCAGCGG TGGCTTCCTG GCACCGTTCG TCGCCCCGGA TATCGTCTGG GACTTCCGCC TGCCGCGTGT GAAATCGATC AGTGCTTCAG GCCATAAATT CGGTCTGGCT CCGCTGGGCT GCGGCTGGGT TATCTGGCGT GATGAAGAAG CGCTGCCGCA GGAACTGGTG TTCAACGTTG ACTACCTCGG CGGTCAGATT GGGACTTTCG CCATCAACTT CTCCCGCCCG GCGGGTCAGG TGATTGCACA GTACTATGAA TTCCTGCGCC TCGGTCGTGA AGGCTATACC AAAGTACAGA ACGCTTCCTA CCAGGTTGCC GCTTATCTGG CGGATGAAAT CGCCAAACTG GGACCGTATG AGTTTATCTG TACCGGTCGC CCGGACGAAG GCATCCCGGC GGTTTGCTTC AAACTGAAAG ATGGTGAAGA TCCGGGATAC ACCCTCTACG ACCTCTCTGA ACGTCTGCGT CTGCGCGGCT GGCAGGTTCC GGCCTTCACT CTCGGCGGTG AAGCCACTGA CATCGTGGTG ATGCGCATTA TGTGTCGTCG CGGCTTCGAA ATGGACTTTG CTGAACTGTT GCTGGAAGAC TACAAAGCCT CCCTGAAATA TCTCAGCGAT CACCCGAAAC TGCAGGGTAT TGCCCAGCAG AACAGCTTTA AACATACCTG A
|
Protein sequence | MDKKQVTDLR SELLDSRFGA KSISTIAESK RFPLHEMRDD VAFQIINDEL YLDGNARQNL ATFCQTWDDE NVHKLMDLSI NKNWIDKEEY PQSAAIDLRC VNMVADLWHA PAPKNGQAVG TNTIGSSEAC MLGGMAMKWR WRKRMEAAGK PTDKPNLVCG PVQICWHKFA RYWDVELREI PMRPGQLFMD PKRMIEACDE NTIGVVPTFG VTYTGNYEFP QPLHDALDKF QADTGIDIDM HIDAASGGFL APFVAPDIVW DFRLPRVKSI SASGHKFGLA PLGCGWVIWR DEEALPQELV FNVDYLGGQI GTFAINFSRP AGQVIAQYYE FLRLGREGYT KVQNASYQVA AYLADEIAKL GPYEFICTGR PDEGIPAVCF KLKDGEDPGY TLYDLSERLR LRGWQVPAFT LGGEATDIVV MRIMCRRGFE MDFAELLLED YKASLKYLSD HPKLQGIAQQ NSFKHT
|
| |