Gene EcSMS35_1680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1680 
SymbolgadA 
ID6145213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1680092 
End bp1681492 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content54% 
IMG OID641616556 
Productglutamate decarboxylase GadA 
Protein accessionYP_001743734 
Protein GI170680042 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0076] Glutamate decarboxylase and related PLP-dependent proteins 
TIGRFAM ID[TIGR01788] glutamate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAGA AGCAAGTAAC GGATTTAAGG TCGGAACTAC TCGATTCACG TTTTGGTGCA 
AAGTCTATTT CCACTATCGC AGAATCAAAA CGTTTTCCGC TGCACGAAAT GCGCGACGAT
GTCGCATTTC AGATTATCAA TGACGAATTA TATCTTGATG GCAACGCTCG TCAGAACCTG
GCCACTTTCT GCCAGACCTG GGACGACGAA AACGTCCACA AGTTGATGGA TTTATCCATT
AACAAAAACT GGATCGACAA AGAAGAATAT CCGCAATCCG CAGCCATCGA CCTGCGTTGC
GTAAACATGG TTGCCGATCT GTGGCATGCG CCTGCGCCGA AAAATGGTCA GGCCGTTGGC
ACCAACACCA TTGGTTCTTC CGAGGCCTGT ATGCTTGGCG GGATGGCGAT GAAATGGCGT
TGGCGCAAGC GTATGGAAGC TGCAGGCAAA CCAACGGATA AACCAAACCT GGTGTGCGGT
CCGGTGCAAA TCTGCTGGCA TAAATTCGCC CGCTACTGGG ATGTGGAGTT GCGTGAGATC
CCTATGCGCC CCGGTCAGTT GTTTATGGAC CCGAAACGCA TGATTGAAGC CTGCGACGAA
AATACCATCG GCGTGGTGCC GACTTTCGGC GTGACCTACA CCGGTAACTA TGAGTTCCCG
CAGCCGCTGC ACGATGCGCT GGATAAATTC CAGGCCGACA CCGGTATCGA CATCGACATG
CACATCGACG CCGCCAGCGG TGGCTTCCTG GCACCGTTCG TCGCCCCGGA TATCGTCTGG
GACTTCCGCC TGCCGCGTGT GAAATCGATC AGTGCTTCAG GCCATAAATT CGGTCTGGCT
CCGCTGGGCT GCGGCTGGGT TATCTGGCGT GATGAAGAAG CGCTGCCGCA GGAACTGGTG
TTCAACGTTG ACTACCTCGG CGGTCAGATT GGGACTTTCG CCATCAACTT CTCCCGCCCG
GCGGGTCAGG TGATTGCACA GTACTATGAA TTCCTGCGCC TCGGTCGTGA AGGCTATACC
AAAGTACAGA ACGCTTCCTA CCAGGTTGCC GCTTATCTGG CGGATGAAAT CGCCAAACTG
GGACCGTATG AGTTTATCTG TACCGGTCGC CCGGACGAAG GCATCCCGGC GGTTTGCTTC
AAACTGAAAG ATGGTGAAGA TCCGGGATAC ACCCTCTACG ACCTCTCTGA ACGTCTGCGT
CTGCGCGGCT GGCAGGTTCC GGCCTTCACT CTCGGCGGTG AAGCCACTGA CATCGTGGTG
ATGCGCATTA TGTGTCGTCG CGGCTTCGAA ATGGACTTTG CTGAACTGTT GCTGGAAGAC
TACAAAGCCT CCCTGAAATA TCTCAGCGAT CACCCGAAAC TGCAGGGTAT TGCCCAGCAG
AACAGCTTTA AACATACCTG A
 
Protein sequence
MDKKQVTDLR SELLDSRFGA KSISTIAESK RFPLHEMRDD VAFQIINDEL YLDGNARQNL 
ATFCQTWDDE NVHKLMDLSI NKNWIDKEEY PQSAAIDLRC VNMVADLWHA PAPKNGQAVG
TNTIGSSEAC MLGGMAMKWR WRKRMEAAGK PTDKPNLVCG PVQICWHKFA RYWDVELREI
PMRPGQLFMD PKRMIEACDE NTIGVVPTFG VTYTGNYEFP QPLHDALDKF QADTGIDIDM
HIDAASGGFL APFVAPDIVW DFRLPRVKSI SASGHKFGLA PLGCGWVIWR DEEALPQELV
FNVDYLGGQI GTFAINFSRP AGQVIAQYYE FLRLGREGYT KVQNASYQVA AYLADEIAKL
GPYEFICTGR PDEGIPAVCF KLKDGEDPGY TLYDLSERLR LRGWQVPAFT LGGEATDIVV
MRIMCRRGFE MDFAELLLED YKASLKYLSD HPKLQGIAQQ NSFKHT