Gene GM21_3351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3351 
Symbol 
ID8138718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3877753 
End bp3878772 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content63% 
IMG OID644870969 
Productaspartate-semialdehyde dehydrogenase 
Protein accessionYP_003023134 
Protein GI253701945 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0136] Aspartate-semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01296] aspartate-semialdehyde dehydrogenase (peptidoglycan organisms) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value0.0830053 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAC TTTGGAATGT GGCAGTGGTA GGCGCGACCG GCGCGGTCGG AACCCAGATG 
ATCGAGTGCC TGGAGGAGCG GAAGTTCCCG GTGGGAAAGA TAAAGTTCCT GGCCAGCGCC
CGCAGCGCAG GGAAGGTCCT TGAGTTCAAC GGCAAGCCCG TGCCGGTGGA AGAGCTGAAA
CACGACTCCT TCGAGGGGAT CGACATTGCC CTCTTCTCCG CAGGGGGCGC GCGCTCCGAG
GAGTTCTGCC CCTCCGCCGC CAAGGCTGGC GCTGTCTGCA TCGACAACTC CAGCGCCTGG
CGCATGGACC CGGAGGTGCC GTTGGTGGTC CCCGAGGTGA ACCCCCACGC GCTTGCCGGC
TACCGCAAGA AGGGAATCGT CGCCAACCCC AACTGCTCCA CCATCCAGAT GGTGGTCGCC
TTGAAGCCCC TGCACGACTT CGGGTCCATC AAGCGGATCG TCGTCTCCAC CTACCAGGCG
GTTTCCGGCA CCGGCAACAA GGCGATCGAC GAGCTGCGCA AGCAGACCGG AGAGCTTTTG
AACGGGCGGC CGCCCAAGAA CGAGGTCTAT CCGCACCGGA TCGCCTTCAA CTGCCTGCCG
CAAATCGATT CCTTCTGCGA CAACGGTTAC ACCAAGGAAG AGATGAAGAT GGTGAACGAG
ACCCGGAAGA TCATGGAGGC GGACATCAAG ACCACCGCCA CCTGCGTCAG GGTTCCCGTC
TTCTACGGGC ATTCCGAGTC GGTGAACGTA GAGACCGCGA AGAAGATCAC CGTGGCCAAG
GCGCGCGAGC TATTGGAAGA CGCGCCCGGC GTGGAACTGG TCGACAACCC CGCCAACGGC
GAGTATCCGA TGGCGATGGA CGCCGCGGGC GAGGACCTGA CCCTCGTAGG TCGCATCCGC
GAGGACGCCA CCGTCGCCAA CGGACTCAAC CTCTGGATCG TCGCCGACAA CCTCAGGAAG
GGCGCCGCCA CTAACGCAGT GCAGATCGCG GAGCTGCTGG TGGATGAGTA CCTGAAGTAA
 
Protein sequence
MKKLWNVAVV GATGAVGTQM IECLEERKFP VGKIKFLASA RSAGKVLEFN GKPVPVEELK 
HDSFEGIDIA LFSAGGARSE EFCPSAAKAG AVCIDNSSAW RMDPEVPLVV PEVNPHALAG
YRKKGIVANP NCSTIQMVVA LKPLHDFGSI KRIVVSTYQA VSGTGNKAID ELRKQTGELL
NGRPPKNEVY PHRIAFNCLP QIDSFCDNGY TKEEMKMVNE TRKIMEADIK TTATCVRVPV
FYGHSESVNV ETAKKITVAK ARELLEDAPG VELVDNPANG EYPMAMDAAG EDLTLVGRIR
EDATVANGLN LWIVADNLRK GAATNAVQIA ELLVDEYLK