Gene GM21_1231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1231 
Symbol 
ID8136556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1436997 
End bp1438307 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content61% 
IMG OID644868845 
Producthomoserine dehydrogenase 
Protein accessionYP_003021050 
Protein GI253699861 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.000207213 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGAGA TAAAGATTGG CCTTCTCGGT TTCGGCACCA TCGGTGCCGG TGTGGTAAAG 
CTTCTTGCCG CGAACAGCGC GCTTTTGACG GAGAAAACCG GCACCCGGAT TTCCCTGAAG
AGCATCGTCG ACCTCGACAT CACCACCGAC CGCGGCGTCA GCACCGAAGG GATCATTCTC
ACCCGCAACG CCGAAGAAGT GCTGACCGAT CCGGAAATCT CCGTAATCAT CGAGCTGATC
GGCGGCTACG AACCCGCCCG CAGCTTCGTC CTTAAGGCCA TCTCTCACGG CAAGCACGTG
GTAACCGCCA ATAAGGCACT GCTGGCTGTT CACGGCGAGG AAATATACGC TGCCGCCAAG
GCAAACGGCG TCGAAATCCT TTTCGAGGCT GCCGTGGGCG GGGGCATCCC GGTCCTTTCC
GCCATCAAGG GGAACCTGGC GGGGAACCGT TTCTCGACCG TGCTCGGCAT CGTGAACGGC
ACCTGCAACT ACATCCTGAC CCGGATGACC CATGATGGAG CGGACTTCTC CGAAGTGCTG
AAGAGCGCCC AGGAATTAGG CTACGCCGAA GCCGACCCCA CCTTCGATAT AGAGGGCGTC
GACACGGCGC ACAAACTCTG CCTGCTGCTC TCCCTTTGTT TCGGAACCAA GATCGACCTG
AAAGACATCT ATTCCGAAGG GATCACCTCT ATTTCCTCGG AAGACGTCGA CTTCGCCAAA
TCCTTCGGCT ATCGCATCAA GCTTTTGGCC ATAGGGAAGA TGGATGGCGG GCAGATCGAG
GCGCGCGTGC ATCCGACCAT GATCCCGATC GACTACCCGC TGGCCGACGT GCACGGCGTG
TTCAACGCCA TCAGGCTCAC CGGCGACTTC ATCGGCCCGG TGATGTTCTA TGGCAGGGGC
GCGGGGATGG ACGCAACCGC GAGCGCCGTA GTAGGCGACG TCATCGACCT TTCCCGGAGC
ATGGGGGCCG GCATCTCCCG CCGCTGCGCG CCGCTTGGGT ACCTCGACGA GAAAGTCAGC
ACCCTGCCGA TCAAGCCCAT GGGCGAGATC GTGAGCAAGT ACTACATCCG TTTCCAGGCG
CTGGATCGTC CCGGCGTGCT GGCGCGCATT GCTGGAGCCC TGGGGGCCAG CGGCATCAGC
ATCGCCTCCA TGCTGCAGAG CGCCAGAAGC GCGAGCGAAA TAGTTCCCAT CGTCATCATG
ACCCACGAGG CGCGCGAAGC AGACGTGCGC CGGGCGCTTG CCGAGATCGA CACCTACGAG
GTCATCCGCG GCAAGAGCAC CTTCATCAGG ATCGAGGACA ACCTCGAATA A
 
Protein sequence
MKEIKIGLLG FGTIGAGVVK LLAANSALLT EKTGTRISLK SIVDLDITTD RGVSTEGIIL 
TRNAEEVLTD PEISVIIELI GGYEPARSFV LKAISHGKHV VTANKALLAV HGEEIYAAAK
ANGVEILFEA AVGGGIPVLS AIKGNLAGNR FSTVLGIVNG TCNYILTRMT HDGADFSEVL
KSAQELGYAE ADPTFDIEGV DTAHKLCLLL SLCFGTKIDL KDIYSEGITS ISSEDVDFAK
SFGYRIKLLA IGKMDGGQIE ARVHPTMIPI DYPLADVHGV FNAIRLTGDF IGPVMFYGRG
AGMDATASAV VGDVIDLSRS MGAGISRRCA PLGYLDEKVS TLPIKPMGEI VSKYYIRFQA
LDRPGVLARI AGALGASGIS IASMLQSARS ASEIVPIVIM THEAREADVR RALAEIDTYE
VIRGKSTFIR IEDNLE