Gene Mlg_2726 details


Gene Information

Locus tag: Mlg_2726
Symbol:
ID: 4270980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 3093025
End bp: 3094545
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 63%
IMG OID: 638127488
Product: aldehyde dehydrogenase
Protein accession: YP_743556
Protein GI: 114321873
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
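
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3093025, 3094545)
print(length)                     # 1521
print(protein_length_aa(length))  # 506
```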


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTACG CAAACCCCGG TGAGCGCGAC GCGAAAGTCC AGTTCAAACC CCGCTACGGA 
AACTTCATCA ATGGCGAGTG GGTAGAACCG GCCAGTGGCC AGTATTTTGA GAACATCACC
CCGGTAACGG GAGAAGTATT CTGCGAAGTC GCCCGGTCCA ATGCCGATGA CGTGGAACGC
GCCCTGGACG CGGCCCACGC CGCCAAAGAT GCCTGGGGGA AAGCGTCGGT GACCGAGCGT
GCCAACGTGC TGCTGAAGAT CGCCGATCGC ATGGAGGCCA ACTTGGAGCG CCTGGCGGTG
GCCGAGACCT GGGACAACGG GAAGCCCGTG CGTGAAACCC TCAACGCCGA TCTCCCGTTG
GCCATTGATC ACTTCCGCTA CTTCGCCGGG GCCATCCGGG CCCAGGAAGG CGGTATCAGC
GAGATCGACC ACGACACCAT CGCCTACCAC TTTCATGAGC CGCTGGGCGT GGTGGGGCAA
ATCATCCCCT GGAACTTCCC GCTGCTGATG GCCACCTGGA AGATCGCCCC GGCCCTGGCT
TGTGGCAACT GCATCGTGCT CAAGCCCGCC GAGCAGACAC CGGCGTCCAT CCTGGTGTTG
ATGGAGTGCA TCCAGGACGT GCTGCCCCCG GGGGTGCTGA ACGTGGTGAA CGGCTTCGGT
GTCGAGGCCG GCAAGCCGCT GGCCACCAGC AACCGCATCG CCAAGGTGGC GTTCACCGGC
GAGACCACCA CCGGGCGGCT GATCATGCAG TACGCCGCCG AGAACATCAT CCCGGTAACC
CTGGAGCTGG GCGGCAAATC GCCGAACATC TTCATGGCCG ACGTGATGGA CCAGGATGAC
GACTTCCTGG ACAAGGCCAT TGAGGGGATG ACCCTGGCGT GCCTGAACCA GGGCGAGGTC
TGCACCTGTC CCTCGCGCGC GCTGATCCAG GAGGACATCT ACGACGATTT CATTGCCAAG
GTGATCGACA GGTTCAGCAT GGTGAAACAG GGCAACCCGC TGGACACCGA AACCATGATC
GGGGCCCAGG CCTCCTCCGA GCAGATGGAG AAGATCCTGA GCTACATGGA CATCGGTCGG
CAGGAAGGCG CCGAGTGCCT CATCGGCGGC GACCGCGCCG AGATCGGCGG TGATTTCCAG
AACGGCTACT ACGTGCAGCC GACCCTGTTC CGCGGCCACA ACAAGATGCG CATCTTCCAG
GAGGAGATCT TCGGCCCGGT GGTTTCCGTC ACCACTTTCA AGGACGAGGC CGAGGCGCTG
GAGCTGGCCA ACGACACCCT CTACGGCCTG GGGGCCGGTC TCTGGAGCCG CAGCGCACAC
ACCACCTACC GCATGGGCCG GGCCATCCAG GCCGGCCGGG TGTGGACCAA CTGCTACCAC
CTGTACCCGG CCCATGCCGC CTTTGGTGGT TACAAGCAGT CCGGCATCGG GCGGGAGAAC
CACCAGATGA TGCTGGAGCA CTACCAGCAG ACCAAGAACC TGCTGGTGAG CTACAGCCCC
AAGGCCATGG GCTTTTTCTA A
 
Protein sequence
MIYANPGERD AKVQFKPRYG NFINGEWVEP ASGQYFENIT PVTGEVFCEV ARSNADDVER 
ALDAAHAAKD AWGKASVTER ANVLLKIADR MEANLERLAV AETWDNGKPV RETLNADLPL
AIDHFRYFAG AIRAQEGGIS EIDHDTIAYH FHEPLGVVGQ IIPWNFPLLM ATWKIAPALA
CGNCIVLKPA EQTPASILVL MECIQDVLPP GVLNVVNGFG VEAGKPLATS NRIAKVAFTG
ETTTGRLIMQ YAAENIIPVT LELGGKSPNI FMADVMDQDD DFLDKAIEGM TLACLNQGEV
CTCPSRALIQ EDIYDDFIAK VIDRFSMVKQ GNPLDTETMI GAQASSEQME KILSYMDIGR
QEGAECLIGG DRAEIGGDFQ NGYYVQPTLF RGHNKMRIFQ EEIFGPVVSV TTFKDEAEAL
ELANDTLYGL GAGLWSRSAH TTYRMGRAIQ AGRVWTNCYH LYPAHAAFGG YKQSGIGREN
HQMMLEHYQQ TKNLLVSYSP KAMGFF
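
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch that translates space-grouped nucleotide text and computes GC content. It is an illustration under stated assumptions, not the method used to produce this record. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which start codons are permitted; this sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The helper names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table built from the compact NCBI layout (base order T, C, A, G).
# The amino acid assignments are shared by translation tables 1 and 11.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def _clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases for display."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = _clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = _clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGATCTACG CAAACCCCGG TGAGCGCGAC"))  # MIYANPGERD
print(translate("AAGGCCATGG GCTTTTTCTA A"))           # KAMGFF
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_percent` can be compared against the GC content field.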