Gene Mfla_1995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMfla_1995 
Symbol 
ID4000805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacillus flagellatus KT 
KingdomBacteria 
Replicon accessionNC_007947 
Strand
Start bp2133187 
End bp2134707 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content54% 
IMG OID637938912 
Productlactaldehyde dehydrogenase 
Protein accessionYP_546103 
Protein GI91776347 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000162834 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTGG ATAGCTTGAA AGGCCTAGGT ATTAAGGTTC CGTACAAAAA TAAGTATGAC 
AATTATATTG GAGGGCAATG GGTTGCGCCG GTGGATGGCG AGTACTTCGA CAACATCAGC
CCGGTGACTG GCAAGGTATT CTGCCAGGTG CCGCGATCGA ATGAAAAGGA TATCAACCTG
GCCTTGGATG CCGCACATGC GGCCAAGGAG GCCTGGGCCA GGACTTCAGG GACCGGACGT
GCCAATATCC TCCTCAAGAT TGCCGATGTG ATGGAAGCCA ACCTGGAAAC TATCGCGATT
GCGGAGACCA TCGATAACGG AAAGCCCATC CGTGAAACCA TGGCTGCCGA CATTCCGCTG
GCGATCGACC ATTTCCGTTA TTTCGCTAGT TGCATCCGTA CCCAGGAAGG CGGTATCAGC
GAAATCGATC ATGAGACTGT GGCTTATCAC TTCCATGAGC CGCTGGGTGT GGTCGGCCAG
ATCATTCCAT GGAACTTCCC TATTTTGATG GCAGCATGGA AGCTGGCCCC AGCGCTGGCT
GCGGGCAATT GTATCGTCAT GAAACCTGCT GAGCAGACGC CGGCGTCGAT TTTGGTTGTC
ATGGAGCTGA TAGGCGATCT ATTGCCTCCC GGCGTGCTGA ATGTGGTGAA TGGCCATGGC
GTCGAGGCGG GAAAGGCGCT TGCAACCAGT CCGCGTATCG CCAAGATTGC CTTCACCGGC
TCCACCTCCG TTGGCCGACT GATCATGCAA TACGCGAGCC AGAACTTGAT TCCGGTGACA
CTCGAGCTTG GCGGCAAGTC TCCCAATATC TTCTTTGAAG ATGTGATGGA TAAGGACGAT
GCCTATTTCG ATAAGGCACT GGAAGGTTTT ACCCTGTTCG CATTGAACCA AGGCGAAATC
TGCACCTGTC CTAGCCGTGC ACTGATCCAG GAGTCCATCT ACGAACAGTT CATCGAGCGT
GCGATCAAGC GTGTCAAGGC GATCAAACAG GGCAGTCCAC TGGACAAGTC CACAATGATC
GGGGCGCAGG CTTCCAGCGA GCAGGTAGAG ATCATCATGT CCTATATCAA GTTGGGACTG
GAAGAGGGCG CACAGTTGCT GACCGGCGGA AATGCAACCA AGCTGCCAGG CGACCTGAGC
GAAGGCTACT ATATCGAGCC AACCGTGTTC AAGGGCCACA ACAAGATGCG CATTTTCCAG
GAAGAAATCT TCGGGCCGGT TTTGTCTGTT ACGACTTTCA AGGACGAGAA GGAAGCGCTG
GAAATCGCCA ATGACACCAT GTACGGCCTG GGTGCAGGTT TATGGACCCG TGACGGCAGC
CGTGCTTACC GTGTTGGTCG CGGGATCCAG GCCGGCCGCG TCTGGACCAA TTGCTACCAC
TTGTATCCAG CGCACGCTGC ATTCGGCGGT TATAAGCAGT CCGGCATTGG CCGCGAGAAC
CACCGCATGA TGCTGGATCA TTACCAGCAA ACCAAGAATC TCCTGGTCAG TTACAGCCCC
AATGCATTGG GCTTTTTCTA G
 
Protein sequence
MSLDSLKGLG IKVPYKNKYD NYIGGQWVAP VDGEYFDNIS PVTGKVFCQV PRSNEKDINL 
ALDAAHAAKE AWARTSGTGR ANILLKIADV MEANLETIAI AETIDNGKPI RETMAADIPL
AIDHFRYFAS CIRTQEGGIS EIDHETVAYH FHEPLGVVGQ IIPWNFPILM AAWKLAPALA
AGNCIVMKPA EQTPASILVV MELIGDLLPP GVLNVVNGHG VEAGKALATS PRIAKIAFTG
STSVGRLIMQ YASQNLIPVT LELGGKSPNI FFEDVMDKDD AYFDKALEGF TLFALNQGEI
CTCPSRALIQ ESIYEQFIER AIKRVKAIKQ GSPLDKSTMI GAQASSEQVE IIMSYIKLGL
EEGAQLLTGG NATKLPGDLS EGYYIEPTVF KGHNKMRIFQ EEIFGPVLSV TTFKDEKEAL
EIANDTMYGL GAGLWTRDGS RAYRVGRGIQ AGRVWTNCYH LYPAHAAFGG YKQSGIGREN
HRMMLDHYQQ TKNLLVSYSP NALGFF