Gene Mlg_1675 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1675 
Symbol 
ID4268907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1915502 
End bp1918426 
Gene Length2925 bp 
Protein Length974 aa 
Translation table11 
GC content67% 
IMG OID638126433 
Productmolybdopterin oxidoreductase Fe4S4 region 
Protein accessionYP_742511 
Protein GI114320828 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCA CCGCGGCATC GCAGGCGGCC GGTCCCCGGC TGCCCGAGTT CGAGCCCCAT 
GACCGCCAGG AGGTCAAGAC CACCACCTGC TACATGTGCG CCTGCCGCTG CGGCATCAAG
GTGACGCTGG AGGAGGGCCA GGTCCGCTTC ATCCAGGGCA ACCCCGACCA CCCGGTCAAC
CGGGGTGCCC TTTGCGCCAA GGGCAATGCC GGCATCATGA AGCAGTACTC CCCGGCCAAG
CTCAGCAAGC CGCTGATGCG CAAGCCGGGC AGCGAACGCG GCAAGGGCGA TTTCGAAGAG
GTCTCCTGGG ACAAGGCGCT GGACGTGCTC GCACAGCGGC TCGCCGAGAT CCGGCGCACG
GACCCGAAGA AGCTCGCCTA CTTCACCGGC CGCGACCAGA TGCAGGCATT GACCGGCCTC
TGGGCAGCCC AATTCGGCAC GGTCAACTGG GCCGCCCACG GCGGGTTCTG CTCGGTCAAC
ATGGCCGCCG CCGGGCTCTA CACCATGGGC CATGCCTTCT GGGAGTTCGG CGATCCGGAC
TGGGAGCGCA CCAAGTACCT GATGCTTTGG GGGGTGGCCG AGGACCACGG CTCCAATCCC
TTCAAGATCG GCATCGGCAA GCTCAAGGGC CGCGGCGGCC GCTTCGTCGC CATCAATCCG
GTCCGCACCG GCTACCAGGC GGTCGCGGAT GAATGGGTGC CCATCCGCCC CGGCACCGAT
GGCATGCTCG CCATGGCGAT GATCCACGTG CTGTTGCGCG AGGAGCAGTT CGACTGGGAA
TACCTGATCC GCTACACCAA CGCCCACCAC CTGGTGGTGC AGACCCCCGG CGAGCCGGGC
CACGGGCTGC TGCTGCGCGA CGAGAACGAC AAGCCACTGA GCTGGGACCT GGAGCGCGAG
GCGTTCGTCG ACGCCACCCA ACCCGACATC GCCCCCGCCC TGTTCGGCGA CTACCAGGCG
CCCGATGGCC GCCCGGTGAA GACCGTGATG AGCCTGATGG CGGAACGCTA CCTGAGCGAG
GACTACAGCC CGGAGCGCGC CGCCGAGGTC TGCGGGGTGA GCGCCGACAC CATCGAGCGC
CTGGCGCTGG AACTGGCCCA CGTGGCCTTC AAAGAGAGCA TCGAGATCGA ATGCGAATGG
ACCGACTGGG CGGGTCGTAA GCACGACCGC ATCATCGGGC GCCCGGTCTC CATGCACGCG
ATGCGCGGGA TCTCCGCGCA CTCCAACGGT TTTCAGGCCT GCCGGGCCAT CCACCTGCTG
CAGATCCTGC TCGGCTCGGT GGATGTACCC GGGGGGCATC GGGCCAAACC GCCCTACCCC
AAGCCGGTGC CACCGCCCAT CAAACCCGCG CGACCGACGG CGCCGGGGGA ACCGCTGGCC
GGCCCGCCGC TGGGCTTCCC CAAGGCCCCG GAGGATCTAC TGATCGACGA CCAGGGCAAC
CCGCTGCGCC TGGACAAGGC CTACTCCTGG GAGGCGCCCC TGGCCAACCA CGGCATGATG
CACATGGTGA TCACCAATGC CGTGAAGGGC GACCCCTACC CCATCGACAC GCTGATGCTG
TTCATGGCCA ACATGGCCTG GAACTCCACC ATGAACACCT CCGGCGTGCT GAAGATGCTG
CGCGAGAAGA ACGGCAACGG GGAGTACAAG ATCCCCTTCC TGGTGGTGGT GGACGCCTTC
CACTCGGAGA CCGTGCAGTA CGCCGACCTG GTCCTGCCTG ACACCACCTA TCTGGAGCGC
CACGACGTAA TCTCCATGCT CGACCGGCCC ATCTCCGAGC CGGACGGGCC CGCGGACGCG
ATCCGCCAGC CGGTGGTGGA GCCGGACCGC GACGTGCGCC CCTGGCAGGA GGTGATGATC
GACCTGGCCG GCCGGCTCGG GCTGCCGGCG TTCACCAACG AGGACGGCAG CCCCAAGTAC
AGCGGCTATC TCGACTTCAT CACCCGCTTC GAGAAGGCAC CCGGCGTGGG CTTTCTCGCC
GGCTGGCGCG GCAAGGAGGG GGACCAGGAC CTGCGCGGTG AACCCAATCC GGACCAGTGG
ACGCGCTACA AGGAGAATGG CTGCTTCTAC AAGTACGAGC TGCCGCTCTC CCACCAGTTC
TACAAGTTCG CCAACAAGGG CTACCTGGAG TGGGCCCGCG ATGCCGGCCT CAACGGCAGC
GCCGATCCCA TCGTCTTCGA ACTCTACTCC GAGACCCTGC AGCGGTTCCG GCTCGCGGGC
CAGGGGCTCT ACGACGGCCC TTGTCCCACT GACCCGGAGG ACCGTCAGCG CCTGAGCAGC
TACTTCGATC CGCTGCCCTT CCACTACACC CCGCTGGAGG AGACCCGTTG CGACGGCGAG
GCCTACCCCT TCCACGCGGT CAACCAGCGC CCCATGTTCA TGTACCACTC CTGGGACAGC
CAGAACGCCT GGCTTCGTCA ACTGCAGGCC TATAACCGGC TGCACATCAA CCGGGCCCAG
GGTGAGCGCA TGGGCCTGGC CGATGACGAC TGGGTCTGGG TGGAGTCGCA CCTGGGCCGC
ATCCGGGTCC AGATCCGGCT CATGGAAGGC GTGCAGGAGA ACACCGTCTG GACCTGGAAT
GCCATTGCCA AGCAATCCGG CGCCTGGGGA CTGGATCCCG AGGCCCCGGA GGCGCGCCAG
GGGTTTCTCA TGAACCACCT GATCTCCGAG CTGCTGCCCA AGCGCAACGG CGGACGGGAC
ATCACTAACT CCGATCCCAT CACCGGCCAG GCCGCCTGGT ACGACCTGCG GGTCCGCATC
CGCAAGGCCG AGCCGGGCGA GACCGGCAGC TGGCCGCAGT TCGAGACCTT GCAACTCATC
CCCGGCGTGC AACGCGCGCC CGCCGATGAG CTGCGCTACG CCGTACACGA ACCGGTGCGC
CTGCACCGTT CCATGCACGA CATACTCAAC CGGAGAGGGG CATGA
 
Protein sequence
MSTTAASQAA GPRLPEFEPH DRQEVKTTTC YMCACRCGIK VTLEEGQVRF IQGNPDHPVN 
RGALCAKGNA GIMKQYSPAK LSKPLMRKPG SERGKGDFEE VSWDKALDVL AQRLAEIRRT
DPKKLAYFTG RDQMQALTGL WAAQFGTVNW AAHGGFCSVN MAAAGLYTMG HAFWEFGDPD
WERTKYLMLW GVAEDHGSNP FKIGIGKLKG RGGRFVAINP VRTGYQAVAD EWVPIRPGTD
GMLAMAMIHV LLREEQFDWE YLIRYTNAHH LVVQTPGEPG HGLLLRDEND KPLSWDLERE
AFVDATQPDI APALFGDYQA PDGRPVKTVM SLMAERYLSE DYSPERAAEV CGVSADTIER
LALELAHVAF KESIEIECEW TDWAGRKHDR IIGRPVSMHA MRGISAHSNG FQACRAIHLL
QILLGSVDVP GGHRAKPPYP KPVPPPIKPA RPTAPGEPLA GPPLGFPKAP EDLLIDDQGN
PLRLDKAYSW EAPLANHGMM HMVITNAVKG DPYPIDTLML FMANMAWNST MNTSGVLKML
REKNGNGEYK IPFLVVVDAF HSETVQYADL VLPDTTYLER HDVISMLDRP ISEPDGPADA
IRQPVVEPDR DVRPWQEVMI DLAGRLGLPA FTNEDGSPKY SGYLDFITRF EKAPGVGFLA
GWRGKEGDQD LRGEPNPDQW TRYKENGCFY KYELPLSHQF YKFANKGYLE WARDAGLNGS
ADPIVFELYS ETLQRFRLAG QGLYDGPCPT DPEDRQRLSS YFDPLPFHYT PLEETRCDGE
AYPFHAVNQR PMFMYHSWDS QNAWLRQLQA YNRLHINRAQ GERMGLADDD WVWVESHLGR
IRVQIRLMEG VQENTVWTWN AIAKQSGAWG LDPEAPEARQ GFLMNHLISE LLPKRNGGRD
ITNSDPITGQ AAWYDLRVRI RKAEPGETGS WPQFETLQLI PGVQRAPADE LRYAVHEPVR
LHRSMHDILN RRGA