Gene Mlg_1894 details


Gene Information

Locus tag: Mlg_1894
Symbol:
ID: 4270094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2159300
End bp: 2160802
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 69%
IMG OID: 638126650
Product: FAD linked oxidase domain-containing protein
Protein accession: YP_742728
Protein GI: 114321045
COG category: [C] Energy production and conversion
COG ID: [COG0277] FAD/FMN-containing dehydrogenases
TIGRFAM ID: [TIGR00387] glycolate oxidase, subunit GlcD
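
The start, end, gene length and protein length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; both are assumptions that happen to reproduce the values in this record, not something the record states. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2159300, 2160802)
print(length, protein_length(length))  # prints: 1503 500
```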


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.140191
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCGC ACTCCGTATA CCGCGAGGAG GACTGCCGCT TCGAGTACGA TCGCGAGGAC 
CTGATCGCCC GCCTGGGCGG GGTTCTGGAC GACCCCGAAG CGCTGATCAC CAACGAAGAG
GCCCTACGGG CCTACGAGAC CGACGGTCTG GCCGCTTACC GCCAGCTGCC GGTAGCAGCG
GTCCTGCCGG ACACGGTGGA GCAGGTGCAG GCCATCCTGC GCATCTGCCA TGAGCTGGAG
GTGCCGGTGG TGGCCCGTGG TGCGGGCACC AGCCTCTCCG CCGGCGCCCT GCCCCACCCC
CAGGGTATTC TCCTCAGCCT GTCCCGCTTC AACCGCATTC TGGAGGTGGA CGCCGAACGG
CGCATTGCGC GGGTCCAGCC CGGGGTGCGC AACCTGGCGG TGTCCGAGGC CGCGGCCCCC
TACGGGCTTT ACTACGCCCC GGACCCCTCC TCGCAAATTG CCTGCAGCAT CGGCGGCAAC
GTGGCGGAGA ACGCCGGCGG GGTGCACTGC CTGAAGTATG GCCTGACCAT CCACAACGTC
CTCGAGGCCA CCCTGATCAC CATCGACGGC GATGTCATCA AGGTGGGCAG CGAGGCGCCG
GACGCCCCCG GATACGACCT GCTGGCGGCG GTGATCGGCT CCGAGGGCAT GCTCGGCGTG
GTGGTGGAGG TGGCGGTCAA GCTGCTGCCC GAGCCGTTGA CCAAGAAGGT GATGCTGGCC
GCCTTCCCCA CCGTCGAGGC CGGCGGCGAG GCGGTGGCCG GGATCATCGG CGACGGCATC
ATCCCCGGTG GCCTGGAGAT GATGGACAAC GCCGCCATCC GCGCCGCCGA GGACTTCGTC
CACGCCGGCT ACCCGGTGGA TGCCGCCACC ATCCTGATCT GCGAGCTGGA CGGCAGTGAG
GCGGAGGTGG CCGCCCAGTG CGACCGGGTG CGCAAGCTGA TGGAGCGCTA CGGGGCCACC
GAAATCCGCA TCGCCGAGAC CCCCGAACAG GCGCAGCGCT TCTGGGCCGG GCGCAAGGCG
GCCTTTCCCG CGGTGGGCCG TATCTCCCCC GACTACTACT GCATGGACGG CACCATCCCA
CGCAAGCACC TGGGCACGGT GCTCAAGCGC ATGCAGGCCC TCTCCGAGCA GTACGGCCTG
CGGGTGGTGA ACGTCTTTCA CGCCGGTGAC GGCAACCTGC ACCCGCTGGT GCTCTACGAC
GGCAACGTCC CGGGCGAGCT GGAACGCACC GAGGAGCTGG GCGGGCGCAT CCTGGAGTTG
TGTGTGGAGG TCGGCGGCAC GGTCACCGGC GAGCACGGCG TGGGCATGGA GAAGCTCGAC
CAGATGTGCG TGCAGTTCAA CAAGGCGGAG CGCGAGCAGT TCTTCGCCCT CAAGCGCGCC
TTCGATCCCA AGGGGCTGCT CAACCCCGGC AAGGCCATCC CCACCCTGCA CCGCTGCGCC
GAGTTCGGCG CCATGCACGT GCACCACGGC GAACTGCCCT TCCCGGACAT CGAGCGCTTC
TGA
 
Protein sequence
MSAHSVYREE DCRFEYDRED LIARLGGVLD DPEALITNEE ALRAYETDGL AAYRQLPVAA 
VLPDTVEQVQ AILRICHELE VPVVARGAGT SLSAGALPHP QGILLSLSRF NRILEVDAER
RIARVQPGVR NLAVSEAAAP YGLYYAPDPS SQIACSIGGN VAENAGGVHC LKYGLTIHNV
LEATLITIDG DVIKVGSEAP DAPGYDLLAA VIGSEGMLGV VVEVAVKLLP EPLTKKVMLA
AFPTVEAGGE AVAGIIGDGI IPGGLEMMDN AAIRAAEDFV HAGYPVDAAT ILICELDGSE
AEVAAQCDRV RKLMERYGAT EIRIAETPEQ AQRFWAGRKA AFPAVGRISP DYYCMDGTIP
RKHLGTVLKR MQALSEQYGL RVVNVFHAGD GNLHPLVLYD GNVPGELERT EELGGRILEL
CVEVGGTVTG EHGVGMEKLD QMCVQFNKAE REQFFALKRA FDPKGLLNPG KAIPTLHRCA
EFGAMHVHHG ELPFPDIERF
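
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the Python below strips that block formatting, translates the DNA codon by codon, and computes the GC fraction. It uses the standard codon assignments, which translation table 11 shares; table 11's alternative start codons are not handled, which is acceptable here only because this gene begins with ATG. The names `clean`, `translate` and `gc_content` are hypothetical helpers, not part of any tool used to build this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGAGCGCGC ACTCCGTATA CCGCGAGGAG"
print(translate(first_blocks))  # prints: MSAHSVYREE
```

Passing the full gene sequence to `translate` and `gc_content` in the same way gives values that can be compared against the protein sequence and the GC content field of the record.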