Gene Mlg_1074 details

Gene Information

Locus tag: Mlg_1074
Symbol:
ID: 4268996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1252757
End bp: 1254679
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 11
GC content: 60%
IMG OID: 638125826
Product: nitrous-oxide reductase
Protein accession: YP_741916
Protein GI: 114320233
COG category: [C] Energy production and conversion
COG ID: [COG4263] Nitrous oxide reductase
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
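
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; the record does not state either convention. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon that is not translated.
    return gene_length // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1252757, 1254679)
    print(length, protein_length_aa(length))  # prints "1923 640"
```

Under these assumptions, 1254679 - 1252757 + 1 = 1923 bp, and 1923 / 3 - 1 = 640 aa, matching the reported values.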


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.198434
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGAGA TGGACACAAA GACCACCACG GTGGACACCA CCAACAGTGA GGAATCGGCG 
GGCCTGCGGA ACCCGAGCCG TCGTAAGTTC CTCGGTACCA CGGCTGCCGT TGGGGCGGTC
AGCGCCGCGG GTGCCGCCGG CACCGGTGCG CTGGTTCGCT CCGGTGAGGT ACAGGCAGCC
GACCAATCCA TTCTAGAGAA GATCCACGTA GGGCCGGGGG ATCTCGACGA GTACTACGGC
TTCTGGAGCG GCGGTCACAA CGGCGAGGTT CGGGTTTACG GCGTGCCGTC GATGCGGGAG
ATCATGCGCA TCCCGGTATT CAATGTCTGT TCCGCAACCG GCTATGGCAT CAGTAACGAG
AGCAAGCGGA TTCTGGGCGA AAGCTCTAAG TTCCTGAATG GCGACGCCCA CCACCCGCAT
ATCAGTTACA CCGATGGCAA GCACGACGGC CGGTACCTGT TTATCAACGA CAAGGCCAAT
ACCCGCGTCG CCCGGGTCCG GCTTGACATC ATGAAGACGG ACAAGGTGAC CACCATCCCG
AACGCCCAGG CGATCCACGG CCTGCGGCTG CAGAAGGCGC CGAAAACCGG CTATGTCTAC
TGCAACGGTG AGATGATCAT TCCGTTGCCC AACGACGGCA CCGAGCTGGA GGATCCCAAG
AAGCACTTCT CCGTCTGGTC CGCGTTGGAT TCTGAGACCA TGGAGGTGGC CTGGCAGGTG
CTGGTGGATG GCAACCTGGA CAACATGGAC TCCAGCTATT GTGGGAAGTA CGGGGCATCC
ACCTGCTACA ACTCCGAGCA CGGCTTCACG CTTGAGGAGA AGATGCGGGC GGAGCGCGAC
CATGTGGTCA TCTTCAATAT CCCGCGCATC GAGGAGGCCG TCGAGAATGG CGAATACATG
ACCTTCGGTG ACAATGGCGT CCCGGTGGTC GATGGCCGAA AGGGTTCTCA GCTTACCCGG
TACATCCCGG TGCCCCGTAA CCCCCACGGC CTGAACGGCT CCACCGACGG CAAGTACTTC
ATTGCCAACG GCAAGCTGTC GCCCACGGTC ACCATGATCG ACATTTCCCG TTTGGATGAC
CTGTTCGATG ACAAAATCGA GCCGCGGGAT GCGGTCGCCG GCGAGCCGGA ACTGGGTCTG
GGGCCACTGC ATACTACTTT TGACGGGCGC GGCAACGCCT ACACTACGCT GTTCATCGAC
AGCCAGGTGG TCAAGTGGAA CATGGAGGAT GCGGTCCGCC ACTTCCAGGG CGAGAACGTC
AACTACATCC GGCAGAAGTT GAACGTGCAC TACCAGCCGG GCCACCTCAA GGCCACGCTG
GCCGAATCCA GTGAAGCGGA CGGCAAGTGG TTGTTCTCCC TTAATAAGTT CTCGAAGGAC
CGCTTCCTTC CCGTGGGGCC GCTGCACCCG GAGAACGATC AGATGATCGA CATCTCCGGC
GAGGAGATGA AGCTGGTCCA CGATACCCCG ACCTTTGCGG AGCCGCATGA CTGCGTGGTG
GTTCGGCGTG ACCAGATCCA GACCAAGCAG ATCTGGGAAC GCGACGATCC CTACTTCGCT
GAAACGGTGA AGATGGCGGA GGAGGACGGT GTCACTCTGA CCCGTGATAA TAAGGTCATC
CGTGACGGCA ACAAAGTGCG GGTCTATATG ACCCTGATTG CGCCGGAGTT CGGTATGAAC
CACTTCCGGG TGAAGCAGGG TGACGAGGTG ACCGTGGTCT GCACCAACCT CGACATGATC
CAGGACCTCA CCCACGGCTT CTGTGTGTGT GATCATGGGG TCAGTATCGA GGTCAGCCCG
CAGCAGACGG CATCGGTGAC CTTTACCGCC GACAAGGCCG GGGTCTACTG GTATTACTGC
AACTGGTTCT GCCATGCCAT GCACATGGAG ATGGCCGGTC GCATGATCGT GGAACCCGCG
TAA
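
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any computation. As a minimal sketch, the helpers below strip the whitespace and compute the G+C fraction, which could be compared with the GC content value reported in the record. Both function names are hypothetical, and the record does not say how its GC content figure was rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence as printed in the record.
    first_line = "ATGAAAGAGA TGGACACAAA GACCACCACG GTGGACACCA CCAACAGTGA GGAATCGGCG"
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full 1923 bp sequence to `gc_fraction` in the same way gives the value for the whole gene.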
 
Protein sequence
MKEMDTKTTT VDTTNSEESA GLRNPSRRKF LGTTAAVGAV SAAGAAGTGA LVRSGEVQAA 
DQSILEKIHV GPGDLDEYYG FWSGGHNGEV RVYGVPSMRE IMRIPVFNVC SATGYGISNE
SKRILGESSK FLNGDAHHPH ISYTDGKHDG RYLFINDKAN TRVARVRLDI MKTDKVTTIP
NAQAIHGLRL QKAPKTGYVY CNGEMIIPLP NDGTELEDPK KHFSVWSALD SETMEVAWQV
LVDGNLDNMD SSYCGKYGAS TCYNSEHGFT LEEKMRAERD HVVIFNIPRI EEAVENGEYM
TFGDNGVPVV DGRKGSQLTR YIPVPRNPHG LNGSTDGKYF IANGKLSPTV TMIDISRLDD
LFDDKIEPRD AVAGEPELGL GPLHTTFDGR GNAYTTLFID SQVVKWNMED AVRHFQGENV
NYIRQKLNVH YQPGHLKATL AESSEADGKW LFSLNKFSKD RFLPVGPLHP ENDQMIDISG
EEMKLVHDTP TFAEPHDCVV VRRDQIQTKQ IWERDDPYFA ETVKMAEEDG VTLTRDNKVI
RDGNKVRVYM TLIAPEFGMN HFRVKQGDEV TVVCTNLDMI QDLTHGFCVC DHGVSIEVSP
QQTASVTFTA DKAGVYWYYC NWFCHAMHME MAGRMIVEPA
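
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below uses the standard codon-to-amino-acid assignments, which are understood here to be shared by translation table 11 (that table differs from the standard code in its permitted start codons, not in its amino-acid assignments). The function name `translate` and the compact table layout are choices made for this example only.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 21 bases of the gene sequence followed by its stop codon.
    print(translate("ATGAAAGAGA TGGACACAAA G TAA"))  # prints "MKEMDTK"
```

The output matches the first seven residues of the protein sequence above. Translating the full gene sequence in the same way should give the 640-residue protein, assuming the sequence contains only A, C, G, and T.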