Gene Information |
Locus tag | Mlg_1074 |
Symbol | |
ID | 4268996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1252757 |
End bp | 1254679 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638125826 |
Product | nitrous-oxide reductase |
Protein accession | YP_741916 |
Protein GI | 114320233 |
COG category | [C] Energy production and conversion |
COG ID | [COG4263] Nitrous oxide reductase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
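The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon (the record lists "Is gene spliced | No", and the gene sequence ends in TAA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table for Mlg_1074.
START_BP = 1252757
END_BP = 1254679


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based coordinates with both ends inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1923, matching "Gene Length"
print(protein_length(length_bp))  # 640, matching "Protein Length"
```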
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.198434 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAGAGA TGGACACAAA GACCACCACG GTGGACACCA CCAACAGTGA GGAATCGGCG GGCCTGCGGA ACCCGAGCCG TCGTAAGTTC CTCGGTACCA CGGCTGCCGT TGGGGCGGTC AGCGCCGCGG GTGCCGCCGG CACCGGTGCG CTGGTTCGCT CCGGTGAGGT ACAGGCAGCC GACCAATCCA TTCTAGAGAA GATCCACGTA GGGCCGGGGG ATCTCGACGA GTACTACGGC TTCTGGAGCG GCGGTCACAA CGGCGAGGTT CGGGTTTACG GCGTGCCGTC GATGCGGGAG ATCATGCGCA TCCCGGTATT CAATGTCTGT TCCGCAACCG GCTATGGCAT CAGTAACGAG AGCAAGCGGA TTCTGGGCGA AAGCTCTAAG TTCCTGAATG GCGACGCCCA CCACCCGCAT ATCAGTTACA CCGATGGCAA GCACGACGGC CGGTACCTGT TTATCAACGA CAAGGCCAAT ACCCGCGTCG CCCGGGTCCG GCTTGACATC ATGAAGACGG ACAAGGTGAC CACCATCCCG AACGCCCAGG CGATCCACGG CCTGCGGCTG CAGAAGGCGC CGAAAACCGG CTATGTCTAC TGCAACGGTG AGATGATCAT TCCGTTGCCC AACGACGGCA CCGAGCTGGA GGATCCCAAG AAGCACTTCT CCGTCTGGTC CGCGTTGGAT TCTGAGACCA TGGAGGTGGC CTGGCAGGTG CTGGTGGATG GCAACCTGGA CAACATGGAC TCCAGCTATT GTGGGAAGTA CGGGGCATCC ACCTGCTACA ACTCCGAGCA CGGCTTCACG CTTGAGGAGA AGATGCGGGC GGAGCGCGAC CATGTGGTCA TCTTCAATAT CCCGCGCATC GAGGAGGCCG TCGAGAATGG CGAATACATG ACCTTCGGTG ACAATGGCGT CCCGGTGGTC GATGGCCGAA AGGGTTCTCA GCTTACCCGG TACATCCCGG TGCCCCGTAA CCCCCACGGC CTGAACGGCT CCACCGACGG CAAGTACTTC ATTGCCAACG GCAAGCTGTC GCCCACGGTC ACCATGATCG ACATTTCCCG TTTGGATGAC CTGTTCGATG ACAAAATCGA GCCGCGGGAT GCGGTCGCCG GCGAGCCGGA ACTGGGTCTG GGGCCACTGC ATACTACTTT TGACGGGCGC GGCAACGCCT ACACTACGCT GTTCATCGAC AGCCAGGTGG TCAAGTGGAA CATGGAGGAT GCGGTCCGCC ACTTCCAGGG CGAGAACGTC AACTACATCC GGCAGAAGTT GAACGTGCAC TACCAGCCGG GCCACCTCAA GGCCACGCTG GCCGAATCCA GTGAAGCGGA CGGCAAGTGG TTGTTCTCCC TTAATAAGTT CTCGAAGGAC CGCTTCCTTC CCGTGGGGCC GCTGCACCCG GAGAACGATC AGATGATCGA CATCTCCGGC GAGGAGATGA AGCTGGTCCA CGATACCCCG ACCTTTGCGG AGCCGCATGA CTGCGTGGTG GTTCGGCGTG ACCAGATCCA GACCAAGCAG ATCTGGGAAC GCGACGATCC CTACTTCGCT GAAACGGTGA AGATGGCGGA GGAGGACGGT GTCACTCTGA CCCGTGATAA TAAGGTCATC CGTGACGGCA ACAAAGTGCG GGTCTATATG ACCCTGATTG CGCCGGAGTT CGGTATGAAC CACTTCCGGG TGAAGCAGGG TGACGAGGTG ACCGTGGTCT GCACCAACCT CGACATGATC CAGGACCTCA CCCACGGCTT CTGTGTGTGT GATCATGGGG TCAGTATCGA GGTCAGCCCG 
CAGCAGACGG CATCGGTGAC CTTTACCGCC GACAAGGCCG GGGTCTACTG GTATTACTGC AACTGGTTCT GCCATGCCAT GCACATGGAG ATGGCCGGTC GCATGATCGT GGAACCCGCG TAA
|
Protein sequence | MKEMDTKTTT VDTTNSEESA GLRNPSRRKF LGTTAAVGAV SAAGAAGTGA LVRSGEVQAA DQSILEKIHV GPGDLDEYYG FWSGGHNGEV RVYGVPSMRE IMRIPVFNVC SATGYGISNE SKRILGESSK FLNGDAHHPH ISYTDGKHDG RYLFINDKAN TRVARVRLDI MKTDKVTTIP NAQAIHGLRL QKAPKTGYVY CNGEMIIPLP NDGTELEDPK KHFSVWSALD SETMEVAWQV LVDGNLDNMD SSYCGKYGAS TCYNSEHGFT LEEKMRAERD HVVIFNIPRI EEAVENGEYM TFGDNGVPVV DGRKGSQLTR YIPVPRNPHG LNGSTDGKYF IANGKLSPTV TMIDISRLDD LFDDKIEPRD AVAGEPELGL GPLHTTFDGR GNAYTTLFID SQVVKWNMED AVRHFQGENV NYIRQKLNVH YQPGHLKATL AESSEADGKW LFSLNKFSKD RFLPVGPLHP ENDQMIDISG EEMKLVHDTP TFAEPHDCVV VRRDQIQTKQ IWERDDPYFA ETVKMAEEDG VTLTRDNKVI RDGNKVRVYM TLIAPEFGMN HFRVKQGDEV TVVCTNLDMI QDLTHGFCVC DHGVSIEVSP QQTASVTFTA DKAGVYWYYC NWFCHAMHME MAGRMIVEPA
|
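The gene sequence above is printed in space-separated groups of 10 bases, so it has to be normalized before fields such as GC content can be recomputed from it. The following is a minimal sketch of that preprocessing, with hypothetical helper names. It assumes only that whitespace is used for grouping, and it takes the stop codons for translation table 11 to be TAA, TAG and TGA. The example strings are short excerpts from the start and end of the record, not the full sequence, so the GC value shown is not the 60% reported for the whole gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize(grouped):
    """Remove the whitespace used for 10-base grouping and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C in an already normalized sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq):
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# Excerpts only: the first two groups and the last two groups of the record.
head = normalize("ATGAAAGAGA TGGACACAAA")
tail = normalize("GAACCCGCG TAA")

print(head)                  # ATGAAAGAGATGGACACAAA
print(gc_fraction(head))     # GC fraction of the 20-base excerpt only
print(ends_with_stop(tail))  # True
```

To check the reported GC content, the full gene sequence from the record would be passed through `normalize` and then `gc_fraction`.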