Gene Information |
Locus tag | Mlg_1702 |
Symbol | |
ID | 4269788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1946242 |
End bp | 1948956 |
Gene Length | 2715 bp |
Protein Length | 904 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638126460 |
Product | assimilatory nitrate reductase (NADH) alpha subunit apoprotein |
Protein accession | YP_742538 |
Protein GI | 114320855 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing; [COG1251] NAD(P)H-nitrite reductase |
TIGRFAM ID | |
|
|
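The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both assumptions are consistent with the reported figures but are not stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1946242
end_bp = 1948956
reported_gene_length = 2715      # bp
reported_protein_length = 904    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # 2715 904
print(computed_gene_length == reported_gene_length)        # True
print(computed_protein_length == reported_protein_length)  # True
```

Under these assumptions, 1948956 - 1946242 + 1 = 2715 bp, and 2715 / 3 - 1 = 904 aa, matching the Gene Length and Protein Length fields.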
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0601205 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.585454 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACAA GCATCCAACC CCCACGGCAG CCGGGCGAGG TGCACACCAC CTGCCCCTAT TGTGGCGTCG GTTGCGGCGT GCTCGCCCGG ATGGACAAGG ACAACCGGGT GCATGTCCGG GGCGACCCGG AGCACCCCGC CAACTTCGGC CGCCTCTGCT CCAAGGGGGC GGCCCTGGGC GAAACCACCG GTCTCAATGG CCGGCTGTTG CACCCCCGGG TCCATGGTCA GCGCACCGAC TGGGACACCG CGCTTGCGAC CGTGGCCGAC GGCCTCAACC GGGTCATCCG GCGCGACGGG CCCGAGGCGG TTGGCTTCTA CGTCTCGGGC CAGCTCCTCA CCGAGGACTA CTACGTCGCC AACAAGCTAA TGAAGGGCTA CATCGGCTCG GCGAACATCG ACACCAATTC CAGGCTCTGC ATGGCCTCTT CGGTCGCAGG CCACATTAGG GCCTTTGGTG AGGATCTGGT GCCCGGCTGT TACGAAGACC TGGAACGCGC CGACCTGGTG GTCCTCACCG GTTCCAACCT GGCCTGGTGC CACCCGGTGC TCTATCAACG CCTGGAGGCG GCGCGGGCAA GGCGTCCCGG CATGCGAGTG GTGGTGATCG ACCCGCGCGA GACTGAGACC GCCGCGCAGG CCGATCTCCA CCTGCCCCTG CGCCCCGGCA GTGACGTGGC ACTGTGGAAC GGACTATTCC GCCGTTTGGT GGAGACCGAC CAGATCCACC GCGATATCCG CCCGGAGACG GAAGCCGCCG ACAGCAACCT CAAGCGAATA CTGCACGAGG CAGGCGACGC CGACGCCGTC GCCAGCGCCT GCGACTTGAA TCCCCGCGCC CTGCACCTGT TCTACGACTG GTTCGCCGGC ACCGAACGCA CGGTCAGCGT ATACTCCCAG GGCGTTAACC AGTCCAGTCA GGGCACCGAC AAAGTCAATG CGATCATCAA CGTCCACATC CTTACCGGAC GCCTGGGGCG TCCCGGTTGC GGCCCCTTCT CGGTAACCGG TCAGCCCAAT GCCATGGGCG GTCGCGAGGT GGGGGGCCTC TCTAACCAGT TGGCCGCCCA TATGGGCTTT TCTCAGGGGG ACCGGATTCG GGTTCAACGC TACTGGGACT CGCCCCGAAT CGCCCGACAA CCCGGCATGA AGGCGGTAGA GCTATTTCGT GCCGCCGCCC AAGGCCGCGT CAAAGCGCTC TGGATTATGG CCACCAACCC AGTGGTCAGC CTTCCCGACG CCGACCTGGT GAAACGCGCG CTCGCACGCT GCGAGCTGGT CGTGGTTTCC GACTGTATCG CCAACACGGA TACGACCGGC TATGCCCACG TGCTTATGCC CGCTGCGGCC TGGGGTGAAA AGAGCGGTAC GGTGACCAAC TCAGAGCGGT GCATCTCACG CCAACGCGCC TTCCGTACGC CGGCCGGCGA GGCCCGCCCC GACTGGTGGA TCATCAGTCA AGTAGCCCGC CTGATGGGGT TTGATCGCGG CTTCGCGTAC GGGAGTCCGC GGGAGATATT CGACGAGCAC GCCCGACTGA CCGCGGTGCA CAACCCTGGC CCCGGAATGG GCGGCCGGAC ACTTCACCTA GGGGCGCTGG CCGGAATGAA CGCCAAACAG TGGGACACCC TGCAGCCTGT TCAATGGCCG TGTCCCGGCC CGGGCCTCGA CACCGCCGGG CCAGGGCGGG GCACCCAGCG CCTTGCCACC AACGGAGGAC TGCCCACGGA GGATGGCCAC CCCCGCCTGC ACCCGGTGTG CGCCGAAACA CCAGGAAACA CGCCAGATGC CCGATTCCCG 
CTTGTTCTCA ACACCGGGCG GACGCGTGAT CACTGGCACA CGCTCACGCG CACCGGCCTG TCCGTCAGTC TGAGCACCCA TCAGCCGGAA CCCCGGTGCG ACCTGCACCC CGAGGACGCC ACACGCTTCG GCCTGGCGGA CGGCCACCTG GTGCGGGTAC GCAGCGCCTG GGGCAGCGTT CTGCTGCGAG CCCGATACCA GACCGGTCAG CGCCGGGGCG AACTCTTCGT GCCCATGCAC TGGAACGACT GCTATGCCGC GCAGGCGCGC ATCGGCGCGG TGGCCAACCC GATCACCGAT CCGATATCCG GTCAGCCAGA GTTGAAACAC ACCCCGGTCG CTGTGGAGGC CGTGCCCGCC GCGGCCTACG GCTTTGTGCT CAGCAGGGCG GACGACCTGC CGCCACCCGA GACGGCCTAC TGGGTTCGGA TCAACGGGCA CGCTCATCAG CGGTTCGTGT TCGCCTCCAC CGAGGCCCCT GACAGCTGGC GCGATTGGGC GGGCCGTTGG CTTGGTACGG AGGGGACCGT GGTGGAAATG GCCGACCGTG CCCGGGGCGT CTACCGATTT GCCCGGCTGG TGGACGACCG GCTGGTGGCG TGCCTGTTCA TCGCCGCCGA CCCCGCCGCC CTGCCCGGCT GCGACTGGCT TGCAGGGCTG CTGGACAATA ACCGCCCCCT CGGCGCCGAG GAACGCCTCG CTCTGCTGGC CGGCCGCCCT GCCGGTGAAG CCGAGACCGG CGAGACCGTC TGCGCCTGCT TCGGTGTCGG CGAGAGAATC ATCGAATCCG CGGTGGCCGC CGGGGCTCAC GATACCGAGG CGGTGACCCG CCACTGCAAG GCGGGGGGCT ACTGCGGCAG TTGCCGGCCC GCGATCAACG CCATCATCCA ACGCCTGGCG CGCAGTGCCG CCTGA
|
Protein sequence | MNTSIQPPRQ PGEVHTTCPY CGVGCGVLAR MDKDNRVHVR GDPEHPANFG RLCSKGAALG ETTGLNGRLL HPRVHGQRTD WDTALATVAD GLNRVIRRDG PEAVGFYVSG QLLTEDYYVA NKLMKGYIGS ANIDTNSRLC MASSVAGHIR AFGEDLVPGC YEDLERADLV VLTGSNLAWC HPVLYQRLEA ARARRPGMRV VVIDPRETET AAQADLHLPL RPGSDVALWN GLFRRLVETD QIHRDIRPET EAADSNLKRI LHEAGDADAV ASACDLNPRA LHLFYDWFAG TERTVSVYSQ GVNQSSQGTD KVNAIINVHI LTGRLGRPGC GPFSVTGQPN AMGGREVGGL SNQLAAHMGF SQGDRIRVQR YWDSPRIARQ PGMKAVELFR AAAQGRVKAL WIMATNPVVS LPDADLVKRA LARCELVVVS DCIANTDTTG YAHVLMPAAA WGEKSGTVTN SERCISRQRA FRTPAGEARP DWWIISQVAR LMGFDRGFAY GSPREIFDEH ARLTAVHNPG PGMGGRTLHL GALAGMNAKQ WDTLQPVQWP CPGPGLDTAG PGRGTQRLAT NGGLPTEDGH PRLHPVCAET PGNTPDARFP LVLNTGRTRD HWHTLTRTGL SVSLSTHQPE PRCDLHPEDA TRFGLADGHL VRVRSAWGSV LLRARYQTGQ RRGELFVPMH WNDCYAAQAR IGAVANPITD PISGQPELKH TPVAVEAVPA AAYGFVLSRA DDLPPPETAY WVRINGHAHQ RFVFASTEAP DSWRDWAGRW LGTEGTVVEM ADRARGVYRF ARLVDDRLVA CLFIAADPAA LPGCDWLAGL LDNNRPLGAE ERLALLAGRP AGEAETGETV CACFGVGERI IESAVAAGAH DTEAVTRHCK AGGYCGSCRP AINAIIQRLA RSAA
|
| |
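The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any calculation. The following is a minimal sketch of how one might strip the block formatting, compute GC content (to compare with the reported 68%), and check that the sequence looks like a complete CDS under translation table 11. The helper names are hypothetical, and the start-codon set is a simplification: table 11 permits more initiation codons than the three common ones listed here.

```python
# Stop codons used by translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = ("TAA", "TAG", "TGA")
# Simplifying assumption: only the most common bacterial start codons are accepted.
COMMON_START_CODONS = ("ATG", "GTG", "TTG")


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3 and the sequence has a start and a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# The first two blocks of the gene sequence from this record, as a small demonstration.
first_blocks = clean_sequence("ATGAATACAA GCATCCAACC")
print(len(first_blocks))          # 20
print(gc_fraction(first_blocks))  # 0.4

# A toy sequence (not from the record) to exercise the CDS check.
print(looks_like_complete_cds("ATGAAATGA"))  # True
```

To apply this to the full record, paste the entire Gene sequence field into `clean_sequence` and pass the result to `gc_fraction` and `looks_like_complete_cds`; the cleaned length should equal the 2715 bp reported in the Gene Information table.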