Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1072 |
Symbol | |
ID | 5774528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 944015 |
End bp | 945715 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641316714 |
Product | acetolactate synthase large subunit, biosynthetic type |
Protein accession | YP_001582406 |
Protein GI | 161528580 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR00118] acetolactate synthase, large subunit, biosynthetic type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGATA TGGAAAATAT GATTGGTGCA AAGGCCTTGA TGACTGCAAT GGAAAAAGAA GGCGTCAAGG AAGTATTTGG TTTACCTGGA GGTGCAAATC TTCCAATGTA TGATGAATTT GCTAGATGTG ATATTAGACA TATTTTGGCA AGACATGAAC AATCTGCAGC TCATATGGCT GATGGATTTG GAAGAGTAAG TCGAAAGCCT GGAGTTTGTT TTGCAACATC TGGACCTGGC GCTACCAATA TTCTTACTGG CATTGCAACA GCACAAGCTG ACTCTGCACC AATGGTTGCA GTAACTGGAC AAGTTCCTAC TCCTATGATC GGAAAAGATG CATTTCAAGA AAGTGATATT ATTGGAATGG CAAATCCTGT TGTAAAATAT GCATTTCAAC CAAGACATGC TTCAGAAATC CCTGAAGTTG TAAAGAAAGG ATTCTTCATT GCAGAAACTG GAAGACCTGG ACCTGTTTTA ATTGACATTC CAAAAGATGT GCAACAAAAC GAAGCTGAAA TGGTTTTTCC TGATGAATTT AAAATTCAAG GTTATCATCC TTGGGCTGAT CCTGATATTG TTGCAGTTGA AAAAGCAATT GATATGTTAC TTCATGCTCA AAAACCTGTA ATCTTAGCTG GTGGTGGAAC AATCATTTCA TCCGCATTTG CAGAATTACA AGCTGTTGCT GAAGCATTAA TGCTTCCTGT AGTTACTACC TTCAAAGGAA AAGGCGCATT TCCTGAAAAC CATCCTTTAT CATTAGGGCC AATTGGAATG CATGGTCATG CAGAAGCAAA CAAAATCATG ACTGAGGCTG ATTGTGTTTT AGCAATTGGT ACCAGATTTT CTGACAGGTC TGTTGGAACT TTTGAAGAGT TTGAGAAAAA TCTAAAGATT ATCCACATGG ATGTTGATCC TGCTGAAATT GGTAAAAACC AAACAGCACA ACTTGCAGTA GTTGGTGATG TACAGATGAA TCTTAGAATT ATGGTAAAGT TACTTTTGCA AAAAGCAATC AAAAAGACTG ACGAGACCCC TTGGGTAAAA CATGTTAAGG AAACTAAAGC CTATTGGCAA GAAAATTTGA AATTACATCC TGGTGAAATG GGTGCTGCAA AAATTTTACG TAAATTAAGA GAGATATTAC CAAAAGAATC TATCATCACA ACAGAAGTAG GACAACATCA AATGTGGGCA TCCTTATTTT ATGATGTAAT TCAACCTGGA ACTTTCTTTA GTTCTACTGG ACTTGGTACT ATGGGTTGGG GATTCCCAGC AGCAATTGGT GCCAAAGTGG CAAAACCTGA TGTTCCTGTT GTAGATATTG CAGGAGATGG TAGTTTTGCA ATGACTGAAA ACTCTCTTGC AACTGCAGTG CTAGAAGACA TTCCTGTAAT TGTATTTGTA ATGAACAACT TTACATTGGG AATGGTAGCT CAATGGCAAA GAACATTTTA TGAAAGAAGA ATGATTGGAG TGGACCAGGG AAAATGGTGT CCTGATTATG TTAAATTAGC AGAATCTTAT GGAGCTCAAG GAATAAGAGC ACAATCCATG GATGAACTTG ATAAGGCAAT CAAAGACGCA TTGAGTAGTG ATGTTGCAAC AGTAATTGAT ATTCCAATTG ATCCTGAAGA AGACGTATTG CCATTTGTAG CTCCTGGAAC TTCGCTTAAG GATATGATAT TACCATCATA G
|
Protein sequence | MSDMENMIGA KALMTAMEKE GVKEVFGLPG GANLPMYDEF ARCDIRHILA RHEQSAAHMA DGFGRVSRKP GVCFATSGPG ATNILTGIAT AQADSAPMVA VTGQVPTPMI GKDAFQESDI IGMANPVVKY AFQPRHASEI PEVVKKGFFI AETGRPGPVL IDIPKDVQQN EAEMVFPDEF KIQGYHPWAD PDIVAVEKAI DMLLHAQKPV ILAGGGTIIS SAFAELQAVA EALMLPVVTT FKGKGAFPEN HPLSLGPIGM HGHAEANKIM TEADCVLAIG TRFSDRSVGT FEEFEKNLKI IHMDVDPAEI GKNQTAQLAV VGDVQMNLRI MVKLLLQKAI KKTDETPWVK HVKETKAYWQ ENLKLHPGEM GAAKILRKLR EILPKESIIT TEVGQHQMWA SLFYDVIQPG TFFSSTGLGT MGWGFPAAIG AKVAKPDVPV VDIAGDGSFA MTENSLATAV LEDIPVIVFV MNNFTLGMVA QWQRTFYERR MIGVDQGKWC PDYVKLAESY GAQGIRAQSM DELDKAIKDA LSSDVATVID IPIDPEEDVL PFVAPGTSLK DMILPS
|
| |