Gene Nmar_1072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1072 
Symbol 
ID5774528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp944015 
End bp945715 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content38% 
IMG OID641316714 
Productacetolactate synthase large subunit, biosynthetic type 
Protein accessionYP_001582406 
Protein GI161528580 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATA TGGAAAATAT GATTGGTGCA AAGGCCTTGA TGACTGCAAT GGAAAAAGAA 
GGCGTCAAGG AAGTATTTGG TTTACCTGGA GGTGCAAATC TTCCAATGTA TGATGAATTT
GCTAGATGTG ATATTAGACA TATTTTGGCA AGACATGAAC AATCTGCAGC TCATATGGCT
GATGGATTTG GAAGAGTAAG TCGAAAGCCT GGAGTTTGTT TTGCAACATC TGGACCTGGC
GCTACCAATA TTCTTACTGG CATTGCAACA GCACAAGCTG ACTCTGCACC AATGGTTGCA
GTAACTGGAC AAGTTCCTAC TCCTATGATC GGAAAAGATG CATTTCAAGA AAGTGATATT
ATTGGAATGG CAAATCCTGT TGTAAAATAT GCATTTCAAC CAAGACATGC TTCAGAAATC
CCTGAAGTTG TAAAGAAAGG ATTCTTCATT GCAGAAACTG GAAGACCTGG ACCTGTTTTA
ATTGACATTC CAAAAGATGT GCAACAAAAC GAAGCTGAAA TGGTTTTTCC TGATGAATTT
AAAATTCAAG GTTATCATCC TTGGGCTGAT CCTGATATTG TTGCAGTTGA AAAAGCAATT
GATATGTTAC TTCATGCTCA AAAACCTGTA ATCTTAGCTG GTGGTGGAAC AATCATTTCA
TCCGCATTTG CAGAATTACA AGCTGTTGCT GAAGCATTAA TGCTTCCTGT AGTTACTACC
TTCAAAGGAA AAGGCGCATT TCCTGAAAAC CATCCTTTAT CATTAGGGCC AATTGGAATG
CATGGTCATG CAGAAGCAAA CAAAATCATG ACTGAGGCTG ATTGTGTTTT AGCAATTGGT
ACCAGATTTT CTGACAGGTC TGTTGGAACT TTTGAAGAGT TTGAGAAAAA TCTAAAGATT
ATCCACATGG ATGTTGATCC TGCTGAAATT GGTAAAAACC AAACAGCACA ACTTGCAGTA
GTTGGTGATG TACAGATGAA TCTTAGAATT ATGGTAAAGT TACTTTTGCA AAAAGCAATC
AAAAAGACTG ACGAGACCCC TTGGGTAAAA CATGTTAAGG AAACTAAAGC CTATTGGCAA
GAAAATTTGA AATTACATCC TGGTGAAATG GGTGCTGCAA AAATTTTACG TAAATTAAGA
GAGATATTAC CAAAAGAATC TATCATCACA ACAGAAGTAG GACAACATCA AATGTGGGCA
TCCTTATTTT ATGATGTAAT TCAACCTGGA ACTTTCTTTA GTTCTACTGG ACTTGGTACT
ATGGGTTGGG GATTCCCAGC AGCAATTGGT GCCAAAGTGG CAAAACCTGA TGTTCCTGTT
GTAGATATTG CAGGAGATGG TAGTTTTGCA ATGACTGAAA ACTCTCTTGC AACTGCAGTG
CTAGAAGACA TTCCTGTAAT TGTATTTGTA ATGAACAACT TTACATTGGG AATGGTAGCT
CAATGGCAAA GAACATTTTA TGAAAGAAGA ATGATTGGAG TGGACCAGGG AAAATGGTGT
CCTGATTATG TTAAATTAGC AGAATCTTAT GGAGCTCAAG GAATAAGAGC ACAATCCATG
GATGAACTTG ATAAGGCAAT CAAAGACGCA TTGAGTAGTG ATGTTGCAAC AGTAATTGAT
ATTCCAATTG ATCCTGAAGA AGACGTATTG CCATTTGTAG CTCCTGGAAC TTCGCTTAAG
GATATGATAT TACCATCATA G
 
Protein sequence
MSDMENMIGA KALMTAMEKE GVKEVFGLPG GANLPMYDEF ARCDIRHILA RHEQSAAHMA 
DGFGRVSRKP GVCFATSGPG ATNILTGIAT AQADSAPMVA VTGQVPTPMI GKDAFQESDI
IGMANPVVKY AFQPRHASEI PEVVKKGFFI AETGRPGPVL IDIPKDVQQN EAEMVFPDEF
KIQGYHPWAD PDIVAVEKAI DMLLHAQKPV ILAGGGTIIS SAFAELQAVA EALMLPVVTT
FKGKGAFPEN HPLSLGPIGM HGHAEANKIM TEADCVLAIG TRFSDRSVGT FEEFEKNLKI
IHMDVDPAEI GKNQTAQLAV VGDVQMNLRI MVKLLLQKAI KKTDETPWVK HVKETKAYWQ
ENLKLHPGEM GAAKILRKLR EILPKESIIT TEVGQHQMWA SLFYDVIQPG TFFSSTGLGT
MGWGFPAAIG AKVAKPDVPV VDIAGDGSFA MTENSLATAV LEDIPVIVFV MNNFTLGMVA
QWQRTFYERR MIGVDQGKWC PDYVKLAESY GAQGIRAQSM DELDKAIKDA LSSDVATVID
IPIDPEEDVL PFVAPGTSLK DMILPS