Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1557 |
Symbol | |
ID | 5774360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1424527 |
End bp | 1425597 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 641317209 |
Product | class I/II aminotransferase |
Protein accession | YP_001582891 |
Protein GI | 161529065 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAATT CGTGGTATGA AAAAAAATTA GAAGACTTTG CAAAACTAGG TGGTTACAAA AAACCTGAAA AATTTGATGA TGTTTTAAAA CTGGATTCAA ATGAGAATTT TGTAATAAGC AAAAAATTCC AACAAGATGT TATTGCTTAT GCAAAATCAA ATTCTGATGT AAGAGAATAT CCATTAGGAG GAGTTGAAAA ATTAGTATCA AAACTAGCAA AATATCTCAA AGTTTCTGAA AATATGATTG GTGTTGGAAA TGGTTCAGAC CAAATCTTGG ATCTGTTTTT AGCAAATATG GCCTCAAAGA AAACTAGGAT TTTAACATCT GATCCAACAT TTGGGTTCTT TGAAGAACGA TGCAAACTAT ATGCTATTCC TTGTACTAAA ATTCCATTTT CATCTGACAT GAAACTTGAT ATTGAAAAAT TCAATTCAAA CTTGAAAAAA TGTCACATTC TTTATTTGGA TTCTCCAAAC AATCCTACTG GATTTCAATT CTCAAAAGCT CAACTAGAAT CTTTAATCAA AAAATTTGAT GGATTGGTAA TCATTGATGA GGCATATGGT GAATTTGGTG ACTCATCAAT TGTTTCATTG ACAAAAAAAT ATGATAACCT AATTGTTGTC AAAACATTTT CCAAAGCATT TGGACTTGCA GGTTTACGTA TAGGATACTT TGTTGCAAAC AAGAAAATAG TTGAAGTCTT TAACCAAGTA TTACAATATC CGTATCCTCT CAATACATTG GCAATCGAGG CTGGAATTGC CTCTTTGGAC AAAGTTGATC AAATGAAAGA GGCATCTGAA ATCATCAAAT CTGAGCGTAA AAAAATTATT GAGAATTTGC GCAAGTATGA TGCCTTTACT GTTTTTGATT CCAATGCTAA TTTTGTACTA TTTGATGCTA AAGGCGCAGA CAAACGCGTA TTCTCTGCAC TAGTAGAACA AGGGATTTCT ATACGCAAAC TAGGTAAGAT TGGTTCACAT TCAGGATGCT TGAGAGTTAC TGTTGGAACC AAGGAAATGA ATTCAAAATT CCTTTTAGCA ATACGTGATC TTTTAGGATA A
|
Protein sequence | MKNSWYEKKL EDFAKLGGYK KPEKFDDVLK LDSNENFVIS KKFQQDVIAY AKSNSDVREY PLGGVEKLVS KLAKYLKVSE NMIGVGNGSD QILDLFLANM ASKKTRILTS DPTFGFFEER CKLYAIPCTK IPFSSDMKLD IEKFNSNLKK CHILYLDSPN NPTGFQFSKA QLESLIKKFD GLVIIDEAYG EFGDSSIVSL TKKYDNLIVV KTFSKAFGLA GLRIGYFVAN KKIVEVFNQV LQYPYPLNTL AIEAGIASLD KVDQMKEASE IIKSERKKII ENLRKYDAFT VFDSNANFVL FDAKGADKRV FSALVEQGIS IRKLGKIGSH SGCLRVTVGT KEMNSKFLLA IRDLLG
|
| |