Gene Information |
Locus tag | Nmul_A1551 |
Symbol | |
ID | 3785273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1777319 |
End bp | 1778638 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637811639 |
Product | homoserine dehydrogenase |
Protein accession | YP_412246 |
Protein GI | 229137830 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
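The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1777319
END_BP = 1778638


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1320
print(protein_length(length_bp))  # 439
```

Under these assumptions the results agree with the listed Gene Length (1320 bp) and Protein Length (439 aa).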
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCTA TCCATGTGGG CCTGTTGGGG GCAGGCACGG TCGGCAGCGG CACGTTCGCC GTGCTCAAAC GCAATCAGGA AGAGATCACC CGCCGCGCCG GCCGCAGTAT CGTCGTTCGC ATGATTGCTG ATCGAGAGGA AGAGAAGGCC CGTCAGATCG CAGGCGACGA TGAGGTCATC GTCACACGCG ATGCGAATGA GGTGGTGATG AATCCCGATA TCGATATCGT GGTCGAACTG ATCGGCGGCT ACACCGCGGC CAGGGACCTG ATACTGAAGG CTATCGAGAA TGGCAAGCAT GTCATTACCG CCAACAAGGC ATTGCTTGCT TCACATGGGA CCGAAATCTT CGCCGCGGCG CAGAAGAAAG GCGTAATGGT GGCGTTCGAA GCGGCGGTGG CGGGAGGCAT TCCTATAATC AAGGCTCTGC GTGAGGGATT GACTGCCAAC CGCATCGAGT GGATAGCCGG CATCATCAAT GGCACGAGCA ACTTCATTCT CTCGGAAATG CGGGATAAGG GGTTGACGTT CGAAACCGTA TTGAAGCAGG CGCAAAAACT GGGTTATGCC GAAGCCGATC CCACTTTTGA CATCGAGGGC ATCGATGCGG CGCATAAACT CACGATCATG GCCTCGATCG CCTTTGGCAT TCCAATGCAG TTTGACAAGG TATATACCGA GGGTATAACC AAATTGACCC GCGAGGATAT TCGTTATGCG GAGGAACTGG GTTATCGCAT CAAGCTGTTG GGCATTACGA AACGTACGTC CGGAGGAATC GAGTTGCGTG TGCATCCGAC ACTCATCCCT GCTCGAAGAC TGATCGCCAA TGTCGAGGGG GTGATGAATG CCATCGTGGT GAGAGGCGAT GCGGTAGGCT CTACCCTCTA TTATGGTCCG GGAGCGGGTG CCGAACCTAC AGGGAGTTCA GTCGTGGCAG ACCTGGTGGA TGTAACTCGC ATGCACACAG CCGATCCCAA GCACCGCGTT CCTCATCTCG CCTTCCAGCC AGGCCGCCTG TCGGATACGC CGATCCTCAC GATGGACGAG GTGGAAACGT CTTATTACCT GCGGCTGCGG GTCATGGACA AACCTGGGGC CCTGGCCGAT ATCACGCGGG TGCTTGCGGA CCTCGGCATT TCCATCGAAG CCATGATGCA GAAAGAGCCA AGCGAAGGCG AAGACCAGGT GGATATCATT ATGCTCACGC ATTTGGCGGT GGAAAGAAAC GTTAACGATG CGATCGCCCG AATAAAGCGA TTGCCCATAA CGACCGGCAA GGTGACCCGC ATCCGGCTGG AGCATCTGGG CAGCAAATAA
|
Protein sequence | MKPIHVGLLG AGTVGSGTFA VLKRNQEEIT RRAGRSIVVR MIADREEEKA RQIAGDDEVI VTRDANEVVM NPDIDIVVEL IGGYTAARDL ILKAIENGKH VITANKALLA SHGTEIFAAA QKKGVMVAFE AAVAGGIPII KALREGLTAN RIEWIAGIIN GTSNFILSEM RDKGLTFETV LKQAQKLGYA EADPTFDIEG IDAAHKLTIM ASIAFGIPMQ FDKVYTEGIT KLTREDIRYA EELGYRIKLL GITKRTSGGI ELRVHPTLIP ARRLIANVEG VMNAIVVRGD AVGSTLYYGP GAGAEPTGSS VVADLVDVTR MHTADPKHRV PHLAFQPGRL SDTPILTMDE VETSYYLRLR VMDKPGALAD ITRVLADLGI SIEAMMQKEP SEGEDQVDII MLTHLAVERN VNDAIARIKR LPITTGKVTR IRLEHLGSK
|
| |
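As a rough consistency check between the gene sequence and the protein sequence, the sketch below translates short fragments copied from the start and end of the gene sequence above. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code for elongation codons; alternative start codons are not handled, which is harmless here because this gene starts with ATG. The helper names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Amino acid assignments for codons in TCAG order (NCBI table 11;
# identical to the standard code for non-initiator codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence."""
    seq = clean(seq)
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAAACCTA TCCATGTGGG CCTGTTGGGG"))  # MKPIHVGLLG
# Last nine bases of the gene sequence above (ends in the TAA stop codon).
print(translate("AGCAAATAA"))                         # SK
```

Both fragments agree with the first ten and last two residues of the protein sequence. Applying `gc_percent` to the full gene sequence would give a value to compare against the listed GC content.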