Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0696 |
Symbol | |
ID | 5773236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 635915 |
End bp | 636925 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641316332 |
Product | homoserine dehydrogenase |
Protein accession | YP_001582030 |
Protein GI | 161528204 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGAATAA TATTATGTGG ATTTGGCGTT GTTGGCCAAA GTTTAGTAAA ATTATTTGAA TCCAGAGCAG AAGACCTGTA TGCAAAGTAT GGACTCAAAC CTAGAGTAGT GGGAGTGTTT GACAGTAAAG GGAGTGCAAT GGATTCTTCA GGATTAGAGT TAAACAAACT CATCGAAGTT AAGAAAAAAT TTGGAACTGT AAAAAATTAC GCTGATACAA AAAATACAAT GTCAGGTATT GACATGCTCA AAAATGTAGA GGCAGATGTG CTCATTGAAA CTACTGCAAG CAACTACAAA GATGCTGAAC CTGGAATGAC ACACATTATC ACTGCAATGA AAAAAGGAAT GCATGTAATA TCAGTAAACA AAGGACCTCT GGCACTAGCA TTTCCATCAT TAATGGAGCT TGCAACATAC AATCGAGTCA TGTTCAAATT TAGTGGAACC GTAGGTGGGG GAACACCAAT TCTAGATTAT GCAAAAAACA GTCTTAGTGG CGAAAGAATC ACATCATTTG CAGGCATTCT AAATGGAACT ACAAACTATA TCCTAACAAA CATGGCAACA GGAGTTTCAT ATGAAGATGC ACTAAAAGAT GCCCAAGACA AGGGCTACGT AGAGGCAGAT GAAGCATTAG ATTTAGACGG ACTTGACGCT GCAGCCAAAT TAGTGATTCT TGCAAATTGG ATTATGGGAA TGAAAGTTAC AATGCCAGAC ATTAATTGTA CTGGAATTCG CAAAGTAACC ACAGAAGATA TCAAAAAAGC AGAGAAAAAC AATTGTGCAG TAAAACTGAT TGCATCATGC AATAAAGAAT TGATAGTTGC TCCAAAAGAA ATTCCAAATG ATGATCCATT ATGTGTAAAT GGTACACTTA ATGCAATTGC ATTTACATCA GAGCATTCAG GCACACAGAC AATTATTGGA CGTGGTGCAG GAGGCATGGA GACTGCAAGT TCCATACTAA GAGATTTGCT AGACATTAGA CAAGAGATTG CCAGAACTTG A
|
Protein sequence | MRIILCGFGV VGQSLVKLFE SRAEDLYAKY GLKPRVVGVF DSKGSAMDSS GLELNKLIEV KKKFGTVKNY ADTKNTMSGI DMLKNVEADV LIETTASNYK DAEPGMTHII TAMKKGMHVI SVNKGPLALA FPSLMELATY NRVMFKFSGT VGGGTPILDY AKNSLSGERI TSFAGILNGT TNYILTNMAT GVSYEDALKD AQDKGYVEAD EALDLDGLDA AAKLVILANW IMGMKVTMPD INCTGIRKVT TEDIKKAEKN NCAVKLIASC NKELIVAPKE IPNDDPLCVN GTLNAIAFTS EHSGTQTIIG RGAGGMETAS SILRDLLDIR QEIART
|
| |