Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0214 |
Symbol | |
ID | 5774542 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 190720 |
End bp | 192114 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641315834 |
Product | dihydropyrimidinase |
Protein accession | YP_001581548 |
Protein GI | 161527722 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTATG ATGCTGTAAT TGTTGATTCA CACGTTATAC TCCCGCAAGG AATGGTGGAT AAAAACATCG TAATTGATGA TGGAAAAGTA GTTAGCCTCA CAAATGATGT TCCTGCATGT GATAACAAAA TCAATGGAAA TGGACTGATT TCTGTTCCTG GTCCAATTGA TACACATGTT CACTATGGTG TTTATTCTCC AATCAACGAG GCAGCAAAGA CAGAATCTCA TGCTGCTGCA ATTGGTGGTG TTACAACTAT GATGAGGATG CTTAGATTGG GAGACCCATT TTCTAATTCA CTCCAAGATC AATTAGATGC TGCATCTCAA AATCATTATG TTGACTATGC AATACATGCA TCTGTTTTTA CTCCTCAACA AATTAATGAA ATGGATTACT GTGTGAAAAA AGGAATTACC TCATTCAAAA TTTACATGAA TCTTGGCGGC GAAATTGGTC ATGTTTACAT GGATATGCCA CCAAATTCTT CTAAACTTGT TGCAGCTAAC GTTGATGTAA CTGATGAGAT TGTTGAGCAG ACTGTAAAAA CTGCAGCTTC TCTTGGATGT CCTGTTTTGG TTCATGCAGA AGATTATGAA TCCTGTGGAT GTGGAATTCA AACTGCACGA GAGAAAAACC AAGATGGATT GTCTGCATGG TCTGAAAGTC GTTCCCCTGA ATTTGAAGCA AAAGCCATCA AAACTGTTTC AAAGTTTGGA CGCGATTATG ATTGTGTTAT CTATTTTGTT CATATTGGTT CAGAACAAGC TCTAAAACAA ATTCAAGAAG AAAAAAAACT AGGAACAAAG ATTTTTGTCG AAACATGTCC TCACTATTTG ACATTATCTT ATGAAAAACA AAATGGCTAT CTTGCTAAAG TAATGCCTCC AATACGAACA GAAACTGACC GACAAGCAGT ATGGGGTGCA CTTTCTTCAA ATCAAATTGA TACTATAGGC ACTGATCATG TTGCAAATCA ACTCAAACTC AAATTGGGTG GAGATGATGT ATGGTCTGCA TTAGCTGGAT TTCCAGGAAT AGGAACTGTA CTTCCTATTG TACTCAATGA TGGAGTAAAC CAGAATAAAA TTACTTTAGA GCAATTTGTG CGCTTTACAA GCCAAAATGC CGCACAAATA TTTGGAATGT ATCCTCAAAA AGGTACTCTT GAAAAGAATT CTGATGCTGA TGTTACTATG ATTGATCTGA AAAAAGAAAA GAAAGTAACA TCTGAATTAT TTGGTGGGTT TTCAGATTAC ATCGTTTATG AAGGTAGAAA TTTGAAAGGC TGGCCTGTTA AAACAATAGT ACGAGGAGAA TTAATTGCTG AAGACTTTGA AGTTATTGGA AAACTTGGTC ATGGGAAACT TGTTCCACGA GCAATTCCTA AATAA
|
Protein sequence | MTYDAVIVDS HVILPQGMVD KNIVIDDGKV VSLTNDVPAC DNKINGNGLI SVPGPIDTHV HYGVYSPINE AAKTESHAAA IGGVTTMMRM LRLGDPFSNS LQDQLDAASQ NHYVDYAIHA SVFTPQQINE MDYCVKKGIT SFKIYMNLGG EIGHVYMDMP PNSSKLVAAN VDVTDEIVEQ TVKTAASLGC PVLVHAEDYE SCGCGIQTAR EKNQDGLSAW SESRSPEFEA KAIKTVSKFG RDYDCVIYFV HIGSEQALKQ IQEEKKLGTK IFVETCPHYL TLSYEKQNGY LAKVMPPIRT ETDRQAVWGA LSSNQIDTIG TDHVANQLKL KLGGDDVWSA LAGFPGIGTV LPIVLNDGVN QNKITLEQFV RFTSQNAAQI FGMYPQKGTL EKNSDADVTM IDLKKEKKVT SELFGGFSDY IVYEGRNLKG WPVKTIVRGE LIAEDFEVIG KLGHGKLVPR AIPK
|
| |