Gene Nmar_0214 details

Gene Information

Locus tag: Nmar_0214
Symbol:
ID: 5774542
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 190720
End bp: 192114
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 36%
IMG OID: 641315834
Product: dihydropyrimidinase
Protein accession: YP_001581548
Protein GI: 161527722
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID:
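
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(190720, 192114)
print(length)                     # 1395
print(protein_length_aa(length))  # 464
```

Both results match the Gene Length and Protein Length fields of this record.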


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTATG ATGCTGTAAT TGTTGATTCA CACGTTATAC TCCCGCAAGG AATGGTGGAT 
AAAAACATCG TAATTGATGA TGGAAAAGTA GTTAGCCTCA CAAATGATGT TCCTGCATGT
GATAACAAAA TCAATGGAAA TGGACTGATT TCTGTTCCTG GTCCAATTGA TACACATGTT
CACTATGGTG TTTATTCTCC AATCAACGAG GCAGCAAAGA CAGAATCTCA TGCTGCTGCA
ATTGGTGGTG TTACAACTAT GATGAGGATG CTTAGATTGG GAGACCCATT TTCTAATTCA
CTCCAAGATC AATTAGATGC TGCATCTCAA AATCATTATG TTGACTATGC AATACATGCA
TCTGTTTTTA CTCCTCAACA AATTAATGAA ATGGATTACT GTGTGAAAAA AGGAATTACC
TCATTCAAAA TTTACATGAA TCTTGGCGGC GAAATTGGTC ATGTTTACAT GGATATGCCA
CCAAATTCTT CTAAACTTGT TGCAGCTAAC GTTGATGTAA CTGATGAGAT TGTTGAGCAG
ACTGTAAAAA CTGCAGCTTC TCTTGGATGT CCTGTTTTGG TTCATGCAGA AGATTATGAA
TCCTGTGGAT GTGGAATTCA AACTGCACGA GAGAAAAACC AAGATGGATT GTCTGCATGG
TCTGAAAGTC GTTCCCCTGA ATTTGAAGCA AAAGCCATCA AAACTGTTTC AAAGTTTGGA
CGCGATTATG ATTGTGTTAT CTATTTTGTT CATATTGGTT CAGAACAAGC TCTAAAACAA
ATTCAAGAAG AAAAAAAACT AGGAACAAAG ATTTTTGTCG AAACATGTCC TCACTATTTG
ACATTATCTT ATGAAAAACA AAATGGCTAT CTTGCTAAAG TAATGCCTCC AATACGAACA
GAAACTGACC GACAAGCAGT ATGGGGTGCA CTTTCTTCAA ATCAAATTGA TACTATAGGC
ACTGATCATG TTGCAAATCA ACTCAAACTC AAATTGGGTG GAGATGATGT ATGGTCTGCA
TTAGCTGGAT TTCCAGGAAT AGGAACTGTA CTTCCTATTG TACTCAATGA TGGAGTAAAC
CAGAATAAAA TTACTTTAGA GCAATTTGTG CGCTTTACAA GCCAAAATGC CGCACAAATA
TTTGGAATGT ATCCTCAAAA AGGTACTCTT GAAAAGAATT CTGATGCTGA TGTTACTATG
ATTGATCTGA AAAAAGAAAA GAAAGTAACA TCTGAATTAT TTGGTGGGTT TTCAGATTAC
ATCGTTTATG AAGGTAGAAA TTTGAAAGGC TGGCCTGTTA AAACAATAGT ACGAGGAGAA
TTAATTGCTG AAGACTTTGA AGTTATTGGA AAACTTGGTC ATGGGAAACT TGTTCCACGA
GCAATTCCTA AATAA
 
Protein sequence
MTYDAVIVDS HVILPQGMVD KNIVIDDGKV VSLTNDVPAC DNKINGNGLI SVPGPIDTHV 
HYGVYSPINE AAKTESHAAA IGGVTTMMRM LRLGDPFSNS LQDQLDAASQ NHYVDYAIHA
SVFTPQQINE MDYCVKKGIT SFKIYMNLGG EIGHVYMDMP PNSSKLVAAN VDVTDEIVEQ
TVKTAASLGC PVLVHAEDYE SCGCGIQTAR EKNQDGLSAW SESRSPEFEA KAIKTVSKFG
RDYDCVIYFV HIGSEQALKQ IQEEKKLGTK IFVETCPHYL TLSYEKQNGY LAKVMPPIRT
ETDRQAVWGA LSSNQIDTIG TDHVANQLKL KLGGDDVWSA LAGFPGIGTV LPIVLNDGVN
QNKITLEQFV RFTSQNAAQI FGMYPQKGTL EKNSDADVTM IDLKKEKKVT SELFGGFSDY
IVYEGRNLKG WPVKTIVRGE LIAEDFEVIG KLGHGKLVPR AIPK
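
Both sequences are printed in blocks of 10 characters, six blocks per line, so they need to be joined before use in most tools. A minimal sketch of one way to do that, with a simple GC-fraction helper for the gene sequence, is shown below. The function names are hypothetical, and only the first line of the gene sequence is used as input here.

```python
def normalize_sequence(block_text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGACTTATG ATGCTGTAAT TGTTGATTCA CACGTTATAC TCCCGCAAGG AATGGTGGAT"
dna = normalize_sequence(first_line)
print(len(dna))  # 60
print(dna[:3])   # ATG
```

Passing the full gene sequence through `normalize_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field of the record.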