Gene Nmar_1085 details

Gene Information

Locus tag: Nmar_1085
Symbol:
ID: 5774497
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 986566
End bp: 989808
Gene Length: 3243 bp
Protein Length: 1080 aa
Translation table: 11
GC content: 37%
IMG OID: 641316727
Product: carbamoyl-phosphate synthase large subunit
Protein accession: YP_001582419
Protein GI: 161528593
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism; [I] Lipid transport and metabolism
COG ID: [COG0439] Biotin carboxylase; [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
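
The Start bp, End bp, Gene Length and Protein Length values above agree with one another, which a few lines of arithmetic can confirm. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 986566
end_bp = 989808

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 3243
print(protein_length)  # 1080
```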


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAAAAA ACGAATCACT AAACAAAATT CTTGTTTTGG GTAGTGGCGC CATTAAAATT 
GGTGAAGCCG GAGAATTTGA TTATTCTGGA AGTCAGTGTC TTAAAGCAAT TCATGAAGAC
GGACTCAAAA GCGTTCTAAT TAATCCAAAC ATTGCAACAA TTCAAACTGA TACAAGATTT
GCAGACCAAG TGTATCTTTT ACCTGTTAAT CCACAATATG TTGAATCTGT TGTAGAAAAA
GAAAGACCTG ATGGAATCAT GTTGGCCTAT GGTGGTCAGA CTGCACTGAA TTGTGGTGTT
AAACTAGAAG AAGCGGGAAT TTTGCAAAAA TATGGTGTCA AAGTGCTTGG AACTCAGGTT
CAAGGTATCA AAAATACTGA AGATAGGCAG CTTTTCAAGG ATCAAATGAC TGAAGCAGGC
GTTCCTGTAC TCAAGAGTAA AACTGTGACT AATGTCGATG ATGCAAAGAA GGTAGCAGAG
GAACTGAACT ACCCTGTCAT TGTCCGAGTA GCCTATACCC TAGGAGGTCG TGGCGGAGGA
ATTGCCCATA ATGAAATTGA ACTCCATGAA ATCGTTGAAC GTGGTCTTAA TGCAAGTCTT
GTAGGCCAAG TTCTAGTTGA AGAATACATT GGTCATTGGA AACAAATCGA ATATGAAGTA
ATGCAAGACT ATGATGGCAA TAATGTAATT GTCTGTAACA TGGAAAATGT TCTTTCAATG
AAAGTTCACA CTGGTGATAA CATTGTAGTT GCACCTTCTC AAACAATCAA CAACCATGAA
TATCACATGT TGCGTTCAGC TGCGTTACGT GCAACAAAAC ATGTTGGAAT TGTTGGTGAA
TGTAATATTC AATATGCATT AGATTCAGAT TCAGACAGAT ATGTTGCAAT CGAAATCAAC
CCTCGTCTAT CTCGTTCTTC TGCACTTGCA AGTAAAGCTA CAGGCTATCC ACTTGCATAC
ATGTCTGCAA AAATTGGATT AGGTTATAGT TTGTCTGAAC TAGTAAACAG AATTACAAAA
AGTACAACTG CATGCTTTGA GCCTTCACTT GATTATGTTG TTTGCAAACA TCCAAGATGG
GACTTTTCAA AGTTTGAACA AGTCAATAGA AAACTTGGAG TTACGATGAA ATCCGTTGGT
GAAGTCATGG CAGTTGGACG AACATTTGAA GAATCATTAC AAAAAGCAAT CAGAATGCTT
GCAATTGGAA ATGATGGATT GGTATTGAAT CGTGCTAATG GCAAAAAATA CACTGAAGAA
GAAATTGAAT TCAAATTATC TCATCATGAT GATGAGATTT TGTACAATGT TGCAATTGCC
TTGAAAATGG GGATTTCAGT TGAGAGAATC TACAAACTTT CTGCAATTGA TCCTTGGTTC
ATTGATAAAA TACAAAATAT CCTTAACGCG GAGGCCAAAA TCAAGGAATC TGAACTAGAC
AAATCCTTGA TGTGGGATAT CAAAAAACTA GGCTTCTCTG ATAATCAAAT TGCCCGTGCA
AAAGGAAGCA CTCCTGATGA AGTGCGTGAA ATACGCAAGG AATTAGGCGT GGTTCCATCT
GTAAAGCAGA TTGACACCCT TGCAGCAGAA TGGCCTGCAG TTACCAATTA TCTATACCTA
ACATATGGTG GACACTCTCA TGACATTGAA ATTCCAAAAG ATGATCCAGG AATTGTTGTA
GTTGGTGCGG GACCATATAG AATCGGTAGT AGCGTAGAGT TTGATTGGGG AACAGTAAAC
ATGGTTTGGG GATTGCAAGA GAATGGAGAA AAGAATGTCT CAGTTGTAAA CTGTAATCCT
GAAACAGTAT CAACTGATTA TGATATCTGT ACAAGACTGT ACTTTGAAGA ACTTACACAA
GAAAGATTAC TTGACATTAC TGACTTTGAG AATCCAAAAG GAGTCATTAC ATGTGTAGGT
GGACAAACAG CAAACAATCT GACTCCTGGA CTAGCAGAAC GTGGAATCAA TATTTTAGGA
ACATCAGCAA AAGATGTTGA CAGAGCTGAA GACCGTTCAA AGTTTAGTGC AGAATTAGAT
AAACTACACA TTGGTCAACC AAGATGGCAA GCGTTCTCAA ACCTTAATGA AGCAAAATCA
TTTGCACAAG AAGTGGGATT TCCTGTAATA GTTAGACCAT CTTATGTTTT ATCTGGAGCT
GCAATGAAAG TAGTTTGGTC GCAAGAAGAA CTCAAAACAT ATGTCAAAGA AGCAACTGAT
GTATCCCCTG ATCATCCGGT TGTAATTTCA AAATTCATGT TAAACTCATT AGAAGTTGAT
GTTGATGGAA TCAGTAATGG AAAAGAAGTT GTTATTGGCG CAATAGTTGA ACATATTGAT
TCAGCTGGTG TACACTCTGG TGATGCAATG ATGTGTATTC CTCCATGGAG ATTAAGCAAC
AAAATTATCG AAACAATTAC TGATTATACT AAACGAATTG CATTGACCTT TAATGTTAAA
GGGCCATTTA ACCTGCAATT CTTGATAAAC AATGATCAAG TCTATGTTAT AGAACTGAAC
ATACGTGCAT CACGTTCTAT GCCATTTGTC TCAAAATTAG TCAAAATGAA CCTAATTTCA
CTTGCCTCAA AGGCTATTTT GGACAAACCG TTACCTAAAA TCCCTGAAAA CAAGTGGCAG
AAAATCCATA ATTATGGAAT CAAAGTTCCA CAATTTTCAT TCATGCAGCT AGATGGTGCA
GACATTGCAT TGGGTGTAGA GATGCAGTCT ACTGGTGAAG CTGCTTGCTT TGGAAATAGC
TTCCATGATG CACTTGCAAA AGGTTTGACA TCAGTTGGAA TCAAACTACC TCAAACTGGA
ACTGCAGTTG TTACTGTTGG GGGAACAGAA AACAAGGAGA AATTATTATC TTCAATTGCA
AAACTAAAAC AATTAGGATT CAAGATTATG GCAACAGAAC ATACTGCAGA ATTCTTTGAA
GAAAAAGTTG GTGGTATAGA AATCATTCAC AAGATTTCAG AACCTGAACG TCTACCAAAC
ATTGCAGATA TGCTTTATGA AAGGAAGATA GACTTTATCA TAAACATCCC AAGTACTTCT
ACAATTGAAA AATATGTTGG AATGCTTGAT GATGAATATC AAATTAGAAG AAAGGCAATT
GAACTTGGAA TTCCAGTGCT AACTACAATA GAACTTGCTG ATTCATTTGT TAAGACCCTT
GAATGGTTAC AACATAATGA AACAACAAAA GATCCTATTG AACCATATGA CCCTATTGAA
TAA
 
Protein sequence
MPKNESLNKI LVLGSGAIKI GEAGEFDYSG SQCLKAIHED GLKSVLINPN IATIQTDTRF 
ADQVYLLPVN PQYVESVVEK ERPDGIMLAY GGQTALNCGV KLEEAGILQK YGVKVLGTQV
QGIKNTEDRQ LFKDQMTEAG VPVLKSKTVT NVDDAKKVAE ELNYPVIVRV AYTLGGRGGG
IAHNEIELHE IVERGLNASL VGQVLVEEYI GHWKQIEYEV MQDYDGNNVI VCNMENVLSM
KVHTGDNIVV APSQTINNHE YHMLRSAALR ATKHVGIVGE CNIQYALDSD SDRYVAIEIN
PRLSRSSALA SKATGYPLAY MSAKIGLGYS LSELVNRITK STTACFEPSL DYVVCKHPRW
DFSKFEQVNR KLGVTMKSVG EVMAVGRTFE ESLQKAIRML AIGNDGLVLN RANGKKYTEE
EIEFKLSHHD DEILYNVAIA LKMGISVERI YKLSAIDPWF IDKIQNILNA EAKIKESELD
KSLMWDIKKL GFSDNQIARA KGSTPDEVRE IRKELGVVPS VKQIDTLAAE WPAVTNYLYL
TYGGHSHDIE IPKDDPGIVV VGAGPYRIGS SVEFDWGTVN MVWGLQENGE KNVSVVNCNP
ETVSTDYDIC TRLYFEELTQ ERLLDITDFE NPKGVITCVG GQTANNLTPG LAERGINILG
TSAKDVDRAE DRSKFSAELD KLHIGQPRWQ AFSNLNEAKS FAQEVGFPVI VRPSYVLSGA
AMKVVWSQEE LKTYVKEATD VSPDHPVVIS KFMLNSLEVD VDGISNGKEV VIGAIVEHID
SAGVHSGDAM MCIPPWRLSN KIIETITDYT KRIALTFNVK GPFNLQFLIN NDQVYVIELN
IRASRSMPFV SKLVKMNLIS LASKAILDKP LPKIPENKWQ KIHNYGIKVP QFSFMQLDGA
DIALGVEMQS TGEAACFGNS FHDALAKGLT SVGIKLPQTG TAVVTVGGTE NKEKLLSSIA
KLKQLGFKIM ATEHTAEFFE EKVGGIEIIH KISEPERLPN IADMLYERKI DFIINIPSTS
TIEKYVGMLD DEYQIRRKAI ELGIPVLTTI ELADSFVKTL EWLQHNETTK DPIEPYDPIE
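
The gene and protein sequences above are printed in blocks of ten characters, so the spaces have to be removed before the sequences can be used. The following is a minimal sketch, not part of the original record, of how the gene sequence could be cleaned, its GC fraction computed, and its reading frame translated. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical. The codon table is the standard genetic code, used here on the assumption that NCBI translation table 11 assigns the same amino acids to codons and differs only in which start codons it permits.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
# Assumption: translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(text.split()).upper()


def gc_content(text):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean_sequence(text)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record with its spacing.
first_line = "ATGCCAAAAA ACGAATCACT AAACAAAATT CTTGTTTTGG GTAGTGGCGC CATTAAAATT"
print(translate(first_line))  # MPKNESLNKILVLGSGAIKI
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the 37% reported in the Gene Information fields.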