Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1085 |
Symbol | |
ID | 5774497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 986566 |
End bp | 989808 |
Gene Length | 3243 bp |
Protein Length | 1080 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641316727 |
Product | carbamoyl-phosphate synthase large subunit |
Protein accession | YP_001582419 |
Protein GI | 161528593 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG0439] Biotin carboxylase [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAAA ACGAATCACT AAACAAAATT CTTGTTTTGG GTAGTGGCGC CATTAAAATT GGTGAAGCCG GAGAATTTGA TTATTCTGGA AGTCAGTGTC TTAAAGCAAT TCATGAAGAC GGACTCAAAA GCGTTCTAAT TAATCCAAAC ATTGCAACAA TTCAAACTGA TACAAGATTT GCAGACCAAG TGTATCTTTT ACCTGTTAAT CCACAATATG TTGAATCTGT TGTAGAAAAA GAAAGACCTG ATGGAATCAT GTTGGCCTAT GGTGGTCAGA CTGCACTGAA TTGTGGTGTT AAACTAGAAG AAGCGGGAAT TTTGCAAAAA TATGGTGTCA AAGTGCTTGG AACTCAGGTT CAAGGTATCA AAAATACTGA AGATAGGCAG CTTTTCAAGG ATCAAATGAC TGAAGCAGGC GTTCCTGTAC TCAAGAGTAA AACTGTGACT AATGTCGATG ATGCAAAGAA GGTAGCAGAG GAACTGAACT ACCCTGTCAT TGTCCGAGTA GCCTATACCC TAGGAGGTCG TGGCGGAGGA ATTGCCCATA ATGAAATTGA ACTCCATGAA ATCGTTGAAC GTGGTCTTAA TGCAAGTCTT GTAGGCCAAG TTCTAGTTGA AGAATACATT GGTCATTGGA AACAAATCGA ATATGAAGTA ATGCAAGACT ATGATGGCAA TAATGTAATT GTCTGTAACA TGGAAAATGT TCTTTCAATG AAAGTTCACA CTGGTGATAA CATTGTAGTT GCACCTTCTC AAACAATCAA CAACCATGAA TATCACATGT TGCGTTCAGC TGCGTTACGT GCAACAAAAC ATGTTGGAAT TGTTGGTGAA TGTAATATTC AATATGCATT AGATTCAGAT TCAGACAGAT ATGTTGCAAT CGAAATCAAC CCTCGTCTAT CTCGTTCTTC TGCACTTGCA AGTAAAGCTA CAGGCTATCC ACTTGCATAC ATGTCTGCAA AAATTGGATT AGGTTATAGT TTGTCTGAAC TAGTAAACAG AATTACAAAA AGTACAACTG CATGCTTTGA GCCTTCACTT GATTATGTTG TTTGCAAACA TCCAAGATGG GACTTTTCAA AGTTTGAACA AGTCAATAGA AAACTTGGAG TTACGATGAA ATCCGTTGGT GAAGTCATGG CAGTTGGACG AACATTTGAA GAATCATTAC AAAAAGCAAT CAGAATGCTT GCAATTGGAA ATGATGGATT GGTATTGAAT CGTGCTAATG GCAAAAAATA CACTGAAGAA GAAATTGAAT TCAAATTATC TCATCATGAT GATGAGATTT TGTACAATGT TGCAATTGCC TTGAAAATGG GGATTTCAGT TGAGAGAATC TACAAACTTT CTGCAATTGA TCCTTGGTTC ATTGATAAAA TACAAAATAT CCTTAACGCG GAGGCCAAAA TCAAGGAATC TGAACTAGAC AAATCCTTGA TGTGGGATAT CAAAAAACTA GGCTTCTCTG ATAATCAAAT TGCCCGTGCA AAAGGAAGCA CTCCTGATGA AGTGCGTGAA ATACGCAAGG AATTAGGCGT GGTTCCATCT GTAAAGCAGA TTGACACCCT TGCAGCAGAA TGGCCTGCAG TTACCAATTA TCTATACCTA ACATATGGTG GACACTCTCA TGACATTGAA ATTCCAAAAG ATGATCCAGG AATTGTTGTA GTTGGTGCGG GACCATATAG AATCGGTAGT AGCGTAGAGT TTGATTGGGG AACAGTAAAC ATGGTTTGGG GATTGCAAGA GAATGGAGAA AAGAATGTCT CAGTTGTAAA CTGTAATCCT GAAACAGTAT CAACTGATTA TGATATCTGT ACAAGACTGT ACTTTGAAGA ACTTACACAA GAAAGATTAC TTGACATTAC TGACTTTGAG AATCCAAAAG GAGTCATTAC ATGTGTAGGT GGACAAACAG CAAACAATCT GACTCCTGGA CTAGCAGAAC GTGGAATCAA TATTTTAGGA ACATCAGCAA AAGATGTTGA CAGAGCTGAA GACCGTTCAA AGTTTAGTGC AGAATTAGAT AAACTACACA TTGGTCAACC AAGATGGCAA GCGTTCTCAA ACCTTAATGA AGCAAAATCA TTTGCACAAG AAGTGGGATT TCCTGTAATA GTTAGACCAT CTTATGTTTT ATCTGGAGCT GCAATGAAAG TAGTTTGGTC GCAAGAAGAA CTCAAAACAT ATGTCAAAGA AGCAACTGAT GTATCCCCTG ATCATCCGGT TGTAATTTCA AAATTCATGT TAAACTCATT AGAAGTTGAT GTTGATGGAA TCAGTAATGG AAAAGAAGTT GTTATTGGCG CAATAGTTGA ACATATTGAT TCAGCTGGTG TACACTCTGG TGATGCAATG ATGTGTATTC CTCCATGGAG ATTAAGCAAC AAAATTATCG AAACAATTAC TGATTATACT AAACGAATTG CATTGACCTT TAATGTTAAA GGGCCATTTA ACCTGCAATT CTTGATAAAC AATGATCAAG TCTATGTTAT AGAACTGAAC ATACGTGCAT CACGTTCTAT GCCATTTGTC TCAAAATTAG TCAAAATGAA CCTAATTTCA CTTGCCTCAA AGGCTATTTT GGACAAACCG TTACCTAAAA TCCCTGAAAA CAAGTGGCAG AAAATCCATA ATTATGGAAT CAAAGTTCCA CAATTTTCAT TCATGCAGCT AGATGGTGCA GACATTGCAT TGGGTGTAGA GATGCAGTCT ACTGGTGAAG CTGCTTGCTT TGGAAATAGC TTCCATGATG CACTTGCAAA AGGTTTGACA TCAGTTGGAA TCAAACTACC TCAAACTGGA ACTGCAGTTG TTACTGTTGG GGGAACAGAA AACAAGGAGA AATTATTATC TTCAATTGCA AAACTAAAAC AATTAGGATT CAAGATTATG GCAACAGAAC ATACTGCAGA ATTCTTTGAA GAAAAAGTTG GTGGTATAGA AATCATTCAC AAGATTTCAG AACCTGAACG TCTACCAAAC ATTGCAGATA TGCTTTATGA AAGGAAGATA GACTTTATCA TAAACATCCC AAGTACTTCT ACAATTGAAA AATATGTTGG AATGCTTGAT GATGAATATC AAATTAGAAG AAAGGCAATT GAACTTGGAA TTCCAGTGCT AACTACAATA GAACTTGCTG ATTCATTTGT TAAGACCCTT GAATGGTTAC AACATAATGA AACAACAAAA GATCCTATTG AACCATATGA CCCTATTGAA TAA
|
Protein sequence | MPKNESLNKI LVLGSGAIKI GEAGEFDYSG SQCLKAIHED GLKSVLINPN IATIQTDTRF ADQVYLLPVN PQYVESVVEK ERPDGIMLAY GGQTALNCGV KLEEAGILQK YGVKVLGTQV QGIKNTEDRQ LFKDQMTEAG VPVLKSKTVT NVDDAKKVAE ELNYPVIVRV AYTLGGRGGG IAHNEIELHE IVERGLNASL VGQVLVEEYI GHWKQIEYEV MQDYDGNNVI VCNMENVLSM KVHTGDNIVV APSQTINNHE YHMLRSAALR ATKHVGIVGE CNIQYALDSD SDRYVAIEIN PRLSRSSALA SKATGYPLAY MSAKIGLGYS LSELVNRITK STTACFEPSL DYVVCKHPRW DFSKFEQVNR KLGVTMKSVG EVMAVGRTFE ESLQKAIRML AIGNDGLVLN RANGKKYTEE EIEFKLSHHD DEILYNVAIA LKMGISVERI YKLSAIDPWF IDKIQNILNA EAKIKESELD KSLMWDIKKL GFSDNQIARA KGSTPDEVRE IRKELGVVPS VKQIDTLAAE WPAVTNYLYL TYGGHSHDIE IPKDDPGIVV VGAGPYRIGS SVEFDWGTVN MVWGLQENGE KNVSVVNCNP ETVSTDYDIC TRLYFEELTQ ERLLDITDFE NPKGVITCVG GQTANNLTPG LAERGINILG TSAKDVDRAE DRSKFSAELD KLHIGQPRWQ AFSNLNEAKS FAQEVGFPVI VRPSYVLSGA AMKVVWSQEE LKTYVKEATD VSPDHPVVIS KFMLNSLEVD VDGISNGKEV VIGAIVEHID SAGVHSGDAM MCIPPWRLSN KIIETITDYT KRIALTFNVK GPFNLQFLIN NDQVYVIELN IRASRSMPFV SKLVKMNLIS LASKAILDKP LPKIPENKWQ KIHNYGIKVP QFSFMQLDGA DIALGVEMQS TGEAACFGNS FHDALAKGLT SVGIKLPQTG TAVVTVGGTE NKEKLLSSIA KLKQLGFKIM ATEHTAEFFE EKVGGIEIIH KISEPERLPN IADMLYERKI DFIINIPSTS TIEKYVGMLD DEYQIRRKAI ELGIPVLTTI ELADSFVKTL EWLQHNETTK DPIEPYDPIE
|
| |