Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_1632 |
Symbol | carB |
ID | 8411155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 1553575 |
End bp | 1556862 |
Gene Length | 3288 bp |
Protein Length | 1095 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645019959 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_003177453 |
Protein GI | 257387680 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGAGA GCGAAGACCG CACAATCTTG CTCATCGGGA GTGGCCCGAT CCAGATCGGA CAGGCGGCCG AGTTCGACTA CTCCGGGGCA CAGGCCTGTC GCGCCCTGCA GGAGGAGGGC GCGCGGGTCG TCCTGGTGAA CTCGAATCCG GCGACGATCA TGACCGACCC GGAGATGGCC GACAAGGTGT ATCTCGAACC GATCAACACC GAGGCGATCT CCGAGATCAT CCGGAAGGAA GATCCCGACG GCGTCATCGC CGGTCTCGGC GGTCAGACCG GCCTCAACGT CACGGCCGAG CTGGCCGAGG AGGGGGTCCT CGAAGAGCAC GACGTCGAGA TCATGGGCAC ACCGCTGGAC ACGATCTACG CGACGGAGGA TCGTGACCTC TTCAAACAGC GCATGGAGGA AATCGGCGAG CCGGTGCCGT CTTCGACGAC GATCACCCTC GACGAGGGTG AGACCGTCAC CGAACTGACC GAGGAGAGCC TGCGCGACCG CGTCGAGGAC GCCGTCGACG AGGTCGGCGG CCTCCCGGTC ATCGCACGCA CGACGTACAC GCTCGGTGGC TCTGGCTCCG GCGTCGTCCA CGAGATGGAG AAACTCATCG AGCGCGTCCG CAAGGGGCTG CGCCTCTCTC GGAACAGCGA GGTGCTGATC ACCGAGTCCA TCGAGGGGTG GGTCGAGCTG GAGTACGAGG TGATGCGCGA CGCCGACGAC TCGTGTATCA TCATCTGTAA CATGGAGAAC ATCGACCCCA TGGGGATCCA CACCGGCGAG TCGACGGTCG TCACGCCCTC GCAGGTCATC CCCGACGAGG GCCACCAGGC GATGCGCGAC TCCGCGCTGA AGGTCATCCG CGAACTCGGG ATCGAGGGCG GCTGTAACAT CCAGCACGCC TGGCGCGACG ACGGCACGCC CGGCGGCGAG TACCGCGTCG TCGAGGTCAA CCCACGGGTC TCGCGATCCT CGGCGCTGGC CTCGAAGGCG ACGGGCTACC CGATCGCTCG CGTCACCGCG AAGGTCGCGA TGGGCAAACG CCTCCACGAG ATCGACAACG AGATCACCGG CGAGACCACG GCGGCCTTCG AGCCCGCCAT CGACTACGTC GTCACGAAGG TCCCGCGCTG GCCGATCGAC AAGTTCCGCG ACGTGGAGTT CGAGCTGTCG ACGGCGATGA AATCGACCGG CGAGGCGATG GCCATCGGCC GCACCTTCCC CGAGTCGATG CTGAAGGCGT TGCGGTCCTC GGAGTACGAC CCCGCCGTCG ACTGGGGCGA GGTCGACGAC GACGAGCTCG AAGAGGAGTA CCTGATCCGC CCGACGCCGG ATCGCCCCTA CGCCATCTTC GAGGCCTTCG CCCGCGGGTA CACGGTCGAC GAGATCGTCG AGCTGACCGA CATCGAGCGG TGGTACGTCG AGCGCTTCCA GCGGATCGAA GAAGCCGCCG AGGCCGCGCA GAACGGCGAC TTCGCGACCG CGGCCGAGGC CGGCTTCACC GACCAGGAGA TCACCGCGAT GGCCGGCGGC GAGTTCAACG ACACGCACGC CTCCTGGGTG CCGGAGGGTA ATCTCAGCGA GAAAGGCGAC GAGATCGAAG CCGCTACAGA CGGGTCGGGT GTCACCGTCG AGGACGTCGA GTCAGAGACG GTCGATCGCG ACTTCAAGCT CGTCGACACC TGTGCCGGCG AGTTCGAGGC GACGACGCCG TACTACTACT CGACGCGAGA TCCCGTCTCG GGGATCGACC GCAACGAACT GCAGATCGAC CCCGACGTAG AGAGCGTCGT CGTGGTCGGT GGCGGCCCGA TCCGGATCGG GCAGGGCGTC GAGTTCGACT ACTGTTCGGT CCACGCGGTC CGCGCGCTCG AAGAGGCCGG CATCGACGCC CACGTGGTCA ACAACAACCC CGAGACCGTC TCGACGGACT ACGACACCAG TGATGGCCTG TTCTTCGAGC CGGTCACGGC CGAGGAAGTC GCCGACGTGG TCGAAGCCAC GAACGCCGAC GGCGTGATGG TCCAGTTCGG CGGCCAGACC TCCGTCGACA TCGGCCAGCC GCTCGAACGG GAACTCGACC GCCGCGGCCT CGACTGTGAG ATCATGGGCA CTGCGGTCGA CGCGATGGAC CTCGCGGAGG ACCGCGACCG GTTCAACCAG CTGATGGACG AGCTGGGTAT CGCACAGGCA GAAGGCGGGA CGGCGACCAG CAAGGAGGAG GCGCTGGACC TGGCTCACGA CATCGGCTAC CCCGTCCTGG TTCGTCCGAG CTACGTGCTC GGCGGCCGCG CGATGGACGT GGTGTACAAC GACGAGGACC TCGAAACCTA CATCGAGGAG GCCGTCCGCG TCTCCCCGGA CAAGCCGATC CTCGTGGACG ACTTCCTCGC GGACGCGATC GAGTTAGACG TCGACGCCGT CGCCGACGTC GGCGCGTCCC ACGCGCCTGC AAGCGAGACG GAGTCTCGCC CTGGCGAGAA CGTGCTGATC GGCGGCGTCA TGGAGCACGT CGAGACCGCG GGCGTCCACT CCGGTGACTC CGCGTGTATG ATCCCGCCGC GCTCCCAGGA GATCAAAGAC GTGATGCCCC GCATCCGCGA GGTGACGGAA GACATCGCGT CGGCCCTGGA GACCGAGGGA CTGCTCAACG TCCAGCTCGC GGTCCGGGAC GGCGAGGTGT ACGTGCTCGA AGCCAACCCG CGTTCCTCCC GAACCGTGCC GTTCATCGCG AAGACGACGG GCGTTCCCCT GGCCAAGATC GCGGCGAAGG TCATGGCCGG CGCGAGCCTC GAAGAACTCG ATTACGAGGA ACAGATCCCG GATCAGGTCT CGGTCAAGGA GGTCGTCCTG CCCTTCGACC GCCTGCCGGG CTCGGATCCG CGCCTCGGCC CGGAGATGAA GTCCACCGGC GAGGTCATGG GGACGGCCGG CTCCTTCGGC AAGGCCTACC AGAAGGCCCA GATGTCCGTC GACAAGCCGA TCCCGCTGGA CGGGACGGCG CTGGTCGACA TGCCGATCAT CGGCTTCGAG GACCACTTCG ACGTGCTCGA TTTCGACGAC TTCGAGGACG TCGACGCCAT CGTCGAAGCG ATCCAGAACG GCGAGATCGA CATGGTGCTG TCGCGCAACC GCGACGTGCT AGAAGCGTGT GTCGAAGAGA CGGTCACGTA CTTCTCGACG CGCGAGTCCG CCGAGGCCGC CCTCGAAGCG ATCAATGCGA ACGACCAGCC CCTGAACGTC CAGGACATCG CCTCGCGGCC CAAGACCCAG CGCGAGTGGG GCCGCTGA
|
Protein sequence | MTESEDRTIL LIGSGPIQIG QAAEFDYSGA QACRALQEEG ARVVLVNSNP ATIMTDPEMA DKVYLEPINT EAISEIIRKE DPDGVIAGLG GQTGLNVTAE LAEEGVLEEH DVEIMGTPLD TIYATEDRDL FKQRMEEIGE PVPSSTTITL DEGETVTELT EESLRDRVED AVDEVGGLPV IARTTYTLGG SGSGVVHEME KLIERVRKGL RLSRNSEVLI TESIEGWVEL EYEVMRDADD SCIIICNMEN IDPMGIHTGE STVVTPSQVI PDEGHQAMRD SALKVIRELG IEGGCNIQHA WRDDGTPGGE YRVVEVNPRV SRSSALASKA TGYPIARVTA KVAMGKRLHE IDNEITGETT AAFEPAIDYV VTKVPRWPID KFRDVEFELS TAMKSTGEAM AIGRTFPESM LKALRSSEYD PAVDWGEVDD DELEEEYLIR PTPDRPYAIF EAFARGYTVD EIVELTDIER WYVERFQRIE EAAEAAQNGD FATAAEAGFT DQEITAMAGG EFNDTHASWV PEGNLSEKGD EIEAATDGSG VTVEDVESET VDRDFKLVDT CAGEFEATTP YYYSTRDPVS GIDRNELQID PDVESVVVVG GGPIRIGQGV EFDYCSVHAV RALEEAGIDA HVVNNNPETV STDYDTSDGL FFEPVTAEEV ADVVEATNAD GVMVQFGGQT SVDIGQPLER ELDRRGLDCE IMGTAVDAMD LAEDRDRFNQ LMDELGIAQA EGGTATSKEE ALDLAHDIGY PVLVRPSYVL GGRAMDVVYN DEDLETYIEE AVRVSPDKPI LVDDFLADAI ELDVDAVADV GASHAPASET ESRPGENVLI GGVMEHVETA GVHSGDSACM IPPRSQEIKD VMPRIREVTE DIASALETEG LLNVQLAVRD GEVYVLEANP RSSRTVPFIA KTTGVPLAKI AAKVMAGASL EELDYEEQIP DQVSVKEVVL PFDRLPGSDP RLGPEMKSTG EVMGTAGSFG KAYQKAQMSV DKPIPLDGTA LVDMPIIGFE DHFDVLDFDD FEDVDAIVEA IQNGEIDMVL SRNRDVLEAC VEETVTYFST RESAEAALEA INANDQPLNV QDIASRPKTQ REWGR
|
| |