Gene Hmuk_1632 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1632 
SymbolcarB 
ID8411155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1553575 
End bp1556862 
Gene Length3288 bp 
Protein Length1095 aa 
Translation table11 
GC content66% 
IMG OID645019959 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_003177453 
Protein GI257387680 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAGA GCGAAGACCG CACAATCTTG CTCATCGGGA GTGGCCCGAT CCAGATCGGA 
CAGGCGGCCG AGTTCGACTA CTCCGGGGCA CAGGCCTGTC GCGCCCTGCA GGAGGAGGGC
GCGCGGGTCG TCCTGGTGAA CTCGAATCCG GCGACGATCA TGACCGACCC GGAGATGGCC
GACAAGGTGT ATCTCGAACC GATCAACACC GAGGCGATCT CCGAGATCAT CCGGAAGGAA
GATCCCGACG GCGTCATCGC CGGTCTCGGC GGTCAGACCG GCCTCAACGT CACGGCCGAG
CTGGCCGAGG AGGGGGTCCT CGAAGAGCAC GACGTCGAGA TCATGGGCAC ACCGCTGGAC
ACGATCTACG CGACGGAGGA TCGTGACCTC TTCAAACAGC GCATGGAGGA AATCGGCGAG
CCGGTGCCGT CTTCGACGAC GATCACCCTC GACGAGGGTG AGACCGTCAC CGAACTGACC
GAGGAGAGCC TGCGCGACCG CGTCGAGGAC GCCGTCGACG AGGTCGGCGG CCTCCCGGTC
ATCGCACGCA CGACGTACAC GCTCGGTGGC TCTGGCTCCG GCGTCGTCCA CGAGATGGAG
AAACTCATCG AGCGCGTCCG CAAGGGGCTG CGCCTCTCTC GGAACAGCGA GGTGCTGATC
ACCGAGTCCA TCGAGGGGTG GGTCGAGCTG GAGTACGAGG TGATGCGCGA CGCCGACGAC
TCGTGTATCA TCATCTGTAA CATGGAGAAC ATCGACCCCA TGGGGATCCA CACCGGCGAG
TCGACGGTCG TCACGCCCTC GCAGGTCATC CCCGACGAGG GCCACCAGGC GATGCGCGAC
TCCGCGCTGA AGGTCATCCG CGAACTCGGG ATCGAGGGCG GCTGTAACAT CCAGCACGCC
TGGCGCGACG ACGGCACGCC CGGCGGCGAG TACCGCGTCG TCGAGGTCAA CCCACGGGTC
TCGCGATCCT CGGCGCTGGC CTCGAAGGCG ACGGGCTACC CGATCGCTCG CGTCACCGCG
AAGGTCGCGA TGGGCAAACG CCTCCACGAG ATCGACAACG AGATCACCGG CGAGACCACG
GCGGCCTTCG AGCCCGCCAT CGACTACGTC GTCACGAAGG TCCCGCGCTG GCCGATCGAC
AAGTTCCGCG ACGTGGAGTT CGAGCTGTCG ACGGCGATGA AATCGACCGG CGAGGCGATG
GCCATCGGCC GCACCTTCCC CGAGTCGATG CTGAAGGCGT TGCGGTCCTC GGAGTACGAC
CCCGCCGTCG ACTGGGGCGA GGTCGACGAC GACGAGCTCG AAGAGGAGTA CCTGATCCGC
CCGACGCCGG ATCGCCCCTA CGCCATCTTC GAGGCCTTCG CCCGCGGGTA CACGGTCGAC
GAGATCGTCG AGCTGACCGA CATCGAGCGG TGGTACGTCG AGCGCTTCCA GCGGATCGAA
GAAGCCGCCG AGGCCGCGCA GAACGGCGAC TTCGCGACCG CGGCCGAGGC CGGCTTCACC
GACCAGGAGA TCACCGCGAT GGCCGGCGGC GAGTTCAACG ACACGCACGC CTCCTGGGTG
CCGGAGGGTA ATCTCAGCGA GAAAGGCGAC GAGATCGAAG CCGCTACAGA CGGGTCGGGT
GTCACCGTCG AGGACGTCGA GTCAGAGACG GTCGATCGCG ACTTCAAGCT CGTCGACACC
TGTGCCGGCG AGTTCGAGGC GACGACGCCG TACTACTACT CGACGCGAGA TCCCGTCTCG
GGGATCGACC GCAACGAACT GCAGATCGAC CCCGACGTAG AGAGCGTCGT CGTGGTCGGT
GGCGGCCCGA TCCGGATCGG GCAGGGCGTC GAGTTCGACT ACTGTTCGGT CCACGCGGTC
CGCGCGCTCG AAGAGGCCGG CATCGACGCC CACGTGGTCA ACAACAACCC CGAGACCGTC
TCGACGGACT ACGACACCAG TGATGGCCTG TTCTTCGAGC CGGTCACGGC CGAGGAAGTC
GCCGACGTGG TCGAAGCCAC GAACGCCGAC GGCGTGATGG TCCAGTTCGG CGGCCAGACC
TCCGTCGACA TCGGCCAGCC GCTCGAACGG GAACTCGACC GCCGCGGCCT CGACTGTGAG
ATCATGGGCA CTGCGGTCGA CGCGATGGAC CTCGCGGAGG ACCGCGACCG GTTCAACCAG
CTGATGGACG AGCTGGGTAT CGCACAGGCA GAAGGCGGGA CGGCGACCAG CAAGGAGGAG
GCGCTGGACC TGGCTCACGA CATCGGCTAC CCCGTCCTGG TTCGTCCGAG CTACGTGCTC
GGCGGCCGCG CGATGGACGT GGTGTACAAC GACGAGGACC TCGAAACCTA CATCGAGGAG
GCCGTCCGCG TCTCCCCGGA CAAGCCGATC CTCGTGGACG ACTTCCTCGC GGACGCGATC
GAGTTAGACG TCGACGCCGT CGCCGACGTC GGCGCGTCCC ACGCGCCTGC AAGCGAGACG
GAGTCTCGCC CTGGCGAGAA CGTGCTGATC GGCGGCGTCA TGGAGCACGT CGAGACCGCG
GGCGTCCACT CCGGTGACTC CGCGTGTATG ATCCCGCCGC GCTCCCAGGA GATCAAAGAC
GTGATGCCCC GCATCCGCGA GGTGACGGAA GACATCGCGT CGGCCCTGGA GACCGAGGGA
CTGCTCAACG TCCAGCTCGC GGTCCGGGAC GGCGAGGTGT ACGTGCTCGA AGCCAACCCG
CGTTCCTCCC GAACCGTGCC GTTCATCGCG AAGACGACGG GCGTTCCCCT GGCCAAGATC
GCGGCGAAGG TCATGGCCGG CGCGAGCCTC GAAGAACTCG ATTACGAGGA ACAGATCCCG
GATCAGGTCT CGGTCAAGGA GGTCGTCCTG CCCTTCGACC GCCTGCCGGG CTCGGATCCG
CGCCTCGGCC CGGAGATGAA GTCCACCGGC GAGGTCATGG GGACGGCCGG CTCCTTCGGC
AAGGCCTACC AGAAGGCCCA GATGTCCGTC GACAAGCCGA TCCCGCTGGA CGGGACGGCG
CTGGTCGACA TGCCGATCAT CGGCTTCGAG GACCACTTCG ACGTGCTCGA TTTCGACGAC
TTCGAGGACG TCGACGCCAT CGTCGAAGCG ATCCAGAACG GCGAGATCGA CATGGTGCTG
TCGCGCAACC GCGACGTGCT AGAAGCGTGT GTCGAAGAGA CGGTCACGTA CTTCTCGACG
CGCGAGTCCG CCGAGGCCGC CCTCGAAGCG ATCAATGCGA ACGACCAGCC CCTGAACGTC
CAGGACATCG CCTCGCGGCC CAAGACCCAG CGCGAGTGGG GCCGCTGA
 
Protein sequence
MTESEDRTIL LIGSGPIQIG QAAEFDYSGA QACRALQEEG ARVVLVNSNP ATIMTDPEMA 
DKVYLEPINT EAISEIIRKE DPDGVIAGLG GQTGLNVTAE LAEEGVLEEH DVEIMGTPLD
TIYATEDRDL FKQRMEEIGE PVPSSTTITL DEGETVTELT EESLRDRVED AVDEVGGLPV
IARTTYTLGG SGSGVVHEME KLIERVRKGL RLSRNSEVLI TESIEGWVEL EYEVMRDADD
SCIIICNMEN IDPMGIHTGE STVVTPSQVI PDEGHQAMRD SALKVIRELG IEGGCNIQHA
WRDDGTPGGE YRVVEVNPRV SRSSALASKA TGYPIARVTA KVAMGKRLHE IDNEITGETT
AAFEPAIDYV VTKVPRWPID KFRDVEFELS TAMKSTGEAM AIGRTFPESM LKALRSSEYD
PAVDWGEVDD DELEEEYLIR PTPDRPYAIF EAFARGYTVD EIVELTDIER WYVERFQRIE
EAAEAAQNGD FATAAEAGFT DQEITAMAGG EFNDTHASWV PEGNLSEKGD EIEAATDGSG
VTVEDVESET VDRDFKLVDT CAGEFEATTP YYYSTRDPVS GIDRNELQID PDVESVVVVG
GGPIRIGQGV EFDYCSVHAV RALEEAGIDA HVVNNNPETV STDYDTSDGL FFEPVTAEEV
ADVVEATNAD GVMVQFGGQT SVDIGQPLER ELDRRGLDCE IMGTAVDAMD LAEDRDRFNQ
LMDELGIAQA EGGTATSKEE ALDLAHDIGY PVLVRPSYVL GGRAMDVVYN DEDLETYIEE
AVRVSPDKPI LVDDFLADAI ELDVDAVADV GASHAPASET ESRPGENVLI GGVMEHVETA
GVHSGDSACM IPPRSQEIKD VMPRIREVTE DIASALETEG LLNVQLAVRD GEVYVLEANP
RSSRTVPFIA KTTGVPLAKI AAKVMAGASL EELDYEEQIP DQVSVKEVVL PFDRLPGSDP
RLGPEMKSTG EVMGTAGSFG KAYQKAQMSV DKPIPLDGTA LVDMPIIGFE DHFDVLDFDD
FEDVDAIVEA IQNGEIDMVL SRNRDVLEAC VEETVTYFST RESAEAALEA INANDQPLNV
QDIASRPKTQ REWGR