Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0069 |
Symbol | carB |
ID | 8382329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 69742 |
End bp | 72963 |
Gene Length | 3222 bp |
Protein Length | 1073 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644971127 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_003128991 |
Protein GI | 257051158 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.503132 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGCCG ACGACCAACC GACGATCCTG TTGATCGGGA GCGGCCCGAT CCAGATCGGC CAGGCCGCAG AGTTCGACTA CTCCGGCGCA CAGGCGTGTC GCGCGCTCCA GGAGGAAGGC GCACGCGTCG TCCTGGTCAA CTCGAACCCG GCGACGATCA TGACCGACCC CGAGATGGCC GACGAGGTCT ATCTCGAACC GATCAATACC GAAGCCATTG CCGAGATCAT CCGCAAGGAG GAACCCGACG GCGTGATCGC GGGTCTGGGC GGCCAGACCG GGCTCAACGT CACCGCGGAG CTCGCTGAAG AGGGTGTCCT CGAGGAACAC GACGTCGACG TTATGGGGAC GCCGCTGGAT ACCATCTACG CGACCGAGGA CCGCGAGCAG TTCCGAAAGC GGATGGAGAA GATCGGCGAG CCGGTCCCCG CCTCGACGAC GATCAAGAGC ATGGACGAGG TCGAGGCGGC CGTCGAAGAA GTGGGTGGCC TCCCCGTCAT CATGCGGACA ACCTACACGC TCGGTGGCGC GGGATCGGGC GTCATCGGCG ACATGGACGA ACTCAAGGAA GCCACACGCA AGGGCCTGCG CCTCTCCCGG GACGACCGCG TGATGATCAC CGAGTCCATC GACGGTTGGA TCGAACTCGA GTACGAGGTG ATGCGCGACG CCGATGACTC CTGTATCATC ATCTGCAACA TGGAGAACCT GGATCCGATG GGGATCCACA CCGGGGAGTC GATGGTCGTC ACTCCCTCTC AGGTCATCCC CGACGAGGGC CACCAGGAGA TGCGCGACAC GGCGCTGAAG GTGATCCGCG AACTCGAGAT CCACGGCGGC TGTAACATCC AGTTCGCCTG GCGCGACGAT GGCTCGCCCG GCGGCGAGTA CCGCGTCGTC GAGGTCAACC CCCGCGTCTC TCGCTCGTCG GCACTTGCGT CAAAAGCGAC GGGCTACCCG ATCGCCCGCG TGACCGCGAA GGTCGCGATG GGCAAGCGCC TCCACGAGAT CGAAAACGAG ATCACCGGCG AGACCACCGC GGCCTTCGAA CCGGCCATCG ACTACGTCGT CACGAAGATC CCGCGGTGGC CGATCGACAA GTTCCGCGAC GTCGACTTCG AACTCGGGCC GGCGATGAAG TCGACCGGCG AGGCGATGTC GATCGGCCGG ACGTTCGAGG AGAGTCTGTT GAAGGCGCTA CGCTCGACAG AATACGACCC TGCAGTCAAC TGGACCGAGG TCAGCGACGA CGAACTCGAA TCGGAGTACC TCGTCCGCCC GTCGCCTGAC CGGCCATACG CGATGTTCGA GGCCTTCGAA CGCGGCTATT CGGTCGAGAC GGTCTCGGAA CTGACCGAGA TCCGCGAGTG GTACGTCGAA CGCTACAAAC GGATCGCCGA TGCCGCCGAC GCCGCCAGTG CGGGCGAACT CGATATCGCG GCCGAGGCCG GATTCACCAA CCACGAAGTC GCGACGCTCG CCAGCGGCGG TGCGTTCGAC GACACGCACG CCTCCTGGTT GCCAGACCGT CTCCTCGACG AGCGTGGCGT GGTCAGCGAC GACGGGGAGG CGACGCCACA GACCGACGGC GGCGGCGTCA GTGTCGCGGA CGTCGAAGAC GCCGCCCCCG CCCGCTCGTT CAAACAGGTC GACACCTGCG CGGGCGAGTT CGAGGCCTCG ACGCCGTACT ACTACTCGGC TCGCGAACCT CTCTCGGGGC TGGAGCGCAA CGAAGTCCAG GTCGATCCGG AGATCGAGAG CGTCGTGGTC GTCGGCGGCG GCCCGATCCG GATCGGGCAG GGCGTCGAGT TCGACTACTG TTCGGTCCAC GCCGTCCGTG CCTTAGAGGA GATCGGTATC GACGCCCACG TCGTCAACAA CAACCCCGAG ACGGTTTCGA CAGACTACGA CACGTCCGAC GGCCTGTTCT TCGAGCCGAT CACCGCCGAA GAGGTCGCCG ACGTGGTCGA GGAGACCGGG GCCGACGGCG TGATGGTCCA GTTCGGCGGT CAGACGTCGG TCGACATCGG CCATCCACTG GAGGCCGAAC TCGACCGCCG CGGACTCGAC TGTGAGGTCA TGGGGACCTC GGTCGACGCG ATGGACCTCG CGGAGGATCG CGACCGGTTC AACCGTCTGA TGGACGATCT GGGCATCGCC CAGGCGGAGG GTGGGACGGC GACCAGCGAA GCCGAGGCGC TCGAACTCGC GCGTGACATC GGTTACCCGG TGCTCGTCCG GCCGAGCTAC GTGCTCGGTG GGCGGGCCAT GGAAGTCGTC TACAACGACG ACGACCTCAA GACGTACATC GAGGAGGCCG TCCGGGTGAG TCCGGACAAG CCGATCCTCG TCGACGACTT CCTCGCGGAC GCCATCGAAC TCGACGTCGA CGCCGTGGCC GACGGCGACG ATATTCTCAT CGGCGGTGTC ATGGAACACG TCGAGACCGC CGGTGTTCAC TCAGGTGACT CGGCGTGTAT GATCCCCCCA CGTAGCGACG AGATCGAGAA CGTCATGCCG CGGATCCGTG AGGTGACCGA ACAGATCGCC GGCGCCTTAG AGACGGTCGG ACTCATGAAC GTCCAGCTCG CGGTCCGGGA CGGTGAAGTC TACGTCCTCG AAGCGAATCC ACGCTCCTCA CGGACGGTTC CGTTCGTCTC GAAAGCGACG GGCGTCCCGA TCGCCAAGCT GGCTGCGAAA GTAATGGCGG GTGCGAATCT CGACGAGCTC GACGTCGCCG AGCAACAGCC CGAGCAGGTC AGCGTCAAGG AAGTCGTCCT GCCGTTCGAT CGCCTCCCGG GTTCGGACCC GCGTCTCGGC CCCGAGATGA AGTCCACGGG TGAAGTCATG GGTACGGCCG GGAGCTTCGG GAAGGCCTAC CAGAAGGCCC AAATGGCGGT CGACAAGCCG ATCCCGCTTT CGGGCACTGC CATCGTCGAT CTGCCGATCG TCGGCTTCGA GGAACACTTC GAGACGCTTG ATCTCGATGA CTTCGACTCC CAGGCCGAGC TCGAGGCTGC ACTGCGGGAC GGCGAAATCG ATCTCGTGAT CTCCCGGAAT CGTGACGTTC TCGAAGTGTG TGTCGAGGAG TCGACTACCT ACTTCTCCAC GATCGCAAGC GGTAAGGCGG CCCTCGAGGC TATCGAGTCG GACGACCAGC CGCTGTCGGT CCAGGACGTC GCGACCCGTC CGAAAACGAC TCGGAAGTGG GGCGACAACT GA
|
Protein sequence | MTADDQPTIL LIGSGPIQIG QAAEFDYSGA QACRALQEEG ARVVLVNSNP ATIMTDPEMA DEVYLEPINT EAIAEIIRKE EPDGVIAGLG GQTGLNVTAE LAEEGVLEEH DVDVMGTPLD TIYATEDREQ FRKRMEKIGE PVPASTTIKS MDEVEAAVEE VGGLPVIMRT TYTLGGAGSG VIGDMDELKE ATRKGLRLSR DDRVMITESI DGWIELEYEV MRDADDSCII ICNMENLDPM GIHTGESMVV TPSQVIPDEG HQEMRDTALK VIRELEIHGG CNIQFAWRDD GSPGGEYRVV EVNPRVSRSS ALASKATGYP IARVTAKVAM GKRLHEIENE ITGETTAAFE PAIDYVVTKI PRWPIDKFRD VDFELGPAMK STGEAMSIGR TFEESLLKAL RSTEYDPAVN WTEVSDDELE SEYLVRPSPD RPYAMFEAFE RGYSVETVSE LTEIREWYVE RYKRIADAAD AASAGELDIA AEAGFTNHEV ATLASGGAFD DTHASWLPDR LLDERGVVSD DGEATPQTDG GGVSVADVED AAPARSFKQV DTCAGEFEAS TPYYYSAREP LSGLERNEVQ VDPEIESVVV VGGGPIRIGQ GVEFDYCSVH AVRALEEIGI DAHVVNNNPE TVSTDYDTSD GLFFEPITAE EVADVVEETG ADGVMVQFGG QTSVDIGHPL EAELDRRGLD CEVMGTSVDA MDLAEDRDRF NRLMDDLGIA QAEGGTATSE AEALELARDI GYPVLVRPSY VLGGRAMEVV YNDDDLKTYI EEAVRVSPDK PILVDDFLAD AIELDVDAVA DGDDILIGGV MEHVETAGVH SGDSACMIPP RSDEIENVMP RIREVTEQIA GALETVGLMN VQLAVRDGEV YVLEANPRSS RTVPFVSKAT GVPIAKLAAK VMAGANLDEL DVAEQQPEQV SVKEVVLPFD RLPGSDPRLG PEMKSTGEVM GTAGSFGKAY QKAQMAVDKP IPLSGTAIVD LPIVGFEEHF ETLDLDDFDS QAELEAALRD GEIDLVISRN RDVLEVCVEE STTYFSTIAS GKAALEAIES DDQPLSVQDV ATRPKTTRKW GDN
|
| |