Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3296 |
Symbol | |
ID | 8743916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 3388418 |
End bp | 3391642 |
Gene Length | 3225 bp |
Protein Length | 1074 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646513879 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_003404833 |
Protein GI | 284166554 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACGG ACCAGCCAAG CGAGGGCGAG ACAGGGGACG GACGCACGAT TCTGCTGATC GGGAGCGGCC CGATCCAGAT CGGACAGGCA GCCGAGTTCG ACTACTCCGG CGCACAGGCC TGCCGGGCGC TACAGGAGGA GGGCGCCCGC GTCGTCCTCG TCAACTCCAA CCCCGCGACG ATCATGACGG ACCCGGAGAT GGCCGACCGC GTTTACATCG AGCCGATCAC GACCGAGGCC ATCGCCGAGA TCATCCGCAA GGAGCAACCC GACGGCGTCA TCGCCGGGCT GGGTGGCCAG ACCGGTCTGA ACGTCACGGC CGAACTGGCC GAAGAGGGCG TTCTCGAGGA GTACGATGTC GATATCATGG GGACCCCCCT CGATACGATC TACGCCACGG AGGACCGCGA CCTCTTCCGC CAGCGCATGG AGAAGATCGG CCAGCCGGTT CCGAAGTCGA CGACCATCTC GCTCGAGGAG GGCGAGCAGG TCTCGGAACT GACCGAGGAG GACCTCAGAG AGCGCGTCGA GGCCGCCGTC GAGGAGGTCG GCGGGCTGCC GGTCATCGCC CGCACCACGT ACACGCTGGG CGGCTCCGGC TCGGGCGTCG TCCACGAGTT CGACGAACTG CTGGCCCGCG TCCGCAAGGG CCTGCGCCTC TCGCGTAACA GCGAGGTGCT GATCACCGAG TCCATCGCGG GCTGGGTCGA GTACGAGTAC GAGGTCATGC GCGACGCCGA CGACTCCTGT ATCATCATCT GCAACATGGA GAACATCGAC CCCATGGGAA TCCACACCGG GGAGTCGACG GTCGTCACGC CCTCCCAGAT CGTCCCCGAC GAGGGCCACC AGGAGATGCG CACCGCCGCG CTCGACGTCA TCCGCGAACT CGGCATTCAG GGCGGCTGTA ACATTCAGTT CGCCTGGCAC GACGACGGCA CGCCAGGCGG CGAGTACCGC GTCGTCGAGG TCAACCCGCG CGTCTCCCGC TCCTCCGCGC TGGCCTCCAA GGCGACCGGC TACCCGATCG CCCGCGTGAC CGCGAAGGTC GCACTCGGCA AGCGCCTCCA CGAGATCGAG AACGAAATTA CCGGCGAGAC CACCGCCGCC TTCGAGCCCG CGATCGACTA CGTGGTTACG AAGGTGCCGC GCTGGCCCAA AGACAAGTTC GACGACGTCG ACTTCGAGCT GACGACGGCG ATGAAGTCGA CCGGCGAGGC GATGGCCATC GGCCGAACCT TCGAGGAGAG CCTCCTCAAG GCGCTTCGCT CGAGCGAGTA CGAACCCGAC GTCGACTGGG CCGAGGTCAG CGACGAGGAA CTCGAGGAGC AGTATCTGGA GCGTCCCTCC CCTGACCGCC CGTACGCGAT GTTCGAAGCC TACGAGCGCG GTTACACGGT CGACGAGGTC GTTGAACTGA CGGGCATCTT CGAGTGGTAC GCCGAGCGCT TCAAGCGCAT CGCCGACTCG TCGCTCGCCG CCCAGGAGGG CGACTTCACC GAGGCCGCGA TCGCCGGCCA CACCAACGCG TCGATCGCCG CGACGGCCGG CGCCGACGTC GACACCGTCG AACAGGAGGT GCCGGGTCGC ACCTACAAGC AGGTCGACAC CTGCGCCGGC GAGTTCGAGG CCGAGACGCC CTACTACTAC TCCGCCCGCA AGAACGAGTA CGAGAAAGGG CCGCTGCTGG GCGACGCCGC GTCGGGCGAG CTCGAGGTCG ACCGCGACTT AGAGAGCGTG ATCGTCGTCG GCGGCGGACC GATCCGCATC GGACAGGGCG TCGAGTTCGA CTACTGTTCG GTCCACGCGG TCCAGGCGCT TCGCGACATG GGCATCGAGG CCCACGTCGT CAACAACAAC CCCGAGACCG TCTCGACCGA CTACGACACC TCCGACGGCC TCTTCTTCGA GCCGATCACG GCCGAAGAGG TCGCCGACGT CGCCGAGGCG ACCGGCGCCG ACGGCGTGAT GGTCCAGTTC GGCGGCCAGA CCTCCGTCAA CATCGGCGAA CCGCTCGAGG ACGAACTCGA GCGCCGCGGG CTCGACTGTA CGGTCATGGG CACCTCCGTC GAGGCGATGG ACTTAGCCGA GGACCGCGAC CGCTTCAACG CCCTGATGGA CGATCTGGGC ATCGCCCAGC CGGAAGGCGG CGCCGCCCAC AGCAAGGAAG AGGCCATGCA GCTGGCCCAC GAGATCGGCT ACCCGGTCCT CGTGCGCCCC TCCTACGTGC TCGGCGGCCG CGCGATGGAC GTCGTCTACG ACGACGCCGA ACTCGAGACC TACATCGAGG AGGCCGTCCG CGTCTCCCCG GACAAGCCGA TTCTGGTTGA CGACTTCCTC GAGGACGCGG TCGAACTCGA CGTCGACGCC GTGGCCGACG GCGAGGACGT TCTGATCGGC GGCGTGATGG AACACGTCGA GGCCGCGGGG GTCCACTCCG GCGACTCGGC CTGTATGATC CCGCCGCGCT CGCTGGACGA CGAGACGCTC GAGCGCGTTC GCGAGGTCAC CGAGGACATC GCGACGGCGC TGGACACCGT CGGGCTGCTC AACGTCCAGC TCGCGGTCCG CGACGGCGAG GTCTACGTCC TCGAGGCGAA CCCGCGCTCC TCGCGTACCG TCCCCTTCAT CTCGAAGGCC ACGGGCGTCC CGATCGCCAA GATCGCCGCG AAGGTCATGG CCGGCGAGAC GCTGGCGGAC CTCGCGATCG AGGAACAGAT CCCCGAGCAG ACCTCGATCA AGGAGGTCGT CCTGCCGTTC GACCGCCTGC CGGGCTCGGA CCCGCGTCTC GGCCCGGAGA TGAAGTCCAC CGGCGAAGTC ATGGGCAGCG CCGACACCTT CGGCAAGGCC TACGACAAGG CCCAGGACGC GACGAACAAG CCGATTCCCG AGTCGGGTAC CGCGATCATC GACCTCTCGG CCGACAAGTT CCCGGACCCG GAGACCGAGG AGGGCGAGGC ACTGGTCGAC GGCTTCACCG AGTACTTCGA CCTCTGCGAG GAAGTCGACC TCGCACAGGC CGTCCGCGAG GGCAAGGTCG ACCTCATCGT CTCCCGCGAC CGCGACCTGC TCGAGGTCGC CGTCGAGGAG GAAATCACCT ACTTCTCGAC GCCCGCCAGC GCGTCCGCTG CGCTCGAGGC GCTCGAGGCG AAGGACGAAC CGATCGACGT GCAGGCGATC ACCGACCGTC CGAAGCGCAC CGCCGAGTGG GGCCGCTCGG ACTGA
|
Protein sequence | MSTDQPSEGE TGDGRTILLI GSGPIQIGQA AEFDYSGAQA CRALQEEGAR VVLVNSNPAT IMTDPEMADR VYIEPITTEA IAEIIRKEQP DGVIAGLGGQ TGLNVTAELA EEGVLEEYDV DIMGTPLDTI YATEDRDLFR QRMEKIGQPV PKSTTISLEE GEQVSELTEE DLRERVEAAV EEVGGLPVIA RTTYTLGGSG SGVVHEFDEL LARVRKGLRL SRNSEVLITE SIAGWVEYEY EVMRDADDSC IIICNMENID PMGIHTGEST VVTPSQIVPD EGHQEMRTAA LDVIRELGIQ GGCNIQFAWH DDGTPGGEYR VVEVNPRVSR SSALASKATG YPIARVTAKV ALGKRLHEIE NEITGETTAA FEPAIDYVVT KVPRWPKDKF DDVDFELTTA MKSTGEAMAI GRTFEESLLK ALRSSEYEPD VDWAEVSDEE LEEQYLERPS PDRPYAMFEA YERGYTVDEV VELTGIFEWY AERFKRIADS SLAAQEGDFT EAAIAGHTNA SIAATAGADV DTVEQEVPGR TYKQVDTCAG EFEAETPYYY SARKNEYEKG PLLGDAASGE LEVDRDLESV IVVGGGPIRI GQGVEFDYCS VHAVQALRDM GIEAHVVNNN PETVSTDYDT SDGLFFEPIT AEEVADVAEA TGADGVMVQF GGQTSVNIGE PLEDELERRG LDCTVMGTSV EAMDLAEDRD RFNALMDDLG IAQPEGGAAH SKEEAMQLAH EIGYPVLVRP SYVLGGRAMD VVYDDAELET YIEEAVRVSP DKPILVDDFL EDAVELDVDA VADGEDVLIG GVMEHVEAAG VHSGDSACMI PPRSLDDETL ERVREVTEDI ATALDTVGLL NVQLAVRDGE VYVLEANPRS SRTVPFISKA TGVPIAKIAA KVMAGETLAD LAIEEQIPEQ TSIKEVVLPF DRLPGSDPRL GPEMKSTGEV MGSADTFGKA YDKAQDATNK PIPESGTAII DLSADKFPDP ETEEGEALVD GFTEYFDLCE EVDLAQAVRE GKVDLIVSRD RDLLEVAVEE EITYFSTPAS ASAALEALEA KDEPIDVQAI TDRPKRTAEW GRSD
|
| |