Gene Information |
Locus tag | Huta_0181 |
Symbol | |
ID | 8382443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 180491 |
End bp | 181690 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644971239 |
Product | chorismate synthase |
Protein accession | YP_003129102 |
Protein GI | 257051269 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
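The length fields in this record can be cross-checked against its coordinates. The sketch below is only a consistency check under two assumptions: that Start bp and End bp are 1-based, inclusive positions, and that the CDS includes its stop codon (the gene sequence in this record ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
# Values copied from the Gene Information table in this record.
START_BP = 180491
END_BP = 181690


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop_codon: bool = True) -> int:
    """Length in aa for an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; the stop codon,
    if present, does not code for an amino acid.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop_codon else codons


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp)                  # 1200, matches "Gene Length"
    print(protein_length(length_bp))  # 399, matches "Protein Length"
```

Both results agree with the Gene Length (1200 bp) and Protein Length (399 aa) fields listed above.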
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGGTA ACAGCTTCGG TCGGCTTTTT CAGGTGACCA CGTACGGCGA GTCACACGGC GAGGCGATGG GGTGTACGGT TTCGGGCGTC CCGGCGGGCG TCGAACTCGA CGAGGATGAC ATCCAGCACG ACCTCGACCG GCGCAAACCC GGCCAGTCGA TGATCACGAC CTCGCGGGAC GAACCCGACG CCGTCACGAT CAACTCGGGC CTGCAGGACG GCTACACGAC TGGCACGCCG ATCGGGATGG TCATCCAGAA CAAGGATTCC CGTTCGGGCA AGTACGAGCC CTTCGTCACG GCCCCCCGCC CCTCCCACGG AGACTTTACC TACTCGGCGA AGTTCGGCAC GCGCAACTGG GGCGGCGGTG GCCGGTCCTC GGCCCGGGAG ACGGTCAACT GGGTCGCCGC GGGTGCCATC GCCAAGCAGG TCCTCGACCA GAGCGAGTAC GATGTCCAGA TCAAGGCTCA CGTCAACCAG ATCGGCGAGA TCGAAGCCCC GGATGTCACC TTCGAGGAGA TGCTCGAACA CAGCGAAGAA AACGACATCC GGTGTGCTCA CCCGGAAACT GCCGAGGAGA TGCGCGATCT GGCCGAACAG TACCAGCAAG AAGGTGATTC CATCGGCGGG TCGGTCTACT TCGAATGCCA GGGCGTCCCG CGGGGACTCG GCGCGCCCCG CTTCGACTCC ATCCCTTCGC GGCTGGGACA ACTGATCTAC TCGATCCCGG CGGTCAACGA CTTCGAGTAC GGGGTCGGCC GCGACGCCCG GACGATGGCC GGTAGCGAGT ACAACGAGGA CTGGGAATTT GATTCGAATG GGGATCCGGT CCCCGTCGGC AACGACCACG GCGGGATCCA GGGCGGGATC ACGACCGGCG ACCCCATCTA CGGCGAGATC ACCTGGCATC CGCCGGTCTC GATCCCGAAA GCCCAGGAGA CCGTCGACTG GGAGACGGGC GAGCGCAAGG AGATCCAGGT CGTCGGCCGC CACGACCCCG TTCTCCCGCC GCGTGCCGTC CCCGTCGTCG AGGGGCTCCT GTACTGTACG GTGCTAGACT TCATGCTGCT CGGTGGACGG ATCAACCCCG ACCGACTCGA CGGCCGGCCC GGCGAGTACG ACACCGACTA CCATCCCTCG AGTCCGGTCA ACGACCCCGA CGACGCCGCG ACGCAGGCCG AGACGATCGA CGACGAGTGA
|
Protein sequence | MNGNSFGRLF QVTTYGESHG EAMGCTVSGV PAGVELDEDD IQHDLDRRKP GQSMITTSRD EPDAVTINSG LQDGYTTGTP IGMVIQNKDS RSGKYEPFVT APRPSHGDFT YSAKFGTRNW GGGGRSSARE TVNWVAAGAI AKQVLDQSEY DVQIKAHVNQ IGEIEAPDVT FEEMLEHSEE NDIRCAHPET AEEMRDLAEQ YQQEGDSIGG SVYFECQGVP RGLGAPRFDS IPSRLGQLIY SIPAVNDFEY GVGRDARTMA GSEYNEDWEF DSNGDPVPVG NDHGGIQGGI TTGDPIYGEI TWHPPVSIPK AQETVDWETG ERKEIQVVGR HDPVLPPRAV PVVEGLLYCT VLDFMLLGGR INPDRLDGRP GEYDTDYHPS SPVNDPDDAA TQAETIDDE