Gene Information |
Locus tag | Hlac_1114 |
Symbol | |
ID | 7400923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 1120190 |
End bp | 1121377 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643708179 |
Product | chorismate synthase |
Protein accession | YP_002565778 |
Protein GI | 222479541 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
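The coordinates and lengths above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


length = gene_length_bp(1120190, 1121377)
print(length, protein_length_aa(length))  # 1188 395
```

Under these assumptions the result agrees with the listed gene length of 1188 bp and protein length of 395 aa.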
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGGGA ACCGGTTCGG TCGGCTCTTC CAGGTGACGA CGTACGGCGA GAGCCACGGC GAGGCGATGG GTGTGACGGT CTCGGGCGTG CCCGCCGGCG TCGAGCTCGA CGAGGAGGCG ATCCAAGCAC AGCTTGACCG GCGCAAGCCG GGCCAGTCGA TGATCACCAC CTCCCGGGGC GAGCCCGACG AGGTCGTCGT CAACTCCGGC GTACAGGACG GCTACACCAC CGGAACGCCG ATCGGAATGG TGATCCAGAA CAAGGACGCG CGCTCGGGGA AGTACGAGCC GTACGTCACC GCGCCGCGCC CATCGCACGG CGATTACACC TACTCCGCGA AGTTCGGCAC GCGCAACTGG GGCGGCGGCG GGCGCTCCTC CGCCCGGGAG ACGGTGAACT GGGTCGCGGC CGGCGCGGTC GCCGAGCAGG TGCTCGACGC CTCCGAGTAC GACGTGGAGA TCAAAGCCCA CGTGAACCAG ATCGGCGACG TCGAGGCCGA CGACGTGAGC TTCGAGCAGA TACTCGACCA CAGCGAGGAG AACGACGTGC GCTGTGCCGA CCCCGAGGCG GCCGCCGAGA TGCAGGAGCT GATTGAACGG TATCAGGAGG CGGGCGACTC CATCGGCGGC TCCATCTACT TCGAGTGCCG CGGCGTCCCC CGCGGGCTCG GCGCCCCGCG CTTCGACGGC TTCCCGTCCC GGCTCGGGCA GGCGATGTTC TCGATCCCGG CGACCACGGG CGTCGAGTTC GGACTCGGCA AAGACGCCGT GAACGTGACC GGGAGCGAGC GCAACGAGGA CTGGACATTT GACGACGGCG AGTCGTTCGA CCATGTCGAA AGCGAGGAGG GCGATCCGGT CCCCGTCGGG AACGACCACG GCGGGCTCCA GGGCGGGATC ACGACCGGTG AGCCCATTTA CGGCGAGGCG ACGTGGCACG CGCCCACCTC GATCCCGAAA AAGCAGCGCT CCGCCGACTG GGAGACGGGC GAGGAGAAGG ACGTGCAGGT CGTCGGCCGG CACGACCCCG TCCTCCCGCC GCGGGCCGTC CCCGTCGTCG AGGCGATGCT GTACTGCACC GTCCTCGACT TCATGTTGCT CGCCGGCCGG ATCAACCCCG ACCGCGTCGA CGGCAACCCG GGCCAGTACG ACACCGACTA CCACCCGAGC AGCCCCGACA ACGATTGA
Protein sequence | MNGNRFGRLF QVTTYGESHG EAMGVTVSGV PAGVELDEEA IQAQLDRRKP GQSMITTSRG EPDEVVVNSG VQDGYTTGTP IGMVIQNKDA RSGKYEPYVT APRPSHGDYT YSAKFGTRNW GGGGRSSARE TVNWVAAGAV AEQVLDASEY DVEIKAHVNQ IGDVEADDVS FEQILDHSEE NDVRCADPEA AAEMQELIER YQEAGDSIGG SIYFECRGVP RGLGAPRFDG FPSRLGQAMF SIPATTGVEF GLGKDAVNVT GSERNEDWTF DDGESFDHVE SEEGDPVPVG NDHGGLQGGI TTGEPIYGEA TWHAPTSIPK KQRSADWETG EEKDVQVVGR HDPVLPPRAV PVVEAMLYCT VLDFMLLAGR INPDRVDGNP GQYDTDYHPS SPDND
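The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for joining the blocks and computing a GC percentage is given below; the helper names are hypothetical, and the demonstration uses only the first two blocks of the gene sequence, so it does not reproduce the 69% figure reported for the full gene.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, copied from the record above.
first_blocks = "ATGAACGGGA ACCGGTTCGG"
seq = clean_sequence(first_blocks)
print(len(seq), gc_percent(seq))  # 20 60.0
```

Passing the complete gene sequence string to `clean_sequence` and then to `gc_percent` gives a value that can be compared with the GC content field of the record.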