Gene Information |
Locus tag | Huta_1523 |
Symbol | |
ID | 8383802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1499885 |
End bp | 1501024 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644972585 |
Product | 2-methylcitrate synthase/citrate synthase II |
Protein accession | YP_003130431 |
Protein GI | 257052598 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | [TIGR01800] 2-methylcitrate synthase/citrate synthase II |
|
|
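The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based, inclusive coordinates and a single terminal stop codon. Both are assumptions, as are the helper names `gene_length` and `protein_length`, which are illustrative only and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1499885
end_bp = 1501024


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Codon count minus one, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # 1140 379
```

Under these assumptions the computed values agree with the reported Gene Length (1140 bp) and Protein Length (379 aa).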
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.297697 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGACG AGCTCAGAAA GGGACTGGAG GGAGTGCTGG TCGCCGAATC GTCGCTGAGC GACATCGACG GCGAGGAGGG GCGACTCGTC TACCGGGGGT ACGCGATCGA GGATCTCGCA CGCAACACCT CCTTCGAAGA GGTACTGTAT CTGCTCTGGG AGGGCCACCT CCCGACTGAA TCCGAACTCG AACCGTTCGA AGAGCAGATG GCCGAGGAGC GGGCGGTCCG CGAGGAGGTA CTCTCGACGG TCCGGGAACT CGCCGCCGCC GAGGAGAACC CGATGGCCGC CCTGCGGACT GCCGTCTCGA TGCTGTCGGC CTACGACCCG GACGACTCGA CGGGCGAGGC CGGCGACGGC GAGATCGCCC AGCGGAAGGG TCGCCGGATC ACCGCGAAGA TGCCGACGAT CGTCGCCGCG TACAAACGAC TTCGGGACGG CGAAGAGCCC GTCGCACCCC GGACGGACCT GAGCCACGCC GAGAACTTCC TGTACATGCT CAACGGCGAG GAACCGGCCC AGACCCTCGC GGACGTCTTC GACATGGCGC TGGTCGTCCA CGCCGACCAC GGGATCAACG CCTCGACGTT CGCCGCGATG GTGACAGCCT CGACGCTGTC TGATCTCCAC AGTTCGGTCA CCAGCGCCAT CGGCGCACTG AAGGGAGGAC TCCACGGCGG CGCGAACCAG AACGTGATGC AGATGCTGCT TGAACTCGAC GAGAGCGACC TGACGGCCGT CGAGTGGGCC CGTGAGGCCG TCGAGTCCGG CGATCGTATC CCCGGATTCG GCCACCGCGT CTACGACGTC AAAGACCCAC GGGCGAGGAT CCTCAGTGCC AAGTCCAAGG CGCTGGGCAA GGCCGCCGAC GAACTCAAAT GGTACTCTTA CTCTCGCGCC ATCGAGGAGT TCATGGCCAA GGAGACCGGC ATCGCGCCCA ACGTCGACTT CTACTCGGCG TCGATGTACT ACGAGATGGG CATCCCGATT GACCTCTATA CGCCCATCTT CGCGATGAGT CGCGTCGGTG GCTGGGTCGC CCACGTCCTC GAATACCAGG AAGAGAACCG CCTCATCCGC CCCCGAGCGC GTTACGTCGG TCCGGAAGAT CGGGAGTTCG TCCGGATCGA GCAGCGGTAG
|
Protein sequence | MPDELRKGLE GVLVAESSLS DIDGEEGRLV YRGYAIEDLA RNTSFEEVLY LLWEGHLPTE SELEPFEEQM AEERAVREEV LSTVRELAAA EENPMAALRT AVSMLSAYDP DDSTGEAGDG EIAQRKGRRI TAKMPTIVAA YKRLRDGEEP VAPRTDLSHA ENFLYMLNGE EPAQTLADVF DMALVVHADH GINASTFAAM VTASTLSDLH SSVTSAIGAL KGGLHGGANQ NVMQMLLELD ESDLTAVEWA REAVESGDRI PGFGHRVYDV KDPRARILSA KSKALGKAAD ELKWYSYSRA IEEFMAKETG IAPNVDFYSA SMYYEMGIPI DLYTPIFAMS RVGGWVAHVL EYQEENRLIR PRARYVGPED REFVRIEQR
|
| |
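The gene and protein sequences above can be cross-checked with a short script. This is a minimal sketch, not the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard codon-to-amino-acid assignment, which translation table 11 shares (table 11 differs in which start codons it permits). Only the first three 10-base blocks of the gene sequence are used here; the full sequence can be pasted in the same way.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGCCCGACG AGCTCAGAAA GGGACTGGAG"
print(translate(first_blocks))  # MPDELRKGLE
```

The ten residues produced match the first block of the protein sequence in the record. Calling `gc_fraction` on the full gene sequence gives a value to compare with the reported GC content.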