Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4041 |
Symbol | |
ID | 8744669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 295028 |
End bp | 296245 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 646514607 |
Product | imidazolonepropionase |
Protein accession | YP_003405554 |
Protein GI | 284167276 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACTGACC TCGTCGTCCA CAACGCGGCC CAAGTCACAA CCCCAACTGA CGATGGTGGG ATGATGACGA CACAGGACGG AGCCGTGGCA ATCGAGGATG GTGTCATCGT CGCAGTGGGA CCGACCGATG AGGTCACGCG CGAATACCCG GAAGAGAACG CGACGGATGC CGTCGACGCC GCCGGAAAGA CGGTTCTTCC GGGATTCGTC GACCCCCACA CACACGCCGT GTTTGCGGGC GACCGCGCTG ACGAGTTCAC TGCGAAGCTC AAGGGCGCGG AGTATCAGGA CATCCTCGAA GAAGGTGGGG GCATCTTGCG AACTGTGCGA GCGACGCGCG AGGCCTCCCG CGACCGCCTC GTCGAGAGCC TCCTCGCACG TCTCGACGTC ATGCTCGCCC ATGGGACGAC GACGGTCGAA GTGAAATCCG GCTACGGGCT TGACGTCGAG ACGGAAATGA AGATGCTCGA AGCAATCGGC GAAGCGGCCG ACCGCCACCC GATTGACGTG GTGCCGACGT TCATGGGTGC TCACGCTGTG CCCGATGACA CGGACACCGA CGCGTACATC GACCGGGTCG TCGAAGAGCA GCTACCTGCG GTGACGGCAA ATGATCTCGC CGAGTTCTGT GACATCTTCT GTGAGGCGGG CGTCTTCTCT GTCGATCACT CCCGGCGCGT CCTTGAAGCC GGGCAGGAAC ATGGGCTGAA ACCGAAGATC CACGCCGACG AGTTCGAGCG ACTCGGCGGA TCACAGTTGG CTGCCGACAT CGGGGCCACG AGCGCGGATC ATTTGCTGCA GTCGACCGAG GCGGACATCA CAGCGCTCGG CGAAGCGAAC GTAACGCCCG TTCTCCTGCC GGGGACTGCC TTTTCGCTCG CGACTGACTA CGCTGATGCG ACAGCCTTCG AAGCGGCGGA CGTTCCGGTC GCTATCGCGA CCGATTTCAA TCCGAACTGT TACTCGCAGA GTATGGAATT CGCAATCGAA CTCGCCTGCA ACGGTATGCG TATGAGTCCC GCCTCGGCTA TCCGTAGCGC AACAGTTACG GCTGCCAACG CTATCGACCG AACGGACGGC ACGGGTATCC TTCGAGAGAA CTCGCCCGGA GACTTAATCG TCGCCGACGT GCCCGACTAT CAGCACCTCC CGTACAATTT CGGCGTGCAA AACGTCCAAA CAGTCGTAAA GCGTGGGGAG GTGGTCCGCC ATGACTGA
|
Protein sequence | MTDLVVHNAA QVTTPTDDGG MMTTQDGAVA IEDGVIVAVG PTDEVTREYP EENATDAVDA AGKTVLPGFV DPHTHAVFAG DRADEFTAKL KGAEYQDILE EGGGILRTVR ATREASRDRL VESLLARLDV MLAHGTTTVE VKSGYGLDVE TEMKMLEAIG EAADRHPIDV VPTFMGAHAV PDDTDTDAYI DRVVEEQLPA VTANDLAEFC DIFCEAGVFS VDHSRRVLEA GQEHGLKPKI HADEFERLGG SQLAADIGAT SADHLLQSTE ADITALGEAN VTPVLLPGTA FSLATDYADA TAFEAADVPV AIATDFNPNC YSQSMEFAIE LACNGMRMSP ASAIRSATVT AANAIDRTDG TGILRENSPG DLIVADVPDY QHLPYNFGVQ NVQTVVKRGE VVRHD
|
| |