Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4105 |
Symbol | |
ID | 8744733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 369048 |
End bp | 370331 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646514662 |
Product | imidazolonepropionase |
Protein accession | YP_003405609 |
Protein GI | 284167331 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGCCG CGACACCAGA CCCCAGGGGG CCGCTCTGTA TCGTGTACAA CGCGGGGGAA CTCGTCGTTG GGCCGGTCGA TGACACAGCC GACACGGCAC CAGCCGACGA CGCGCGCACC GACGGCCCGC TGGAGATTCG CGAGGATGCG GCGTTCGCCG CGATCAACGG CGAGGTTGTC GCCGTCGGCC CAACCGACGA GATCACGCGG GCGTATCCGC CCGACAATGC CGCCACCGCG ATCGACGCGG ACGGGAACGC CGTCGTCCCG GGATTCGTCG ATCCGCACAC GCACGCTCTC TTCGCGGGAG ACCGATCGGA CGAGTTCGCG GCCAAGGTGC GCGGCCGGAG CTACCAGGAG ATCCTCGCAG AGGGTGGCGG GATCCTCCGA ACCGTCCGTG CGGTCCACGA CGCGAGCGAC GACGACCTCC TCGCGAACCT GCGATCACAC CTCGACGTCA TGCTCGCCCA CGGGACCACT ACTGTTGAGG TCAAATCCGG CTACGGCCTC GAGGCCGAGA CCGAGCTCCG ACTGCTCGAG ACGATCGATC GCGCCGCCGC CAAACACCCG ATCACTCTCG TACCGACGTT CATGGGAGCA CACGCGGTTC CGGCGGACAC GGACACCGAA GACTACGTTG ACCGCGTCAT CTCGGACCAG CTCCCCGCCG TCGCCGACCA AGGGATCGCC GAGTTCTGTG ACGTCTTCTG TGAGGCGGAT GTGTTCGATG TCGACCAGTC TCAGCGCGTG CTCAAGGCGG GTGCAGCCGC GGGACTCACG CCGAAAATCC ACGCCGAGGA GTTTACCCGT CTCGGGGGCG CACAGCTCGC CGCCGAGCTC GAGGCTGCGA GCGCCGATCA CCTGTTGCAC GCGACGGCCG AGGACGTCGC GGCGCTCGTC GAGGCAGCCG TCGTTCCCGT GCTCTTGCCC GGGACGGCGT TCGGCCTCGG TGCAGCGTAC GCCGACGCGC GAGCGTTTCT AGAGGCGGGG GCTTCCGTTG CGCTCGCGAC AGACTTCAAT CCAAACTGTC ACGTACGGAC GATGGAGTTC GTCCAGACCC TCGCCGTCAT GAAGATGGAC CTCACGCCGG CCGAAGCGCT GCTCGCAGCG ACGCGCAACG CGGCACTCGC GATCGACCGA GACGACGGCA CTGGGACGCT TCGCGAAGGG GCCCCTGCGG ATGCCGCGGT GCTGGCAGCG CCATCGTACG TCCACCTCGC GTACCGGTTC GACACCACGG CCATCGAGAC GGTCGTCAAA AACGGCAGGG AGGTAGGGAC GTGA
|
Protein sequence | MTAATPDPRG PLCIVYNAGE LVVGPVDDTA DTAPADDART DGPLEIREDA AFAAINGEVV AVGPTDEITR AYPPDNAATA IDADGNAVVP GFVDPHTHAL FAGDRSDEFA AKVRGRSYQE ILAEGGGILR TVRAVHDASD DDLLANLRSH LDVMLAHGTT TVEVKSGYGL EAETELRLLE TIDRAAAKHP ITLVPTFMGA HAVPADTDTE DYVDRVISDQ LPAVADQGIA EFCDVFCEAD VFDVDQSQRV LKAGAAAGLT PKIHAEEFTR LGGAQLAAEL EAASADHLLH ATAEDVAALV EAAVVPVLLP GTAFGLGAAY ADARAFLEAG ASVALATDFN PNCHVRTMEF VQTLAVMKMD LTPAEALLAA TRNAALAIDR DDGTGTLREG APADAAVLAA PSYVHLAYRF DTTAIETVVK NGREVGT
|
| |