Gene Information |
Locus tag | Htur_4106 |
Symbol | |
ID | 8744734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 370328 |
End bp | 371326 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646514663 |
Product | formiminoglutamase |
Protein accession | YP_003405610 |
Protein GI | 284167332 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0010] Arginase/agmatinase/formiminoglutamate hydrolase, arginase family |
TIGRFAM ID | [TIGR01227] formimidoylglutamase |
|
|
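The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive (which is what makes them agree with the stated 999 bp) and that the final codon is a stop codon, so it is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length_bp = end_bp - start_bp + 1      # inclusive range
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length_aa = gene_length_bp // 3 - 1  # drop the stop codon
    return gene_length_bp, protein_length_aa


# Values copied from the record for Htur_4106.
gene_bp, protein_aa = cds_lengths(370328, 371326)
print(gene_bp, "bp;", protein_aa, "aa")
```

With the values in this record the sketch gives 999 bp and 332 aa, matching the Gene Length and Protein Length fields.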
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACA CCGAGGAGTC GATGAACGAG TTCACGATCT CCCCCGACTG GACCGACGAC GAATGGCGCG GAATGGCTCA ATCGACCGAC CCGAACGACG AACTGGTCGG CCATATCGTC GAGGGAATGA CACTCGAGGC CGTCGACGAC GCCGACGTCG ACGCGGTGCT GGTGGGCGAA CCGTACGACG GTGCAGTTAT CAGTCGATCG GGGGCGCGCG AGGGGCCGAC CGAGATCCGC CGCTCGCTTG TGCGAACGAA AACCCACCAC TTCGACTGTG GCCCGCTTCG GGTACTCGGT GACCTCGGCG ACGTTCGCTC GCTGGTCGAC GCTGGCACGC CCGGCACCGA CTCGTCCGTC GCCGTCGTTC AGTCGACGCT CCGTGAGACG ACGGCCCGCG TGCACGAGTG CGACGCGGTA CCGATTTTCC TCGGGGGAGA CAACTCTCTG ACGTACCCGA ACGTCGCCCC GCTACTCGAA CAGAGCTCCG TGGGCGTGAT CAATCTCGAC GCGCATCTGG ACGTCCGCGA GGTCCGGGGC GAACCGACGA GCGGCACGCC GTACCGACAG CTCTTCGCAG CCGGTCTCGA TCAATACGTC TGCCTCGGGG CGCGACACTT CGAGACGGCA ACCCCGTACC ACGAGTTCGT CCGTGAGCGT GGCGGCGCGG TCATCACGGC CGAAGAAGTC GCGGATGACG CCGTTGAGAC GGCGACGCAC GCACTCGATG CGATGGGTGA CGTCGATCGA CTCTACGTGA GCGTAGACTG CGATGTACTC GACGCGAGTG CAGCCCCCGG CGTGAGTGCG CCGACGCCGG GCGGCATCAC CACGCGAGAG CTGTTTCGCT GCCTGCGACT GCTTACGAGC GACGAGCGAC TCGCGGGGTT CGAGGTTGTC GAATGTGCCC CGCCGCTCGA CCGGAATGGA CTGACGACCG ATGCGGCGGC CCGTGCCGTT GCGCACGCCC TTGCCGGCTT TCTGGGGGGA CAACAATGA
|
Protein sequence | MTDTEESMNE FTISPDWTDD EWRGMAQSTD PNDELVGHIV EGMTLEAVDD ADVDAVLVGE PYDGAVISRS GAREGPTEIR RSLVRTKTHH FDCGPLRVLG DLGDVRSLVD AGTPGTDSSV AVVQSTLRET TARVHECDAV PIFLGGDNSL TYPNVAPLLE QSSVGVINLD AHLDVREVRG EPTSGTPYRQ LFAAGLDQYV CLGARHFETA TPYHEFVRER GGAVITAEEV ADDAVETATH ALDAMGDVDR LYVSVDCDVL DASAAPGVSA PTPGGITTRE LFRCLRLLTS DERLAGFEVV ECAPPLDRNG LTTDAAARAV AHALAGFLGG QQ
|
| |
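The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in the permitted start codons). The gene sequence above begins with ATG and ends with TGA, so the sketch assumes it is already written in coding orientation even though the gene lies on the minus strand. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order, shared by the standard code and table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGACCGACA CCGAGGAGTC GATGAACGAG"
print(translate(first_blocks))
```

Translating the first three blocks yields MTDTEESMNE, the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.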