Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4042 |
Symbol | |
ID | 8744670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 296242 |
End bp | 297156 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 646514608 |
Product | formiminoglutamase |
Protein accession | YP_003405555 |
Protein GI | 284167277 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family |
TIGRFAM ID | [TIGR01227] formimidoylglutamase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.954236 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAGTT TCACCACACC ACTAAACTGG TCGGGGCCGT CGTCGGACCC CCTAGACGAG CAGTTCGGCG ACGTCGTCAG TGGGACTGAC CTCGACACGG CTGACGAATT CGATGCAGTC CTCGTCGGCG AGCCGTACGA CGGCGCGGTT ATTTCCCGGC GCGGCGCCGC CGAAGGTCCA GAAGCACTTA GGAACGCGCT GGCAGGGGTG AAGACTCATC ACTTCGATAC GGGGGCCATC AATAGCGTCG CTGACCTCGG GGACATCGAC GTGCCTACAG GTTCAGTTAC CGAGGTTCAG GATGAGGTCC AAGAGGTTAC AAGGTCGGTT CACGCCCTCG ATACACTCCC CATCTTTCTC GGTGGCGATA ACTCACTAAC CGTCCCGAAC GTCGCTCCGC TGCTCGAGGA TTCCAGCGTC GGTGTCCTCA ACATCGACGC CCACCTCGAC GTTCGAACAG TCCGAGACGG ACCGACCAGC GGAACACCCT ACCGGCAGTT ACACGAGGCC GGACTTGACA GCTACACTTG TCTCGGCGCT CGACACTTCG AGACGAGTAC CGCCTATCAC GACTATGTCC GTGAGAACGG GGGAACGGTC GTGACTGCCG ACGAGGTCGC GGCAGACCTC TCCGAAGCCG TCGACCGGGC ACTCTCGAGT CTCGGCACCG TCGACCGCAT CTATTGCAGT GTCGATATCG ACGTGCTCGA TGCGAGCTAT TGCGGGTCTA GCGCCCCGAC GCCCGGCGGG TTACTGCCAC GCGAACTGTT CCGACTCATG CGTCTCGTCT CCGACGACGA ACGACTTGCG GGCTTCGAAC TCGTCGAGTG TGCGCCGCCA CTCGACACCG ACGGGCGAAC CGTTGATGCT GCAGCCCGTA CTGTGGCACA CTTCCTCTCC GGGTGGTCGG CGTGA
|
Protein sequence | MSSFTTPLNW SGPSSDPLDE QFGDVVSGTD LDTADEFDAV LVGEPYDGAV ISRRGAAEGP EALRNALAGV KTHHFDTGAI NSVADLGDID VPTGSVTEVQ DEVQEVTRSV HALDTLPIFL GGDNSLTVPN VAPLLEDSSV GVLNIDAHLD VRTVRDGPTS GTPYRQLHEA GLDSYTCLGA RHFETSTAYH DYVRENGGTV VTADEVAADL SEAVDRALSS LGTVDRIYCS VDIDVLDASY CGSSAPTPGG LLPRELFRLM RLVSDDERLA GFELVECAPP LDTDGRTVDA AARTVAHFLS GWSA
|
| |