Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2447 |
Symbol | |
ID | 8384749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 2521842 |
End bp | 2523584 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644973523 |
Product | PHP domain protein |
Protein accession | YP_003131346 |
Protein GI | 257053513 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1796] DNA polymerase IV (family X) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.335837 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAGCA ACGCCGAACT CGCCGCTCGT CTCGAGGAAT TCGCCGACCT GCTGGACGCC AAAGACGTCG AATACAAACC ACGCGCCTAT CGCCGGGCGG CCGAGAACGT CGCCGAATAT CCCGGGGACG TGGTCGATCT CGCCGAAGAC GGGGTCGAGG CAGTCCAGGA AATCGACCGC GTCGGCGAGG CGATCGCCGA CAAGCTCGTC GAGTACGTCG AGACCGGCGC GATCGAGGAA CTCGAGGATC TGCGGGCGGA ACTCCCCGTC GAGATGGACG CGCTGACGAG CGTCGAGGGC GTCGGGCCGA AGACCGTCGG CACGCTCTAT GAGGCACTCG GAATCCAAAC CCTCGACGAC CTGGAGGCGG CCGCCGAGGC GGGGGAGATC CAGAACGTCT CGGGGTTCGG GGCGAAGACA GAGCAGAACA TCCTCGATGG GATCGACTTC GCCAGAGAAG CCCACGAACG CCAGTTGCTG GGCGAGGCCC GGCCGCGGGG CGAGCGCGTG CGCGAGTACC TACGAGCGGT GGAGGCCGTC CAGCAATGTG AACTCGGCGG GTCGCTTCGC CGCTGGAAAC CGACGATCGG CGATGTGGAT GCGCTGGTCG CAAGCACCGA GGGTCCGGCC GTGGTCGAAG CATTCACCGA CTGGGACGGG GCCGACCGGG TGATCGAAGC CGGCGAGGGG AAGGCGAGCC TGCGCGCCGA CGGCGTCCGG ATCGACCTCA GGGTGGTCGA TCCAGCCGAA TTCGGCGCGG CGTTGCAGTA CTTCACCGGA AGCAAGGACC ACAACGTCGA CCTGCGCAAC CGGGCGATCG ATCGGGATCT CAAACTCAAC GAGTACGGCG TCTTCGACAT TTCGGCTGTG GAGGGGGAAG TAGACGATCA GCGGGCTGGC GAGCGCGTCG CCGGCGAGTC AGAGGCCGAG GTCTACGAGG CGCTCGATCT CCCCTGGATC CCGCCGGAGT TACGTGAGAA CCGCGGTGAA ATCGCGGCCG CCGACGCGGG TGCGCTGCCC GAGTTGATCG AGACGGCCGA GATCCGCGGC GACCTCCACG TCCACACGGA ATGGTCGGAC GGCAACAACA CGATCGAAGA GATGGCCGCG GCTGCCGCCG AACGGGGTGA CGAGTACGTC TGCATCACCG ATCACGCGAC CGGGCCGGGC ATGGTCGGCG GCGTCGGGCT CACCGACGAC GAACTCCGCG AGCAGCGCGA GGCGATCGAG GCGGTTGAGG ACGAACGCGA CGATATCACG GTCCTGACGG GCGTCGAAGC AAATATCGAT ACCGACGGCG GAATCTCTGT CGGTGATAAG GTGCTCGAAA CGCTGGATCT GGTCGTCGCC TCGCCACACG CCGGCCTCGA CGGCGACGGA ACCGACCGCC TGATCGAAGC CGCCCGCCAC CCCGCAGTGG ACGTGATCGG CCATCCGACG GGGCGACAGC TCAACCGCCG CCCCGGCCTC GACGTGGACG TTTCCGCCGT GGCCGAGGTG GCGGCCGAGC ACGACACGGC GCTGGAGGTC AACGCCAACC CGCATCGGCT CGATCTCGAA GGGAGCCAGG TCAAGCGGGC GATCGACGCG GGGGCGACGA TCGCGATCGA CACCGACGCG CACCGACCTG AAGCTCTCGA TTACCGGCGC TTTGGCGTCC ACACCGCGCG GCGCGGTTGG GCGGAAGACG CAAACGTACT CAATGCTCGA TCGGCTGCCG GCCTCCATGA CTTTCTCGGG TGA
|
Protein sequence | MTSNAELAAR LEEFADLLDA KDVEYKPRAY RRAAENVAEY PGDVVDLAED GVEAVQEIDR VGEAIADKLV EYVETGAIEE LEDLRAELPV EMDALTSVEG VGPKTVGTLY EALGIQTLDD LEAAAEAGEI QNVSGFGAKT EQNILDGIDF AREAHERQLL GEARPRGERV REYLRAVEAV QQCELGGSLR RWKPTIGDVD ALVASTEGPA VVEAFTDWDG ADRVIEAGEG KASLRADGVR IDLRVVDPAE FGAALQYFTG SKDHNVDLRN RAIDRDLKLN EYGVFDISAV EGEVDDQRAG ERVAGESEAE VYEALDLPWI PPELRENRGE IAAADAGALP ELIETAEIRG DLHVHTEWSD GNNTIEEMAA AAAERGDEYV CITDHATGPG MVGGVGLTDD ELREQREAIE AVEDERDDIT VLTGVEANID TDGGISVGDK VLETLDLVVA SPHAGLDGDG TDRLIEAARH PAVDVIGHPT GRQLNRRPGL DVDVSAVAEV AAEHDTALEV NANPHRLDLE GSQVKRAIDA GATIAIDTDA HRPEALDYRR FGVHTARRGW AEDANVLNAR SAAGLHDFLG
|
| |