Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3679 |
Symbol | |
ID | 8546069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 5059054 |
End bp | 5061942 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646388347 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_003268073 |
Protein GI | 262196864 |
COG category | [R] General function prediction only |
COG ID | [COG1480] Predicted membrane-associated HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.3383 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0000942474 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGAAA CCGAACGCAA CGACGAGATG CGCGCCACGC GCTCGCAGCT CGCCAAGGTC ATCCGCCGCC GTCACACGGT GGGTCTGGCT CTGGCCATCT TCATGAGCCT CGCGTTCGCC GCCGTCACCG CCCCGCTGGT CGCCATCGAT CTCCTGCTGC CGACCACCGG GGCGGTCTCG TTCGAGGTCG GCAAACCGGC GCCGATCACC GTGCGCGTGC CCCGCTTCTC GGGCTTCTCC GACGGCAGCG TGGAGCTGAG CCCGGGCGTG CTGGTGTCGC GCGGGACCAT CGTCGACCGC GAGGACTATC AAAACCTGCA GGTGCTGCGC GCCAACGGGC CGGACTCGTG GACCGCGGTC GGCGGCTACT TCGTGCTGCT GCTCGCGGTG GCGCTGATGT TCACCATCCA CCTGCGGCGC TCGCACCGCG GCCGGCTGCT GGCCACGCAG GCCTACACCA TGCTGCTCTT GCTCGGCTGC ACCATCCTGG CCGAGATCGC GCTGTTATTT TCATCGATGT CGGTGTTCCT GGTGCCGGTG GCGTGTCTGG CCATCGTCGC CACCGTGGTC GTCGACGTCT CCGCCGGCAT CGCCTCGGGG TTCCTCGCCA GCGTGCTCAT CGGCCTGCTG GTGCCCTTCG ACCTGGGCGT GGTGCTGGTG CTGGTGCTGC AGACCACGAC CGCCTCGCTG GTGGTCGGCG AGGGCCGGCC GCGCAACCGC CGCATCTTCG CCGCCGGCCT CATCGGCGGT GTGTGCGCGG CCATCGGCTA CATCGTGCTG TGTTACCTGA CCACCAAGCA CTCGCCCTTC GCCGAGCTGG CCTCGCCCAC GCGCTCGCCG CTGGCGGCGA CCGTGGCCGG CGGCGTGCTC AGCGGCCTGC TGGCCATCCC GCTCAAGCCG CTCTACCAGT ACCTGCGCGG CGATATCACG CAGTCCAAGC TGGTCGAGCT CGAGGACCTG TCCAATCCGC TGCTGCGCCA GATCGCGACC AACTCGCCCG GCACCTGGCA GCACAGCCTG GCCATGGCCA ACATGGCCGA GATCGCGGCC AACGCCATCG GCGCCGACGG CCGCCTGGTG CGCGTGGGCG CCTACTACCA CGACCTCGGC AAGTCGCTGC AACCCAAGTA CTTCATCGAG AACCTCGAGG CCGGCGAGAC CAGCCCGCAC GATCGCCTGC CGCCCGACGT CTCGTGCGAC GCGATCTTCG CCCACGTCAC CGAGGGCATC CGGGTGGCCC GCAAGAACCG CCTGCCCGAG CGCATCATCG ACTTCATGTA CATGCACCAC GGCGACGGGC TGCTCGAGTA CTTCTGGGCC AAGTGTCGCG AGAGCGGCAA CCCCAAGGGG CTCGTCGAGG ACGATTTCCG CTATCCCGGG GTGCCGCCGC AGAGCCGCGA GACCGCGATC CTGGCCATCG TCGACGCGGT CGAGGCGGCC TCGCGCACGC TCAAGAAGCC CGACGAGCGC GCCATCGAGA GCCTGGTGCA GCGCATCGTC TACGGCAAGC TGCACCTCGG CCAGCTCGAC CAGTCGGGGC TGAGCATGTC CGACCTGCGC AAGATCTCGG ACTCGCTGCG CGAGACCATC AAGCACGCCC ACCACGGCCG CATCGAGTAC CCGTGGCAGC GCGAGGAGCG CAAGAAGAAG GCCGCTGAGG CGGCCGCGGC CAAAGGTCTC GCCGCCCCTG CCGACACCGA CACGGACGTC GCCGCCGAGC CCGCGGCCGC CGCGCCGCCA GCGGCCGCGA GCGCGCCGCC GCCGGTGTCG GCCACCCAGC GCATCATCCA AGAGCCGCGG CTCGACTCGC TCGACGTGCC GCGCCCCTAC TGGCAGGGTC GCCGGCGCAG CAGTCAGGAG CCGGTGCTGG CCACCGCGCC CACCGAGGAG CTGGCGCCGC CGCCGGCCAA GCCCCAGCGC GCGCGCGCCG ACAGTGACGA TATCGGCCAC TCGGCCACGC TCGACATCGA GATCGTGGCC GCCGACGCCG ACGCCGACGC CAGCCCGGCC GCGGCCGCGA ACAACGGCGC CAAGGCGGCA GCGGCGGGTG CTGGGGCGGA CGAAGCCGAG ACGGCGTCCG ACTACCCGTA CCTGGAGTCG GCCAGTCAGT CGATGTCGGC GCTGCCGATG GCGGCTGAGC CCGACGACGA AGGCGACGAC AACGCGGTGG ACGAAGGCGG CGACACCACC GGCGCGGCCC CGCCGAGCCA GGCGCCGACG CTGTCGCTGC TCACGGCCGA GCCCGATATT GACACCGCGG CCGCGGCCGC GACCCCTGCT CCGGCCCCGA GCCAGCCCGC GGCCCCCGAG GAGGTGCGCG CGCCGCTGCC GGCCTCGGTG ACCCCGCGCG CGCCCGCCGA TATCGAGGCC GCGAGCGGCG ACGACGAAGC CCTGGCCCAG GTCCACGCCG CCGAGCGCGC CGAGCGCGCC GCCGTGCTGG TCGCCGCGGC GTTGGCGCAT GGGGCGCCCG ACGACGACGA CGACGACCCC GCCGCGGGCG ACGGCAACCG CGACGGAGCC GAGGGCGCAG CCAGCAACGC CACGCCCATC CCCATGGGCA CCAGCGTCAC CGGGCCGCCG CCGGCCACGC GCACGCGGCC GGTCGCGCGT CTGCGCGCCC GCGACAGCGG CATTCGCGAC AGCGGCATTC GCGACAGCGG CGCTCGCGAC AAGCCGCTGG GCAGCACCAA GCTGGGCTTC CCGGGCGCCG CGCAGGCGAT CGAGGAGGTC GCCGCCGGGC GTCCGAGCGG CGAGGCCGGA GCTGCCGACG AGCCCGCCGA GGGCGCGGCT GCCGCGAGGC CAGAGACGTC CGAGGGGGCC TCCGAGGGCG CCGCTGAGGC GGGCGCCGAG GAGAGGATCG ACCTGCCGCC CTCGGCGCGC GCCAAGTGA
|
Protein sequence | MSETERNDEM RATRSQLAKV IRRRHTVGLA LAIFMSLAFA AVTAPLVAID LLLPTTGAVS FEVGKPAPIT VRVPRFSGFS DGSVELSPGV LVSRGTIVDR EDYQNLQVLR ANGPDSWTAV GGYFVLLLAV ALMFTIHLRR SHRGRLLATQ AYTMLLLLGC TILAEIALLF SSMSVFLVPV ACLAIVATVV VDVSAGIASG FLASVLIGLL VPFDLGVVLV LVLQTTTASL VVGEGRPRNR RIFAAGLIGG VCAAIGYIVL CYLTTKHSPF AELASPTRSP LAATVAGGVL SGLLAIPLKP LYQYLRGDIT QSKLVELEDL SNPLLRQIAT NSPGTWQHSL AMANMAEIAA NAIGADGRLV RVGAYYHDLG KSLQPKYFIE NLEAGETSPH DRLPPDVSCD AIFAHVTEGI RVARKNRLPE RIIDFMYMHH GDGLLEYFWA KCRESGNPKG LVEDDFRYPG VPPQSRETAI LAIVDAVEAA SRTLKKPDER AIESLVQRIV YGKLHLGQLD QSGLSMSDLR KISDSLRETI KHAHHGRIEY PWQREERKKK AAEAAAAKGL AAPADTDTDV AAEPAAAAPP AAASAPPPVS ATQRIIQEPR LDSLDVPRPY WQGRRRSSQE PVLATAPTEE LAPPPAKPQR ARADSDDIGH SATLDIEIVA ADADADASPA AAANNGAKAA AAGAGADEAE TASDYPYLES ASQSMSALPM AAEPDDEGDD NAVDEGGDTT GAAPPSQAPT LSLLTAEPDI DTAAAAATPA PAPSQPAAPE EVRAPLPASV TPRAPADIEA ASGDDEALAQ VHAAERAERA AVLVAAALAH GAPDDDDDDP AAGDGNRDGA EGAASNATPI PMGTSVTGPP PATRTRPVAR LRARDSGIRD SGIRDSGARD KPLGSTKLGF PGAAQAIEEV AAGRPSGEAG AADEPAEGAA AARPETSEGA SEGAAEAGAE ERIDLPPSAR AK
|
| |