Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_5031 |
Symbol | |
ID | 8547441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 6941215 |
End bp | 6944442 |
Gene Length | 3228 bp |
Protein Length | 1075 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646389706 |
Product | Endonuclease/exonuclease/phosphatase |
Protein accession | YP_003269412 |
Protein GI | 262198203 |
COG category | [R] General function prediction only |
COG ID | [COG2374] Predicted extracellular nuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0324828 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.671967 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAGAG TAGTCATCGG AGCGCTTGCC GCTCTCTGGG TTGGGCTGAG CACGCAGCAA GCCCACGCCT CGACCGAGCT GTTCATCTCC GAGTACATCG AGGGATCGAG CAACAACAAA GCCGTCGAGA TCTTCAACGG CACCGGCGCA GACGTCGATC TCGCCGGCTA CGAGCTCCGC TTCTACTTCA ACGGCAGCAC CTCGCCCGGG TTCCAGCTCG GCCTCACGGG CACGGTGGCC GACGGCGACG TGTTCGTGGT CGCGCACAGC AGCGCGGTCG CCGCCATTCT TGACCAGGCC GATGTCGTCT CGGGCTCCGG CTTCTTCAAC GGCGACGACG CCGTGGCCCT GGTCAATAGC GGCAGCATCA TCGACGTCAT CGGCCAGATC GGCGTCGATC CCGGCAGCCA GTGGGGCAGC GGCGACGCCA GCACCCAGGA CAACACCCTG CGCCGCGCCG AGGGCTTCTG CGAGGGCGAC CCCGACGGCA GCGACGCCTT CGATCCCGCC GCCGAGTGGG ACGGCTTCGC CCAGGACAGC TTCGACGGCC TCGGCGCCCA TGAGGGCTGC GCCGCGGCGC TGGCCGAGGA CCTGTTCTTC TCCGAGTACA TCGAGGGCTC GAGCAACAAC AAGGCGCTCG AGATCTTCAA CGGCACCGGC GCCGATGTCG CGCTCGGCCT CTACCAGGTG CAGTTCTACT TCAACGGCAG CGCCTCGGCC GGCTTCACCC TGTCGCTCTC GGGCACCGTG GCCGACGGCG ACGTGTTCGT GCTCGCGCAC AGCAGCGCGG TCGCCGCCAT CCTCGACCAG GCCGATGTCA CCTCGGGCGC CAGCTTCTTC AACGGCGATG ACGCCATCGT GCTGCTCAAC GACGGCGTCA TCATCGACGT CATCGGCCAG ATCGGCGTCG ATCCCGGCAG CCAGTGGGGC AGCGGCGACG CCAGCACCCA GGACAACACC CTGCGCCGCC AGGGCGCGCT GTGCGCCGGC GACGCCGACG GCAGCGACGC CTTCGACCCG GCCGGCGAGT GGGACGGCTT CGCCCAGGAC AGCTTCGACG GCCTCGGCGC CCACGAGGGC TGCGGCGGCG GCGCCCCCGA CCCCGACCCC GACCCCGACC CCGACCCCGA CGCGGTCTTC GTCCACCAGG TCCAGGGCAC CGGCACGGCC AGCCCCATGG TCGGCGCGCA GGTCATCATC GAGGGCATCG TGGTCGGCGA CTTCCAGGGC GATCTCCTGG GCGGCTTCTT CCTCCAGGAG GAGGACGCCG ACGCCGACGC CGATCCGCTC AGCTCCGAGG GCATCTTCGT CTATCAGGGC GCCACCGGCA CCGAGGTCGC CGAGGGCGAT CTGGTGCGCG TGAGCGGCAC CGTGGCCGAG TACTTCGACA ACACCCAGCT CAGCAGCGTG AGCTCGGTCG AGATCATCGC CACCGCGCAG CCCCTGCCGG CGGTGAGCGA CATGCTGCTG CCCATGCTCA GCGCCGATGA GTTCGAGCGC TACGAGGGCA TGCTGGTGCG CCTGCCGCAG GTGCTCACGG TCACCGAGAA CTACACCCTG GGCCGCTACG GCGAGGTCTG GCTGTCCGCG GGCGGTCGCC TCATGCAGCC CACGGCCGTG GCCCTGCCCG GCGACGACGC GCTCGCGGTG CAGGCCGAGA ACGACCTCAA CCGCCTGCTC ATCGATGACG GACGCACGGC CCAAAACCCC GACCCCATCA TCTTCCCGGC GCCCGGCCTG AGCGCCGACA ACACCCTGCG CAGCGGCGAC AGCGTGGCCG GCGTGGTCGG CGCGCTCAAC TTCAGCTTCG GCAGCTACCG CGTACAGCCG ACCACGGCGC CGAGCTTCGT CGCCAGCAAC CCGCGCACGC CCGCTCCGGG CGCGGTCGGC GGCAGCTTCA AGGTCGCCAG CTTCAACGTG CTCAACTACT TCAACGGCGA CGGCCAGGGC GGCGGCTTCC CGACCGCGCG CGGCGCCGAC ACCGCGGCCG AGTTCGTGCG TCAGCGCGAC AAGATCATCT CCGCCCTGGT GGCGCTCGAC GCCGACGTCA TCGGCCTGAT GGAGATCGAG AACGACGGCT ACAGCAGCCT CAGCGCCATC GCCGACCTCA CCGCCGGCCT CAACGCCGCC CTGCCCGCTG GCGAGAGCTA CGACTTCGTC GATCCCGGCG TGTCGCAGAT CGGCAGCGAC GCCATCGCGG TCGGCTACCT CTACCGCACC CAGACCGCGG GTCTGGTCGG CGCCTCGGCC ATCCTCGATA GCTCGGTCGA CCCGCGCTTC GACGACACCA AGAACCGCCC GGCCCTGGCC CAGACCTTCG AGGAGCTGGC CTCGGGCGAG CGCTTCACCA TCGCCGTCAA CCACCTCAAG TCCAAGGGCT CCTCGTGCGA CTCGCTCGGC GACCCCGACA CTGGTGACGG CCAGGGCAAC TGCAACCTGA CCCGCACCGC GGCCGCCGAG GCGCTGGCCG ACTGGCTGGG CACCGACCCG ACCAGCTCGG GCGACGACGA CTTCCTGATC ATCGGCGACC TCAACGCCTA CGCCATGGAG GACCCCATCG CCGCGCTCCA GGCCGGCGGC TACACCGACC TGGCCGACGC CTTCATCGGC GCCGATGTCG CGTACTCGTA CATCTTCGAC GGCCAGGCCG GCTATCTCGA CTACGCGCTG GCCAACGACG CCCTGCTGGC CCAGGTGACC GGCGTGCACG AGTGGCACAT CAACACCGAC GAGCCCATCG CGCTCGACTA CAACGTCGAG TTCAAGTCGC CGGGGCAAAT CGACAGCCTG TACGACGACG GCCCCTACCG GGCCTCGGAT CACGACCCCG TGGTCATCGG CCTGGCGCTG CAGAGCGGAC CGCCCGGACG CCGCATCGCC ATCGCCGACC TCGACGGCTA CGCCGTGGCC TCCTTCGGCA CCTGGACCGC GTCGTCGGTG GTCAAGGTCA TGGACGACGC CGGCCAGCCG GTGGCCGGCG CCGAGGTCAT GGGCTCGTGG ATCGGCGGTC AGTACGAGGA CGCGAGCTGC GTGACCAACG CCAGCGGCCG CTGCACCGTG AGCGCGCCCT ACTTCTACCG CGCCGACCTG GCCTTCTTCG TGGTCAGCGA CGTGGTGCTC GCCGGCGCCA CCTACGACGC CGGCGCCAAC AGCGACCCGG ACGGCGACAG CGACGGACAC GCGGTCGAAG TCGCGGCGCC GCAGCGCCCG ACCTGGCCGC CGCGCTGA
|
Protein sequence | MIRVVIGALA ALWVGLSTQQ AHASTELFIS EYIEGSSNNK AVEIFNGTGA DVDLAGYELR FYFNGSTSPG FQLGLTGTVA DGDVFVVAHS SAVAAILDQA DVVSGSGFFN GDDAVALVNS GSIIDVIGQI GVDPGSQWGS GDASTQDNTL RRAEGFCEGD PDGSDAFDPA AEWDGFAQDS FDGLGAHEGC AAALAEDLFF SEYIEGSSNN KALEIFNGTG ADVALGLYQV QFYFNGSASA GFTLSLSGTV ADGDVFVLAH SSAVAAILDQ ADVTSGASFF NGDDAIVLLN DGVIIDVIGQ IGVDPGSQWG SGDASTQDNT LRRQGALCAG DADGSDAFDP AGEWDGFAQD SFDGLGAHEG CGGGAPDPDP DPDPDPDAVF VHQVQGTGTA SPMVGAQVII EGIVVGDFQG DLLGGFFLQE EDADADADPL SSEGIFVYQG ATGTEVAEGD LVRVSGTVAE YFDNTQLSSV SSVEIIATAQ PLPAVSDMLL PMLSADEFER YEGMLVRLPQ VLTVTENYTL GRYGEVWLSA GGRLMQPTAV ALPGDDALAV QAENDLNRLL IDDGRTAQNP DPIIFPAPGL SADNTLRSGD SVAGVVGALN FSFGSYRVQP TTAPSFVASN PRTPAPGAVG GSFKVASFNV LNYFNGDGQG GGFPTARGAD TAAEFVRQRD KIISALVALD ADVIGLMEIE NDGYSSLSAI ADLTAGLNAA LPAGESYDFV DPGVSQIGSD AIAVGYLYRT QTAGLVGASA ILDSSVDPRF DDTKNRPALA QTFEELASGE RFTIAVNHLK SKGSSCDSLG DPDTGDGQGN CNLTRTAAAE ALADWLGTDP TSSGDDDFLI IGDLNAYAME DPIAALQAGG YTDLADAFIG ADVAYSYIFD GQAGYLDYAL ANDALLAQVT GVHEWHINTD EPIALDYNVE FKSPGQIDSL YDDGPYRASD HDPVVIGLAL QSGPPGRRIA IADLDGYAVA SFGTWTASSV VKVMDDAGQP VAGAEVMGSW IGGQYEDASC VTNASGRCTV SAPYFYRADL AFFVVSDVVL AGATYDAGAN SDPDGDSDGH AVEVAAPQRP TWPPR
|
| |