Gene Information |
Locus tag | Hoch_1793 |
Symbol | |
ID | 8544175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 2476961 |
End bp | 2479918 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646386499 |
Product | SNF2-related protein |
Protein accession | YP_003266234 |
Protein GI | 262195025 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
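The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(2476961, 2479918)
print(length_bp)                  # 2958
print(protein_length(length_bp))  # 985
```

Both values agree with the Gene Length and Protein Length rows of the record.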
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00186852 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.607723 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGATCTTC AGTCGCTGCA GAAGGCGGTG CGCCGCGCGT CGACCACCCA GGTCTGGTCT CAGGGCGTCC GGCTGTCGCG CGAGGACGCG GTCACCGGGG AGAGCGCCGA CCCGGACGAG GTCGTGCTGC GGGTCAAGGC GCCCGGGCGC GCGGTGGCGC CGACCGTGGT GCTGTACCTC GAGGACCTCG AGTGGGAGTG CGATTGCCCC AACAGTGAGG ACACCTGCGC GCACGTGGTC GCGGCCGTGC TGGCGCTGGT CCAGGCGCGT CAACAGGGCC ACGAGCTGCC GCAATCGCAG AGCGCGGGCG GCAAGGTGCG CTATCACTTT CGCCGCGAGG GCGGCGGGCT GGCGCTCGAC CGCGTCATCG TCGCGGCCGA CGGCGCCGAG CACCCCATCC CGCAGACCCT GTCGGCGCTG ATCTCGGGGC GCACGCCCGG ACCCTCGGTG GCGCCCGAGC AGTCCGACCT GAGCGTCGAC CAGCTTCTGG GCACGCGCGT GCGCGGTCGC CTGCCGGCCG ATCGCGTCAT CCATCTGCTC ACCGCGCTGG CCGGTCACGG CGAGATCGTG CTCGACGGCG AGCCCGTGAG CATCTCCGAC GAGGTGCTGG TGCCGCGCGC CGTCGTCGAG GACAGCAAGG GCGGCGCGCG CCTCACCATC GCCCGCGATC CCCGCATCGA CGAGGTCGTG ACCCTGGGCG TGGCGCGCTG CGCCCAGGTG CTCCATCGCC TGGGCGAGAC CGAGCTGTCG GGGCAGCGCC TCGAGCATCT GCCGCGGGTG GAGGAGGTGC CGCGCACCGA GCTGCACACC CTGCTCGAGC AGACCGTGCC CGCGCTTGAG CGGCGCATCC CGGTCGAGGT CAGGACCCGG CGCGCGCCCG AGGTGGCCCG GGACGCGCGT CCGGATATCC GCCTCGACGT GCGCCAGGAG GGCCACCGCC TGGCCGTGCT GCCGACCTTG GTGTACGTCG AGCGCAACGG CGACGAGCCC ATCGCGCGCA TCGACGGCGG CCGCCTGGTG CACCTGCGCG GCCCGGTGCC CGTGCGCGAC GAAGCCGGCG AGAAATACCT GCTGCAGCGG CTGCGCGACC AGCTCAACAT GATGCCCGGG CGGCGCGTGG AGTTCGAGGG CCGCGACGCG GTGAGCTTTC AGAAGAAGCT CGCCGACTGG AGCGGGCGGC TCACGGGCGA TGTCGAGCGC GAGCGGTTTT TGGACGCCCC GCCCATCATC CCGCGGCTGC AGCTCGACGG CGCCGCGGCC GATATCTATT TCGAGGCGCC GGCGGGCGAG GGCGGCGCGG GCGGCGAGGG CGACGCGCAG CGGATCGATG CCGCGACCGT GATGCGCGCG TGGCGCCAGG GCGATGGCTT CGTGGCCCTG CCCGCGGGCG GCTGGGCGCC GCTGCCGCAG GCGTGGCTGC GCGATCACGG CCACCTGGTG CTCGACCTGC TGGCCGCGCG CGATGAGCGC GACCAGGTGC CGCCCTTCGC GCTGCCCGAT CTCGCGCGCC TGTGCGCGCT GCTCGAGCAC CCGCCGCCGC CCGGACTCGA CGCCCTGGCG CCGCTGTTCG AGCAGTTCAC CGAGCTGCCG CCGGCGCCGC TCCCGGACGA CCTGCGCGCC GAGCTGCGCG GCTATCAGCA CACCGGCGTG CGCTGGCTCA GCTTTTTGCG CGACGCCGGT CTGGGCGCGG TGCTGGCCGA CGACATGGGT CTGGGCAAAA CCCTGCAGAC CCTGTGCGCG CTGCGCGGGC GGAGTTTGGT CGTGTGTCCG ACAAGCGTGG TGCACAACTG GGCCGACGAG 
CTGCGGCGTT TTCGACCGGC CCTGCGCGTG GCCCTGTACC ACGGCCCGCG CCGCGAGCTC GATCCCGAGG CCGACGTGGT GCTCACCAGC TACGCGCTGC TGCGCCTCGA CATCGAGCAG CTCTCGGCCA TCGCCTGGGA CGCCGTGGTG CTCGACGAAG CCCAGGCCAT CAAAAACCCC GAGAGCCAGG TCGCGCGCGC GGCCTTTCGC CTCGACGCCG GCTTCCGCGT GGCGCTCAGC GGTACCCCGG TCGAAAACCG GCTCGACGAG CTGTGGAGCC TGTTCCACTT CGTCAACCGC GGTCTGCTCG GCGGCCGCAA GAGCTTCAAA GAGCGCTACG CCGACCCCGT GGCCCGCGGC GAGGACGGCG CCGCCGAGCA GCTTCGCGCG CGCATCCGCC CGTTTCTGCT GCGCCGGCGC AAACGCGAAG TCGCGCCCGA GCTGCCGCCG CGCACCGAGG CGGTGTTGCA CTGCGAGCTG TCCGATTCCG AGCGCGCGGC CTACGACGCG GTGCGCGCGG CCACCCAACG CGACGTGCTC GAGCGCCTGG CCCACGGCGG CGGCGTGATG GAGGCGCTCG AGGCGCTGCT GCGTCTGCGC CAGGCCGCGT GTCACCCCGC GCTCCTGCCC GGACGCGAGG CCGATACCTC GGCCAAGATG GAGATGCTGG TCGACGCCCT GAGCGTGGTC GCGGCCGAGG GCGGCAAAGC GCTGGTGTTC TCGCAATGGA CAGGTCTGCT CGATCTCATC GAGCCACACC TGCGCGCGGC CGAGATATCG TTCAATCGTC TCGACGGCAG CACCCGCGAT CGCGGCGGCG TGGTCGCCGC GTTTCAGGAC GAGAGCGGTC CCACGGTGAT GCTCATCTCG CTCAAAGCCG GCGGCACCGG ACTCAATCTC ACCGCGGCCG ATCATGTCTT TTTGTGCGAC CCGTGGTGGA ATCCGGCAGT CGAGGAGCAA GCGGCCGATC GCGCCCATCG CATCGGACAG GATCGTCCGG TCATGGTGTA TCGACTGGTG AGCAAGGATA CTGTCGAAGA GCGCATACTA GCCCTGCAAG AGCAGAAGCG CGCCTTGGCC GAAGCCGCGA TCGGCGAGGG CGCGCGCGCC GCGCAGCTCA CGCGCGACGA TCTCATGGCG CTGCTCGCGG CCGGTTGA
|
Protein sequence | MDLQSLQKAV RRASTTQVWS QGVRLSREDA VTGESADPDE VVLRVKAPGR AVAPTVVLYL EDLEWECDCP NSEDTCAHVV AAVLALVQAR QQGHELPQSQ SAGGKVRYHF RREGGGLALD RVIVAADGAE HPIPQTLSAL ISGRTPGPSV APEQSDLSVD QLLGTRVRGR LPADRVIHLL TALAGHGEIV LDGEPVSISD EVLVPRAVVE DSKGGARLTI ARDPRIDEVV TLGVARCAQV LHRLGETELS GQRLEHLPRV EEVPRTELHT LLEQTVPALE RRIPVEVRTR RAPEVARDAR PDIRLDVRQE GHRLAVLPTL VYVERNGDEP IARIDGGRLV HLRGPVPVRD EAGEKYLLQR LRDQLNMMPG RRVEFEGRDA VSFQKKLADW SGRLTGDVER ERFLDAPPII PRLQLDGAAA DIYFEAPAGE GGAGGEGDAQ RIDAATVMRA WRQGDGFVAL PAGGWAPLPQ AWLRDHGHLV LDLLAARDER DQVPPFALPD LARLCALLEH PPPPGLDALA PLFEQFTELP PAPLPDDLRA ELRGYQHTGV RWLSFLRDAG LGAVLADDMG LGKTLQTLCA LRGRSLVVCP TSVVHNWADE LRRFRPALRV ALYHGPRREL DPEADVVLTS YALLRLDIEQ LSAIAWDAVV LDEAQAIKNP ESQVARAAFR LDAGFRVALS GTPVENRLDE LWSLFHFVNR GLLGGRKSFK ERYADPVARG EDGAAEQLRA RIRPFLLRRR KREVAPELPP RTEAVLHCEL SDSERAAYDA VRAATQRDVL ERLAHGGGVM EALEALLRLR QAACHPALLP GREADTSAKM EMLVDALSVV AAEGGKALVF SQWTGLLDLI EPHLRAAEIS FNRLDGSTRD RGGVVAAFQD ESGPTVMLIS LKAGGTGLNL TAADHVFLCD PWWNPAVEEQ AADRAHRIGQ DRPVMVYRLV SKDTVEERIL ALQEQKRALA EAAIGEGARA AQLTRDDLMA LLAAG
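The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds the codon-to-residue mapping of translation table 11 from the NCBI-style amino acid string (the residue assignments are the same as in the standard code; table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG). Only the first 30 and last 9 bases of the gene are used as input; the names `CODON_TABLE` and `translate` are hypothetical helpers, not a library API.

```python
from itertools import product

# Codons enumerated in NCBI order (TTT, TTC, TTA, TTG, TCT, ...), paired
# with the amino acid string for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Spaces (as used in the 10-base display blocks) are ignored, and any
    trailing bases that do not fill a whole codon are dropped.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGGATCTTC AGTCGCTGCA GAAGGCGGTG"))  # MDLQSLQKAV
print(translate("GCCGGTTGA"))                          # AG*
```

The first output matches the opening block of the protein sequence, and the second shows the final two residues followed by the TGA stop codon, which is not included in the 985 aa protein.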