Gene Hoch_3059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3059 
Symbol 
ID8545447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4219011 
End bp4222256 
Gene Length3246 bp 
Protein Length1081 aa 
Translation table11 
GC content74% 
IMG OID646387730 
Producthypothetical protein 
Protein accessionYP_003267458 
Protein GI262196249 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.893943 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAGG GTGAGCGGCA GACTGGTGGT TCTGAACAGC GCGCCGGCGA ACAGGGCACG 
GGTACGCGCG CGCCCGAGCG CCAGCGCGCG CGCCCGGCCC CGGGCAAGGT CACGCGCACC
GGCAAGCTCG CGAGCGGCGG CAACGGGGCC GTGCAGCGCC GGGCGGCCGA AGCCGACGCG
CCCACCGATC CCGAGGGCGG CGCGCCGCTC GACGGCGGCC TGCGCCAGCA GCTCGAGGAC
GCCACCGGCT TCTCGGGCCT GGCCAGCGTG CGCGTGCACA CCGGTTCGGC CTCGCAGCGC
GCGGCCGGCG GTCTCGGCGC CGCCGCGTTC ACCTCGGGGC AGCACATCCA CTTCGGCGCC
GGCCAGTACA ACCCCAACTC GGCCGCGGGC AAGCAGCTCA TCGCCCACGA GGTCGCGCAC
GTGGTCCAGC AGAGCGGCGC CCCGGCCAGC GCCACCGGCC AGGTCGGCAG CGCGGGCGAC
GCCCACGAGG CCGCGGCCGA TCGCTTCGCC GGCGGCTTCG CCATCGGCAT GCCCGTGGTC
CAGCGGCTGG CTCGCGGCAG CGCGCCGAGC GGCATGATCC AGCGCCGCAC GGGCACGCCG
CAGCCGCTCA CCGTGGAGGA GCTGGCCGCG CCGGTGCGCG AGCGCCTGGT GCAGAAACGC
GCCGAGGGCA CGGCCGCCCT GGTCGCCGAG GTGCGCTCGC TGCGCGCGCA GGCCGAGGGC
CAGGTGCTCG AGGCCGTCAA CCTGGCGGTA TCGCAGACGC TCACGCAGGA GGAGCAGGAC
GCGCTCGCCG GCAACACCTC GACAAGCTCG GAGGAAGGCG CCGAGGCGAC GCCGACACCC
GAGGGCGGCG AGCAGACCCC GAGGCCGGCC GCGAACGAGA ACGCCCAGGG CCAGGACGAG
GAGCCCTCGG CCACCCCCGA GGGCCCGCAG CCCGCCCCGA CCCCGAGCCC AACCGACGCG
GCCCCGCAGA ACGCCGAGAG CGAGGACGCC AGCGCCCAGG AGCCGGGCAC CGAGGGCCAG
GACGCCGAGG GCCAAGATCC GGCCGCCCAG GGCGACCAGA ACGCCCCCGG CCAGGGCGGC
GACCAGGCCC AGAACCAGGG CGGCGGCGGC GAGCAGGCTG GCGGCCAGGC CGCGACCCCG
GCGCCCATGA CCACCGAGCA GGCGCTGCAG CCGGTGGCGG CGCCGCCCGC GGACGAGCGC
GCGTTCATCC AGGGCGAGCT CGGCTTCCAC GAGAGCTGGA CCGCCATGCG CGGCTCCACG
GCCGACCGCG CCGCGCGCCT GTTCTCGGCC AACGACCTGG GCCAGGGTCT GCTCGGCGGC
GGCGTCCAGG TGCTCGCCGG CGTGGGCATC GAACAGCTCG CCAAGCGCGT CCCGGTGCCC
GGCCTGGGCA ACATGATCGG CGGCGGGCTC AGCGCCTACG CCCTGTTCTC CAACGGCGGC
GCCGGCATCC GCAACATCGC CGGCACCATC GGCGAGGGCT TCAACTGGGA CGGCAAGGGC
CCGTGGCAGA TCGCGGCCGA CGTGGTCGCC ATGATCAAGG GCGTCCTCGA CCTCATCGGC
AACATCTGCA ACATCCTCTC GGGCCTGGCC TACGCCTTCG CCGCCATCGC CGCGGTCGGC
GGCCTGCTCT CGGTGCTGTT CCCGCCCCTG GCCTTCCTGG TGCCCTACAT CCCCACGGCC
ATCAATTTCG GACGCATGTG CGGCGGCGTG GCCACCGTGT GCATGGGCAT CTCCGACCTG
ATATCGCCCA TCCCGCCGGT GCTGCGCGCG ATCCACATCC TGGTGTCGGA CCAGGACCCG
CTGCAGCTCG CGCAGCAGGA GCAGACCTAT CACGGCGAGC TCCAGGGCGC GATCGCCAGC
TACAGCGCGG CCGGCGCCAT GTCGGCGATC GAGGGCAAGG GCTTCAACCC GGTCGGCAAC
ATGGTCGGCG GCGTGGCCGA CGGCGCCGGG GTCGCGCGCA GCGCATACGG CGACGCGCGC
GCCGGCAACA CCGCCGTCAA CCGCGCCTTC GGCTTCGAGG GCGGCGACAC CCAGGGCATG
CGCGGCGCGG GCCAGAACAA CGGCGAGCGC TACTTCGACG TCAGCGGCCA GCAGCGCACG
CGGCTGCAGG AGAACACCGA GTCCGAGCAG CGCCTGCTCG GCAGCCGCCA GGACAACGCG
CAAGAGCACC GCCGCGCCGC CGACGAGGCC CATCAGCGGG CCCGCGACAA CGCCGACGGC
GGTCGCCGCG TGCGCCGCGA GGCCGAACTC GCCGAGCGCC GCGCGGTCAA ACGCGAGCAG
CGCGTCGAGC AGTCCGAGGG GCGCGTCGCC GACGCGAAGA ACCAGCAGGA GCTGCAGAGC
GGCATGCAGG CCGGCGGCCC CGGCGGCGAG GTCGGCAACA CCACCGAGAA CGCCCGCCAG
GCCTTCCAGA GCAACGGCGA GAGCGAGCGG CAAGAGCCGG GCGGCGGCGC CATCGAGCGC
GACCAGCGCG GCCACGTGGT CCTGCCGCCC CCGCCCGGCT CGCTGCAGGA GGTCGACCAG
CAGGATCAGA CCATCCAGCA GCTCCAGCAG CGCCTGGCCG CGCAGCAGCA GCACACCCAG
GGCGCCCAGG GCGTGCGCAC AGAGGCCGGC CAGCGCTCGG CCCAGCTCGG CGCCGTGCAG
CAGACCGTGG ACGGGCGCGT CCAGGAGCAC ACGGCGCTCG AGGCCGAGCA CGCCAACGTG
GCCCAGCAGA ACGCCGACGT CAGCAGCCGA ACCCAGGAGC AGAGCAGCAG CTCGGACAGC
GGTCTGGGCC GCGCCGCCGA GGTGCTGTCG CCGATGGTGG GCCCGGCCCA GACCGTCAAC
GATCTGGTGC AGCGGGTGCC CTCGAACCGC TTCTTCGACG TCTCCGGCAC CCAGCAGAGC
CTGAGCCAGT TCGTGCAGGG CATGGAGCAG ATCACGGGCG GGCGCGACGA CGCGCAGGAG
CAAAGCAGCC AGACCCAGCA GGTGCTGCAG TCGCGCGAGC AGCAGACGGC TGAGGCCGAC
CAGCTCCATC AGCAGACCAG CAGCGAGGGC CAGGAGCTGC TGATGTGCGT GCAGAACGAC
CAGGGTCAGG CCGACAGCGT GGCCCAGGAG GCCGCGTCCG AGCAGGCCAA CAGCCAGCAG
CAGGAGCAGA CCCTGGAGCA GCAGATCCAG CAGGCCATCC AGGAGCGCGA GCAGAAGTGG
AGCAGCCTGG TGGGCTGGGC CCAGCAGCAC TACTCCATCC GCCAACGCGC CAGCAGCGGA
AGCTGA
 
Protein sequence
MKQGERQTGG SEQRAGEQGT GTRAPERQRA RPAPGKVTRT GKLASGGNGA VQRRAAEADA 
PTDPEGGAPL DGGLRQQLED ATGFSGLASV RVHTGSASQR AAGGLGAAAF TSGQHIHFGA
GQYNPNSAAG KQLIAHEVAH VVQQSGAPAS ATGQVGSAGD AHEAAADRFA GGFAIGMPVV
QRLARGSAPS GMIQRRTGTP QPLTVEELAA PVRERLVQKR AEGTAALVAE VRSLRAQAEG
QVLEAVNLAV SQTLTQEEQD ALAGNTSTSS EEGAEATPTP EGGEQTPRPA ANENAQGQDE
EPSATPEGPQ PAPTPSPTDA APQNAESEDA SAQEPGTEGQ DAEGQDPAAQ GDQNAPGQGG
DQAQNQGGGG EQAGGQAATP APMTTEQALQ PVAAPPADER AFIQGELGFH ESWTAMRGST
ADRAARLFSA NDLGQGLLGG GVQVLAGVGI EQLAKRVPVP GLGNMIGGGL SAYALFSNGG
AGIRNIAGTI GEGFNWDGKG PWQIAADVVA MIKGVLDLIG NICNILSGLA YAFAAIAAVG
GLLSVLFPPL AFLVPYIPTA INFGRMCGGV ATVCMGISDL ISPIPPVLRA IHILVSDQDP
LQLAQQEQTY HGELQGAIAS YSAAGAMSAI EGKGFNPVGN MVGGVADGAG VARSAYGDAR
AGNTAVNRAF GFEGGDTQGM RGAGQNNGER YFDVSGQQRT RLQENTESEQ RLLGSRQDNA
QEHRRAADEA HQRARDNADG GRRVRREAEL AERRAVKREQ RVEQSEGRVA DAKNQQELQS
GMQAGGPGGE VGNTTENARQ AFQSNGESER QEPGGGAIER DQRGHVVLPP PPGSLQEVDQ
QDQTIQQLQQ RLAAQQQHTQ GAQGVRTEAG QRSAQLGAVQ QTVDGRVQEH TALEAEHANV
AQQNADVSSR TQEQSSSSDS GLGRAAEVLS PMVGPAQTVN DLVQRVPSNR FFDVSGTQQS
LSQFVQGMEQ ITGGRDDAQE QSSQTQQVLQ SREQQTAEAD QLHQQTSSEG QELLMCVQND
QGQADSVAQE AASEQANSQQ QEQTLEQQIQ QAIQEREQKW SSLVGWAQQH YSIRQRASSG
S