Gene Hoch_3164 details


Gene Information

Locus tag: Hoch_3164
Symbol:
ID: 8545552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4352904
End bp: 4356266
Gene Length: 3363 bp
Protein Length: 1120 aa
Translation table: 11
GC content: 75%
IMG OID: 646387831
Product: hypothetical protein
Protein accession: YP_003267559
Protein GI: 262196350
COG category: [S] Function unknown
COG ID: [COG4676] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
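
The coordinate and length fields above agree with one another, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a CDS that includes its stop codon; both are assumptions about this record's conventions, although the listed numbers are consistent with them. The function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


gene_len = span_length(4352904, 4356266)
print(gene_len)                  # 3363
print(protein_length(gene_len))  # 1120
```

Both values match the Gene Length and Protein Length fields in the record.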


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.875829
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACCGA CCCACCCGTC TCTGCGAACG GCCGCCCTGG CCGCGCTCGC CACCCTCGCC 
GCTGCCTGTA CGACCTCGCC GGGCCCGGAC CCGAGCCCGG TCAAACCCAA CCCGCCCAAG
CACACCGCGG GCACCGAGTC CGGCGACCCC GACGCGCCGC CCGAGCGCCT CAGCCCGCTG
GCGGTGACCG AGGCCAGCGC CAGCTCCTTC GCCGGCTTCG ACTCGAGCGC GCTGCTGGCC
GCAGCCACGC GCTATCCCAC GGCCGAGCTG GCCACCGACG TGGCCCCGCC GATGTCGCTC
ACGGCCTCGG ACGGCACCGG CCTGCGCCTG GTGGCCCTGG ACGCGCGCGC GGTCATCGAG
GGCCCGCTGG CCTTCACCGA GCTGCACCTG TCCTTCCAGA ACCCGCGGCC CAATACCATC
GAGGGCCGCT TCGCCATCAC CTTGCCGGAG GGCGCGGCCA TCAGCCGCCT GGCCATGCGC
CTGCCCTCGG GCTGGCAGGA GGCCGAGGTG GTCGAGCGCC AGGCCGCGCG CCGCACCTAC
GAGGACTTCC TGCACCGCCG CCAGGACCCG GCGCTGCTCG AGAAGGAGGC CGGCAACGAG
TTCCGCGCCC GCATCTTCCC GATCGCGGGC AACGAGCGCA AAGACATCAT CATCTCGTAC
TCGCAGAACC TGATCGGTGA CCAGGCGGTG TATCGCCTGC CGCTGCGCGG GCTGCCGGCC
GTGGACGAGC TGAAGGTGAG CGCCCTGGTC GGCGGCCGCA AGGGCGAGGG CCTGGCGTTC
AAGCCCGCGA CCATGAGCCT GCAGCAGCAG GCGCCGCAGC GCGACTTCGA GGTGGTGCCG
CCGCCCGCGG CCCAGCGCCC GCGCGGTCTG CGCTACGGCG CCCAGCTCGT GGCCCGCGTG
GCCCCCGAGG TGGCCACCGC CGAGCAGCGC CTCGACAATG TGGTGCTCTT GTTCGACACC
AGCGCCTCGC GCGCGCCCGG CTTCGCCGGC GCGGTGGACG ATTTCGGACG CCTGGTCGCC
GAGCTCGCCG ACCTGCACGG CGACGGCCTG CGCGTGCGCG TGGCCGCCTT TGACCAGTCG
GTGACGCCGC TCTACGAAGG CCCGGCCACC GGCTTTGGCG GCAAGCAGCT CGACGCCCTG
CGCGCGCGCC GTCCGCTCGG CGCCTCGGAC CTGCACGCGG CCCTGAGCTG GGCCGGCGCC
CAGGGCGGGC GCATCGTGGT GCTCAGCGAC GGCATCGCCA CCACCGGCCC GTCCGAGGCC
GGCGAGCTGC GCGCCACGGC CCGCGCGATG GGCGACGCGG TGCCGCGCAT CGACATGATC
CTGGCCGGCG GCATCCGCGA CCGCGCCATG GCCGAGCGCC TGGTCCGCGG CACCCGCGCC
CACGACGGCG TGGTGCTCGA CGCCGAGAGC CCGGCCAAGA CCCTGGCCGA GCGCCTCAGC
CAGACCACGG TGAGCGGCAT CGAGGTCGCG GTCCCGGGCG CGGCCTGGGT GTGGCCGCAC
ACGCTCGACG GCGTGCAGCC GGGCGATGAG CTGCTGGTCT TCGCCCAGCT CGACAGCGCC
GGCGCCCTGG CCGCCAGCAA GCCGATGCAG GTGCAGCTCA GCGGCCCGGT CGCCGGCGGC
GCCCAGACCG TCAGCGTGCC GCTCTCGAGC GTGCCCGGTC CGCTGCTCGA GCGCGCGGTC
GCGCAGGCCC GCATCGAGCT GCTCACGGCG CAGCGCGACA CCCTCGACGA CGCCCCGGGC
GCGGCGGCCG AGCGCGTCCG CCTGCACAAC GAGATCGTAG CGGTGTCCAC CGAGCACCGC
GTGCTCTCGG ACGCCACCGC CCTGCTGGTG CTCGAGACCG AGCAGGATTA CGCGCGCTAC
GGCATCGACC GCAAGGCGCT CTCGAACATC CTGAGCGTGG GCCCCAGGGG CCTCGAGATG
CTCCAGCGCG GCGGCCCGGT GGTGCTGGCG CAACCGGCGA CCCAGAAGGC CAAGAAGGAC
GCGCGCATGG ACGCGAACAA GGGCTCGCGC GGCGGGCTCG AGCTGGCCAC CGAGGGCAAC
GCGCAAGGCG AAGGCGGCGC GCCCGCCGAC GACGGCGCGC TCATGCAGGG CGATCCCGCG
GAGCGCGAGT TCGATAGCGA GGCTATCGGC GGCACCCCGG GCGGGGCGGT TCCGGCCGCC
GAGCCCGCTC CGGCTGCCGA GATGGCGGCC GGCGGTACCG GCTCGGGCGG CGGCGGTGAC
GGCTACGGCC GCCTGGCCGA GGCCCGGGAC GAGGCCCCGC GCATGGCCGA CGACGCCGAC
GAGGCCGAGG AGGCGTCTGC GGTGCCGCCG CCGCCGCCCT CGCGTCCGGC GCCGCGCCGC
AGCCGCCGCG CGTCGTCTGC GTCGGCGGCG GCGCCGAGCA TGGACCGCAG CCGCAACGAG
GCCCCGGCTC AGCTCGCGCA GGCGCCCGCG GATAAGAAGC CGAGCGCGCC CGCGCTCACC
GGCAAGCTGG CCGAGGTCAT GGCGCTGCTC GATCGCGGCG AGCTCGACTC GGCCGTGACC
CTGGCCCTGC GCTGGCAGAG CGAGGAGCCG GGCGACGTGC TCGCCCTGGT CGCGCTCGGC
GAGACGCTCG AGGCCGCCAA GCGCCCGGCG CTGGCCGCGC GCGCCTACGG CTCGCTCATC
GACCTCTTCC CCTCGCGCGC CGACATGCGC CGCTTCGCCG GCGAGCGCCT CGACCGCCTG
AGCAGCCACG GCGCCGACCT GGCCGCGGAC ACCTACGAGA AGGCCGTGGC CCAGCGCCCC
GATCACCTCA CCGGCCACCG CCTGCTGGCC ATGGCCCTGC TCCGCCAGGG CAACTACGCG
GCCGCCTTCG ACGCCGCCCT CGCCGGCCTG GCCCGCGAGT ATCCCGAAAA CCGCTTCCGC
GGCGGCAAGC GCATCCTGCG CGAGGACCTG GGCCTGATCG CGGCCGCGTG GCTGGCCGCC
GAGCCGGGCG AGAGCAAGAC CGTCGAGGGG CGGCTGCGCA GCGCGGGCGC CGAGCTGGCC
ACCCAGCCCT CGACGCGCTT CGTGCTCACC TGGGAGACCG ACGCCAATGA CGTCGATTTC
CACATCCACG ACGCCAAAGG CGGACACGCG TACTACAGCT CGCCCGAGCT GCCCTCGGGC
GGCGCGCTGT ACGCCGACGT GACCACGGGC TACGGCCCCG AGTGCTTCAC CATCCCGGGC
AAGGCCCAGG CCGGCCCCTA CCGCCTGCAG CTCCACTACT ACAGCCGCGG CCCCATGGGC
TACGGCATGG GCAAGCTCGA GATCCTCAAG CACGACGGCA AGGGCGGCCT CAGCTTCGAG
CAGCGGCCCT TCGTGGTCAT GCTCGATGGC GCCTACGTCG ATATCGGCGA GGTCCACCCG
TAG
 
Protein sequence
MTPTHPSLRT AALAALATLA AACTTSPGPD PSPVKPNPPK HTAGTESGDP DAPPERLSPL 
AVTEASASSF AGFDSSALLA AATRYPTAEL ATDVAPPMSL TASDGTGLRL VALDARAVIE
GPLAFTELHL SFQNPRPNTI EGRFAITLPE GAAISRLAMR LPSGWQEAEV VERQAARRTY
EDFLHRRQDP ALLEKEAGNE FRARIFPIAG NERKDIIISY SQNLIGDQAV YRLPLRGLPA
VDELKVSALV GGRKGEGLAF KPATMSLQQQ APQRDFEVVP PPAAQRPRGL RYGAQLVARV
APEVATAEQR LDNVVLLFDT SASRAPGFAG AVDDFGRLVA ELADLHGDGL RVRVAAFDQS
VTPLYEGPAT GFGGKQLDAL RARRPLGASD LHAALSWAGA QGGRIVVLSD GIATTGPSEA
GELRATARAM GDAVPRIDMI LAGGIRDRAM AERLVRGTRA HDGVVLDAES PAKTLAERLS
QTTVSGIEVA VPGAAWVWPH TLDGVQPGDE LLVFAQLDSA GALAASKPMQ VQLSGPVAGG
AQTVSVPLSS VPGPLLERAV AQARIELLTA QRDTLDDAPG AAAERVRLHN EIVAVSTEHR
VLSDATALLV LETEQDYARY GIDRKALSNI LSVGPRGLEM LQRGGPVVLA QPATQKAKKD
ARMDANKGSR GGLELATEGN AQGEGGAPAD DGALMQGDPA EREFDSEAIG GTPGGAVPAA
EPAPAAEMAA GGTGSGGGGD GYGRLAEARD EAPRMADDAD EAEEASAVPP PPPSRPAPRR
SRRASSASAA APSMDRSRNE APAQLAQAPA DKKPSAPALT GKLAEVMALL DRGELDSAVT
LALRWQSEEP GDVLALVALG ETLEAAKRPA LAARAYGSLI DLFPSRADMR RFAGERLDRL
SSHGADLAAD TYEKAVAQRP DHLTGHRLLA MALLRQGNYA AAFDAALAGL AREYPENRFR
GGKRILREDL GLIAAAWLAA EPGESKTVEG RLRSAGAELA TQPSTRFVLT WETDANDVDF
HIHDAKGGHA YYSSPELPSG GALYADVTTG YGPECFTIPG KAQAGPYRLQ LHYYSRGPMG
YGMGKLEILK HDGKGGLSFE QRPFVVMLDG AYVDIGEVHP
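
The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch, the blocks can be joined into one contiguous string before any further analysis, such as computing a GC fraction to compare against the GC content field. The helper names are hypothetical, and the code assumes an unambiguous A/C/G/T alphabet.

```python
def join_blocks(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGACACCGA CCCACCCGTC TCTGCGAACG GCCGCCCTGG CCGCGCTCGC CACCCTCGCC"
seq = join_blocks(first_line)
print(len(seq))           # 60
print(gc_fraction(seq))
```

Applied to the full gene sequence, the same two helpers give a value that can be checked against the reported 75%.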