Gene Hoch_3711 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3711 
Symbol 
ID8546101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5108426 
End bp5110618 
Gene Length2193 bp 
Protein Length730 aa 
Translation table11 
GC content69% 
IMG OID646388378 
Producthypothetical protein 
Protein accessionYP_003268104 
Protein GI262196895 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0120943 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.530699 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACGAT CTGCACTGAT TCTTGCCTTT GGACTCGCTG TCACCGCCTG CGGTGGCGAG 
TCCGCGCCCC AGGGTCCCCT GGACGGCGTC GACTCCGTCG TGTTCCTGCA GCGCGCCAAG
CGCGGCGGCA CCGGCGACAT CTTCAACTAC ACATTCTATG AACCGGGCGG CCGGCTGGTG
ACGCTGACCC CGCCCACGGC CGACGGCGAG CTCGAGGTCA TCTGCTGTGA CCAGGACGAG
GCCTTCGCCC AGGCCGACAT CTCCGGCTAC GACCTGTCCT TCGACGCCCG CGAGATCGTG
TTCTCGGCCA AGCTGGGCCC GCAGGAGCAC TACGGCCTGT ACCTGCTCTC GCTCGAGAGC
GGCGAGATCA CGGCCATCCC GGGTGATCCC AACGAGGACT ACATCCAGCC GGCCTTCCTG
CCCGGCGACC GCATCTTCTT CACCACCAGC GCCGTGGTCG AGGACGGCAC CCCGCAGTTC
CGCGACGAGT ACGAGCGCCG CGTGACCTCG CAGATCGGCA TCATCAACCG CGACGGCACC
GAGGAGACCC TCGGGCCCCG CAACCTCTCG CACCGCGTGT TCCCGACCGT GCTCAGCGAC
GGCCGCGTGA TGCTCACGCA CTGGAACCAC CTCGGCGAGA TGAACGCCGG CCACCTGGTG
GTCATGAACC CGGACATGAC GCGCATGCGC GAGGCCTTCG GCAAAGAGGG CACCGGCGTG
ACCAACTCGT ACCTCAAGGC GGTCGAGATC GCCCCCGGCC GCGTCATCGC GGTGGGCACC
TCGCGCGACC GCACGGTGCA GTCGGGCGCG CTGATCGACA TCCGCCTGGG CGAGGTCTAC
TCCGAGGACG GCGAGCTGCG CGCCGACCGC AATATGTCCG AGGAGAACGC GACCTACAGC
ATCCTCACCC CGCAGGTGCC GCTGGGCAAC GAGCCCTCGT TCCAGAACGT GGGCCGCTAC
TACGACGCCT ACCCGCTCGA CAGCGGCGAG TATCCCAACC TGCTCACCTC CTGGGCCGAC
GGCCCGGTCG AGGACGCCTC GCTCGAGGCC GGCGGCCTCG ACGCCGACTT CGGCATCTAC
CTCTACGACT CGCAGAAGGG CCTGCGCCGC CCGGTGTGGA ACGACGCCAA CCGCTGGGAC
GTGTTCCCGC GGCCGCTGGT GTCGCGCAGC GCGCCGCCCG TCATCCCCGA CTCGGCGAGC
AACGAGATCG CCGGCGACGC CGTGCTGCTC GGCTCCACCA ACGTCTACGA GTCGAGCCTC
GATAGCTTCG AGGCCGGTTC GATCTACGGC GTGCGCGTGC TCGAGGGCTT CTCGAGCGAA
GAGGGCCTGC CGCGCGACTT CGGCCTCACC GAGCACGAGG GCTCCTCGCA GCTCGGCATC
GCCCAGGTCC GCGACGACGG TAGCTGGGCC GCGCTCATCC CGCCCAACGT GCCCATCCAC
TCGCAGGTCA TCGACCGCTT CGGCATGTCG CTGCGCAGCG AGCCCATCTG GATCTCGGGC
CGCAAAGGCG AGTCGCTGTT CTGCAACGGC TGCCACGAGG ATCGCGCCCG CACCACGGTG
CTCAACCCCG GCGTGACCGA GGCCGTCATC ATCGGCCCCA ACGACCTGCG CAGCACCGAG
GACCGCAGCC AGCGCCAGTC CAGCGACTTC TCGATCGACG CGGTCGCCGG CGTGCCCTGG
GACACCGCGC TGCAGCCGAT CTTCGACGCC AAGTGCGTGA GCTGCCACGA CGGCGACGCC
GCCAAGGAGG GCAACCCCTC GTACACCATC ACCGACCCCG AGAGCGGTGA GTCCTTCACC
TGGACCTTCG ATCTCAGCGG CAACGCGGTC GAGATCGGCG TCGGCGAGAC CCTGTTCAGC
GGCTACTCGG CCTCGCACCT GTCGCTGATG GGCCCGGGCC CCGAGGTCAC CCGCGACCTG
GAGAAAGCCG GCCTCGAGCT CGACGCCGAC ATCCCGGTGT TCGTCATCCC GTCCGAGGCG
CGCAACTCGC GCCTGCTGCA GAAGCTCAAC CCGCCGCAGC TCTTCCCCGA GACCGACACC
GATGTCCGCG CCTTCGACAC CGCCGTGCAC GGCGAGGAGC AGGGCTTCAC CCTCACCGCG
GACGAGTACT ACCTGCTGAT CCTCATGGCC GACAGCGGCG GTCAGTTCTA CTCGCGCGAG
AACGCGCCCA GCGAAACCGC CAATGCCAAC TGA
 
Protein sequence
MKRSALILAF GLAVTACGGE SAPQGPLDGV DSVVFLQRAK RGGTGDIFNY TFYEPGGRLV 
TLTPPTADGE LEVICCDQDE AFAQADISGY DLSFDAREIV FSAKLGPQEH YGLYLLSLES
GEITAIPGDP NEDYIQPAFL PGDRIFFTTS AVVEDGTPQF RDEYERRVTS QIGIINRDGT
EETLGPRNLS HRVFPTVLSD GRVMLTHWNH LGEMNAGHLV VMNPDMTRMR EAFGKEGTGV
TNSYLKAVEI APGRVIAVGT SRDRTVQSGA LIDIRLGEVY SEDGELRADR NMSEENATYS
ILTPQVPLGN EPSFQNVGRY YDAYPLDSGE YPNLLTSWAD GPVEDASLEA GGLDADFGIY
LYDSQKGLRR PVWNDANRWD VFPRPLVSRS APPVIPDSAS NEIAGDAVLL GSTNVYESSL
DSFEAGSIYG VRVLEGFSSE EGLPRDFGLT EHEGSSQLGI AQVRDDGSWA ALIPPNVPIH
SQVIDRFGMS LRSEPIWISG RKGESLFCNG CHEDRARTTV LNPGVTEAVI IGPNDLRSTE
DRSQRQSSDF SIDAVAGVPW DTALQPIFDA KCVSCHDGDA AKEGNPSYTI TDPESGESFT
WTFDLSGNAV EIGVGETLFS GYSASHLSLM GPGPEVTRDL EKAGLELDAD IPVFVIPSEA
RNSRLLQKLN PPQLFPETDT DVRAFDTAVH GEEQGFTLTA DEYYLLILMA DSGGQFYSRE
NAPSETANAN