Gene Hoch_3608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3608 
Symbol 
ID8545998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4963377 
End bp4966403 
Gene Length3027 bp 
Protein Length1008 aa 
Translation table11 
GC content69% 
IMG OID646388277 
Producthypothetical protein 
Protein accessionYP_003268003 
Protein GI262196794 
COG category 
COG ID 
TIGRFAM ID[TIGR02243] conserved hypothetical protein, phage tail-like region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.607062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00290746 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCCCAT CGCCCAAGCT AGACGATCGC CAGTTCCAGG ACATCGTAGA CGAAGCGCTG 
CGGCTGATCC CTCGCTACGC GCCCGAGTGG ACCAACCACA ACCCCTCCGA TCCTGGTATC
ACTCTCCTCG AGCTAGCCGC GTGGATGACC GATCAGATTC TGTATCGGCT CAACCGGGTT
CCCGAGAAGA ACTACATCGC CTTCCTCAAT CTGCTCGGCA TCAAGCTCAA GCCGCCGCGC
GCGGCCTGGG TGCTGGTGCA GTTCGAGTTG GTCGAGGGCG CCTCGCGCCA GCGGGTCCCC
GCGGCCACCC AGGTGGCGAC GCCGCAGGGC ACCGAGGAGG AGACGGTCAC CTTCGAGACC
GAGCGCGACC TCATCGTCAC CAGCGTGTCG CTCGACCGGG TGTTTAGCTA CTTCAACGAC
ACCTACGCCG ACAACACGCC CTTCCTCGAC GGCAAGCGCG GCCACAGCTT CGAGGTCTTC
GGCGGCGCCG ACCGGGTCGA GCGCTTCCTG TACCTGAGCG ACCCGCGCCT GGCCAACGCC
GGCGAGTCCT CGGTGCTGCG CATCTTCCTC GGCTGCCCCG AGCGCGGCGG CCGCGACATG
GCCCGCCTGC TCGAGTGGGA GTACTGGAAC GGCGACCGCT GGAAAGAGCT GATCCAGGCG
CCGGTCGAGG TCGAGCGCGG CGAGGTGTGC TTCTTCGGAC CGCTGTCCTT CGCGCCCACC
GAGGTCGACG GCATCGAGGG CCTGTGGGTG CGCGGCCGCC TGGCCGAGGT CCCCGACAAC
GGCGAGGTCA CCGAGGTCGA CACCGTGCGC ATGCGCATCG AGGTGGTCGG CGACGGCGTG
ATCCCCGAGC GCGCGGTGGC CAACCTCGAC AACAACGCCT ACATCCTGCT CGACACCGGC
AAGAACATGT TCCCCTTCGG CAAAGAGCCG GGCGTGGACT GTGTGCTGTA CCTGGCCTGT
GACGAGCTCT TGCAGACCCC GGAGGCGTAC CTCGGCATCG AGTTCGTGCT CGCCGACGCC
TCGGCCATTC CCCGGCCGCT GCCGTCCGAC GACCTGGTGC TCGCCTGGGA GTTCTTCGAC
GGCAAGCGCT GGCAGCTTCT GGGCCGCTCC AACGTCCGCG GCATCCTGCC CGGCGCGGGC
GACGAGTACG GCTTCCACGA CGAGACCAAG GCGCTGAGCA AGAGCGGCGC GGTCCGTTTC
CGCCGCCCCA AGAACATGGA GCTGAGCGAG CTCAACGGCG AGGAGGCCCG CTGGGTGCGC
GCGCGCATCG AGAAGGGCGA CTACGGCGAG GCCGGCACCT ATACCCTCGA CGGCGACAAG
TGGGTGTTCA AAGAGGACCG CCCGCTGCGC CCGCCGGCGC TCAAGAGCAT CACCTTCCGC
TACCGCGAGG ACTACCGCAA CGTCCGCCAC ACGCTGGCGT TCAACGACTT CTCGTACACC
GACTACTCGG ACAACGCGCG CACCGACTTC ACCCTGTTTC AGCCCTTCTC GCCGCGCAGC
GACGAGTCGC CGACGCTGTA CCTGGGCTTC GACAGCGGCC TGCCCAACGA CGCCCTGGGC
GTCTACTTCC ACATGGAAGA GGAGCTGGGC CTCACCGCGC GCGACGCCGA CACCGGCGGC
TCCGAGGTGG TCACCGCCGA GCTCACCAAC TACAACGCCG CGCAGGCCAG CGCCTGGGCC
GCGGAGCAGC GCGTGGTGTG GGAGTACTGG GACAAGGAGA GCTGGGTGCC GCTGGTGGTC
TCGGACGAGA CCGAGAGCTT CACCCGCTCG GGCTTCATCC GCTTCGTGAG CCCCGAGGAC
TGGTCGGCGA CCATGAAGTT CACCGAGGAG CGCCACTGGC TGCGCGCGCG CCTGGAGATG
GGCGGCTACG TCAAGGCGCC GCGCATCCGC CGCATCCTCA CCAATGTGGT CGAGGCCCGC
AACCACACCA CCATCCGCGA CGAGATCCTG GGCTCTTCGG ACGCCTCGCC GCTGCAGAGC
TTCGACATCC TGCGCGCGCC GCTGCTCGAG GGCGAGCTCA TCGAGGTGCG CGAGCGCCAG
GCGCCCTCCG AGGAGGAGGA GGCCGCGCTC GAAGAGGGAG CGGTGCGCGC CATCAGCGAG
GACAGCGACG AGGTCTGGGT GCGCTGGAAG CGGGTCGATA GCTTCTTCGA GTCCGGCCCC
CGCGACCGCC ACTACCGCAT CGACTACCAG AGCAACGCGG TCACCTTCGG CGACGGACGC
CGGGGCATGA TCCCGCCCGA GGGCAAGAAC AGCATCGTGG CCCGCCGCTA TCAGATCGGC
GGCGGCGCCT CGGGCAACGT CAACCCCGAC ACGCTCACCT CGCTCAACCG CGCGCTGTCG
TACATCGACT CGGTGACCAA CCCCATCGGG GCCACCGGCG GCGCCGACCG CGAGAGCGTC
GAGGAGGCCA AGGAGCGGGC GCCGTACACC ATCAAGAGCC GCGACCGCGC GGTCACGGCC
GAGGACTTCG AGACCCTGGC GCTGCGCGCC TCGACCTCGA TCGCGCGCGC CAAGTGCGTG
CCCGACCGCA GCAACCGCGG CGCCGTGACC CTGGTGGTGC TGCCCAAGGC CGAGTCCGGC
GCGCGCGGGC TCACCCGCCG CCTGGTGCCC TCGAACGAGG TGCTGCGCTA CATCAAACGC
TACCTCGACG ACCGCCGCCT GGTCGGCACC ATCCTCAACG TGGTGCGCCC GCGCTACCGC
GACCTGTCGC TCAAGGTGAC GCTCTTGCGG CGCACCATCG GCACCTCGGA TCGGCTGCGG
CGCGACATCA CCATCAAGCT GCGCACCTAC CTGCATCCGC TGGTCGGCGG CCGCAGCGGC
AAGGGCTGGG AATTCGGACG AGCGGTCCTG AAAACCGAGC TCGTCCACGC GGTCGAAGAA
GTCCCCGGGG TCGAGGGCGT CGATGCCCTC GAGATCCTCG ATGAAGAGCG GCGCGTTCAC
GTCGAGCACG TGCGCGTCGA AGATGACGAG CTGCCCTTCT TGGTGCACGT GCACATCGTC
GAGAAGGTCC GCGACGAGAT CATGTGA
 
Protein sequence
MIPSPKLDDR QFQDIVDEAL RLIPRYAPEW TNHNPSDPGI TLLELAAWMT DQILYRLNRV 
PEKNYIAFLN LLGIKLKPPR AAWVLVQFEL VEGASRQRVP AATQVATPQG TEEETVTFET
ERDLIVTSVS LDRVFSYFND TYADNTPFLD GKRGHSFEVF GGADRVERFL YLSDPRLANA
GESSVLRIFL GCPERGGRDM ARLLEWEYWN GDRWKELIQA PVEVERGEVC FFGPLSFAPT
EVDGIEGLWV RGRLAEVPDN GEVTEVDTVR MRIEVVGDGV IPERAVANLD NNAYILLDTG
KNMFPFGKEP GVDCVLYLAC DELLQTPEAY LGIEFVLADA SAIPRPLPSD DLVLAWEFFD
GKRWQLLGRS NVRGILPGAG DEYGFHDETK ALSKSGAVRF RRPKNMELSE LNGEEARWVR
ARIEKGDYGE AGTYTLDGDK WVFKEDRPLR PPALKSITFR YREDYRNVRH TLAFNDFSYT
DYSDNARTDF TLFQPFSPRS DESPTLYLGF DSGLPNDALG VYFHMEEELG LTARDADTGG
SEVVTAELTN YNAAQASAWA AEQRVVWEYW DKESWVPLVV SDETESFTRS GFIRFVSPED
WSATMKFTEE RHWLRARLEM GGYVKAPRIR RILTNVVEAR NHTTIRDEIL GSSDASPLQS
FDILRAPLLE GELIEVRERQ APSEEEEAAL EEGAVRAISE DSDEVWVRWK RVDSFFESGP
RDRHYRIDYQ SNAVTFGDGR RGMIPPEGKN SIVARRYQIG GGASGNVNPD TLTSLNRALS
YIDSVTNPIG ATGGADRESV EEAKERAPYT IKSRDRAVTA EDFETLALRA STSIARAKCV
PDRSNRGAVT LVVLPKAESG ARGLTRRLVP SNEVLRYIKR YLDDRRLVGT ILNVVRPRYR
DLSLKVTLLR RTIGTSDRLR RDITIKLRTY LHPLVGGRSG KGWEFGRAVL KTELVHAVEE
VPGVEGVDAL EILDEERRVH VEHVRVEDDE LPFLVHVHIV EKVRDEIM