Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3608 |
Symbol | |
ID | 8545998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 4963377 |
End bp | 4966403 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646388277 |
Product | hypothetical protein |
Protein accession | YP_003268003 |
Protein GI | 262196794 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02243] conserved hypothetical protein, phage tail-like region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.607062 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00290746 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCCCAT CGCCCAAGCT AGACGATCGC CAGTTCCAGG ACATCGTAGA CGAAGCGCTG CGGCTGATCC CTCGCTACGC GCCCGAGTGG ACCAACCACA ACCCCTCCGA TCCTGGTATC ACTCTCCTCG AGCTAGCCGC GTGGATGACC GATCAGATTC TGTATCGGCT CAACCGGGTT CCCGAGAAGA ACTACATCGC CTTCCTCAAT CTGCTCGGCA TCAAGCTCAA GCCGCCGCGC GCGGCCTGGG TGCTGGTGCA GTTCGAGTTG GTCGAGGGCG CCTCGCGCCA GCGGGTCCCC GCGGCCACCC AGGTGGCGAC GCCGCAGGGC ACCGAGGAGG AGACGGTCAC CTTCGAGACC GAGCGCGACC TCATCGTCAC CAGCGTGTCG CTCGACCGGG TGTTTAGCTA CTTCAACGAC ACCTACGCCG ACAACACGCC CTTCCTCGAC GGCAAGCGCG GCCACAGCTT CGAGGTCTTC GGCGGCGCCG ACCGGGTCGA GCGCTTCCTG TACCTGAGCG ACCCGCGCCT GGCCAACGCC GGCGAGTCCT CGGTGCTGCG CATCTTCCTC GGCTGCCCCG AGCGCGGCGG CCGCGACATG GCCCGCCTGC TCGAGTGGGA GTACTGGAAC GGCGACCGCT GGAAAGAGCT GATCCAGGCG CCGGTCGAGG TCGAGCGCGG CGAGGTGTGC TTCTTCGGAC CGCTGTCCTT CGCGCCCACC GAGGTCGACG GCATCGAGGG CCTGTGGGTG CGCGGCCGCC TGGCCGAGGT CCCCGACAAC GGCGAGGTCA CCGAGGTCGA CACCGTGCGC ATGCGCATCG AGGTGGTCGG CGACGGCGTG ATCCCCGAGC GCGCGGTGGC CAACCTCGAC AACAACGCCT ACATCCTGCT CGACACCGGC AAGAACATGT TCCCCTTCGG CAAAGAGCCG GGCGTGGACT GTGTGCTGTA CCTGGCCTGT GACGAGCTCT TGCAGACCCC GGAGGCGTAC CTCGGCATCG AGTTCGTGCT CGCCGACGCC TCGGCCATTC CCCGGCCGCT GCCGTCCGAC GACCTGGTGC TCGCCTGGGA GTTCTTCGAC GGCAAGCGCT GGCAGCTTCT GGGCCGCTCC AACGTCCGCG GCATCCTGCC CGGCGCGGGC GACGAGTACG GCTTCCACGA CGAGACCAAG GCGCTGAGCA AGAGCGGCGC GGTCCGTTTC CGCCGCCCCA AGAACATGGA GCTGAGCGAG CTCAACGGCG AGGAGGCCCG CTGGGTGCGC GCGCGCATCG AGAAGGGCGA CTACGGCGAG GCCGGCACCT ATACCCTCGA CGGCGACAAG TGGGTGTTCA AAGAGGACCG CCCGCTGCGC CCGCCGGCGC TCAAGAGCAT CACCTTCCGC TACCGCGAGG ACTACCGCAA CGTCCGCCAC ACGCTGGCGT TCAACGACTT CTCGTACACC GACTACTCGG ACAACGCGCG CACCGACTTC ACCCTGTTTC AGCCCTTCTC GCCGCGCAGC GACGAGTCGC CGACGCTGTA CCTGGGCTTC GACAGCGGCC TGCCCAACGA CGCCCTGGGC GTCTACTTCC ACATGGAAGA GGAGCTGGGC CTCACCGCGC GCGACGCCGA CACCGGCGGC TCCGAGGTGG TCACCGCCGA GCTCACCAAC TACAACGCCG CGCAGGCCAG CGCCTGGGCC GCGGAGCAGC GCGTGGTGTG GGAGTACTGG GACAAGGAGA GCTGGGTGCC GCTGGTGGTC TCGGACGAGA CCGAGAGCTT CACCCGCTCG GGCTTCATCC GCTTCGTGAG CCCCGAGGAC TGGTCGGCGA CCATGAAGTT CACCGAGGAG CGCCACTGGC TGCGCGCGCG CCTGGAGATG GGCGGCTACG TCAAGGCGCC GCGCATCCGC CGCATCCTCA CCAATGTGGT CGAGGCCCGC AACCACACCA CCATCCGCGA CGAGATCCTG GGCTCTTCGG ACGCCTCGCC GCTGCAGAGC TTCGACATCC TGCGCGCGCC GCTGCTCGAG GGCGAGCTCA TCGAGGTGCG CGAGCGCCAG GCGCCCTCCG AGGAGGAGGA GGCCGCGCTC GAAGAGGGAG CGGTGCGCGC CATCAGCGAG GACAGCGACG AGGTCTGGGT GCGCTGGAAG CGGGTCGATA GCTTCTTCGA GTCCGGCCCC CGCGACCGCC ACTACCGCAT CGACTACCAG AGCAACGCGG TCACCTTCGG CGACGGACGC CGGGGCATGA TCCCGCCCGA GGGCAAGAAC AGCATCGTGG CCCGCCGCTA TCAGATCGGC GGCGGCGCCT CGGGCAACGT CAACCCCGAC ACGCTCACCT CGCTCAACCG CGCGCTGTCG TACATCGACT CGGTGACCAA CCCCATCGGG GCCACCGGCG GCGCCGACCG CGAGAGCGTC GAGGAGGCCA AGGAGCGGGC GCCGTACACC ATCAAGAGCC GCGACCGCGC GGTCACGGCC GAGGACTTCG AGACCCTGGC GCTGCGCGCC TCGACCTCGA TCGCGCGCGC CAAGTGCGTG CCCGACCGCA GCAACCGCGG CGCCGTGACC CTGGTGGTGC TGCCCAAGGC CGAGTCCGGC GCGCGCGGGC TCACCCGCCG CCTGGTGCCC TCGAACGAGG TGCTGCGCTA CATCAAACGC TACCTCGACG ACCGCCGCCT GGTCGGCACC ATCCTCAACG TGGTGCGCCC GCGCTACCGC GACCTGTCGC TCAAGGTGAC GCTCTTGCGG CGCACCATCG GCACCTCGGA TCGGCTGCGG CGCGACATCA CCATCAAGCT GCGCACCTAC CTGCATCCGC TGGTCGGCGG CCGCAGCGGC AAGGGCTGGG AATTCGGACG AGCGGTCCTG AAAACCGAGC TCGTCCACGC GGTCGAAGAA GTCCCCGGGG TCGAGGGCGT CGATGCCCTC GAGATCCTCG ATGAAGAGCG GCGCGTTCAC GTCGAGCACG TGCGCGTCGA AGATGACGAG CTGCCCTTCT TGGTGCACGT GCACATCGTC GAGAAGGTCC GCGACGAGAT CATGTGA
|
Protein sequence | MIPSPKLDDR QFQDIVDEAL RLIPRYAPEW TNHNPSDPGI TLLELAAWMT DQILYRLNRV PEKNYIAFLN LLGIKLKPPR AAWVLVQFEL VEGASRQRVP AATQVATPQG TEEETVTFET ERDLIVTSVS LDRVFSYFND TYADNTPFLD GKRGHSFEVF GGADRVERFL YLSDPRLANA GESSVLRIFL GCPERGGRDM ARLLEWEYWN GDRWKELIQA PVEVERGEVC FFGPLSFAPT EVDGIEGLWV RGRLAEVPDN GEVTEVDTVR MRIEVVGDGV IPERAVANLD NNAYILLDTG KNMFPFGKEP GVDCVLYLAC DELLQTPEAY LGIEFVLADA SAIPRPLPSD DLVLAWEFFD GKRWQLLGRS NVRGILPGAG DEYGFHDETK ALSKSGAVRF RRPKNMELSE LNGEEARWVR ARIEKGDYGE AGTYTLDGDK WVFKEDRPLR PPALKSITFR YREDYRNVRH TLAFNDFSYT DYSDNARTDF TLFQPFSPRS DESPTLYLGF DSGLPNDALG VYFHMEEELG LTARDADTGG SEVVTAELTN YNAAQASAWA AEQRVVWEYW DKESWVPLVV SDETESFTRS GFIRFVSPED WSATMKFTEE RHWLRARLEM GGYVKAPRIR RILTNVVEAR NHTTIRDEIL GSSDASPLQS FDILRAPLLE GELIEVRERQ APSEEEEAAL EEGAVRAISE DSDEVWVRWK RVDSFFESGP RDRHYRIDYQ SNAVTFGDGR RGMIPPEGKN SIVARRYQIG GGASGNVNPD TLTSLNRALS YIDSVTNPIG ATGGADRESV EEAKERAPYT IKSRDRAVTA EDFETLALRA STSIARAKCV PDRSNRGAVT LVVLPKAESG ARGLTRRLVP SNEVLRYIKR YLDDRRLVGT ILNVVRPRYR DLSLKVTLLR RTIGTSDRLR RDITIKLRTY LHPLVGGRSG KGWEFGRAVL KTELVHAVEE VPGVEGVDAL EILDEERRVH VEHVRVEDDE LPFLVHVHIV EKVRDEIM
|
| |