Gene Hoch_0956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0956 
Symbol 
ID8543338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp1220710 
End bp1222500 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content52% 
IMG OID646385717 
ProductProtein of unknown function DUF2326 
Protein accessionYP_003265452 
Protein GI262194243 
COG category[S] Function unknown 
COG ID[COG5293] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.178488 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTCT CTCGAATCTA TAGCAATAGG CCAGAAGAGT TTCCGCCCAT ACGGTTTAAT 
GATGGATTGA ACGTCGTACT CGCGGAGATT CGAGACCCTG CGAACCTTAG CAAGGACACG
CACAACCTCG GCAAGACGAC CCTTGCGCAT CTTATCGACT TCTGTCTCCT CAAGAGCAAA
AAGAAGCAAT TTTTCCTCAT GAAGCATGAG GACCGTTTCA GCCAGTTTGT CTTCTTTCTG
GAACTTCACC TCAGCGAAGG TAGGTTTATA ACCATCCGTC GGCCGGTGGC CGAGGCAACT
AAAATATCGC TTTTCTATTC CTCGGAATCA TGTCCAGATG CGAACCTTTT CTTAGAAAAG
GAGTGGAGCC ACTGGCGACT TCCGTTCGAA AAAGCCAAGA ATCTCGTAGA AGCACAGCTT
AGCTTCTCGG TTGCTACAGG ATGGGCATTT CGCAAGCCGC TAGGGTATGC GCTCAGGCTT
CAGGACGATT ACCAGGATGT CTTCCAGCTT GGAAAGTTTC GTGGGTCGCA CGGGGACTGG
AAGCCTTTTT TGGCAGAACT CATTGGCCTC GACGGGAAGC TGGTTCAGGA GTCGTACGAT
CTGGCTGCCG AGGTAGAGGC CATTGGTCGC TCGATCGCAG AGCTTGAACC CCTACTTGTA
GGGCTCGCTG ATTCGCCGGA CCGACTCGAA GGGATGATTC TCATCCGGAC GAAAGAAGTG
CAGGAACTTG AGCGAAAGCT CGGCGACTTC GATTTCAAGC TCCCTGATGA GGGAATTTCC
GAGGAGCTCG TTAACAGAAT TGAAGCAACG ATTGCCGGAC ACAATGAGCG TCGCTATCAT
CTCCGGATGT CTCTCCAGAA GCTCGATAGG ACACTTGGGC AGACTATTCA GTTCGATATC
GCGGGCATCG AGCGGCTCTT TGCTGATGCG CAGGTCTACT TCGGTGATCA ATTGAAGCGC
GACTACAGCG ACTTGATGTC ATTCCTTGCA AGTATCTCGA AGGAGCGCTC GGACTTACTC
CAGGAAGAAC GGGCCGAGAC CATTCAGGAG TTAGGCGAAA TCGATGCCGA ACTGGAGAAA
CTCAACAAGA GAAGAATCGA GGCGCTTACC ACATTGCGCC AGGCGAAGAC GCTGGCAAAA
TATCGAGAGC ACACAGAAAG GCTTGTAGAT TTAAAGGCTA CGTTGGCGCT CCTTGAGCGT
CAGCGGGAAC AGATGGACCG TCTTACGGAG CTCCGCCACA CTCTTAGCGA TGTGCACCAG
AAGCGAACAG AGGTGGTGCA GGCGGTCGAA GAGAACATCA CTAAGTCCAC TCGTCAAGAG
AGTCGCTACC GTACAATTCG GGTCGAGTTC GGCGCGATCG TTAAACGGGT GCTCGCTCGT
CCGGCGGTGC TATCGTGCAA GCTGAATAGT AAAAAAAATA TCGAGTTCCG GGCTGAGATC
TTAGATGAGA CCGACACCGC TACAAGTGCG GACGATGGGC ATACCTACCG CAAGCTGCTG
TGTGTGGCGT TTGATGCCGC TGTGTTCCTT GCTTACCTCA ATGATTCATT CATTCATTTT
GTGTTTCACG ACGGAGTGTT CGAAAGCCTA GATGATCGAA AGAAGCGGTG TCTACTCGAT
GAACTCCGGG CTTGGGGCGC AGAGGGCCTT CAGCAAATCA TCACGGTCAT TGATTCAGAT
CTGCCTTTAG GGGAGGATGG AAGCCGTATG GCTTTCGATG ATCGTGAGAT CGTACGCTTG
CTGCATGATC AAGGCGATGG TGGCCGATTG TTCCGAATGC CACCCTGGTA G
 
Protein sequence
MKLSRIYSNR PEEFPPIRFN DGLNVVLAEI RDPANLSKDT HNLGKTTLAH LIDFCLLKSK 
KKQFFLMKHE DRFSQFVFFL ELHLSEGRFI TIRRPVAEAT KISLFYSSES CPDANLFLEK
EWSHWRLPFE KAKNLVEAQL SFSVATGWAF RKPLGYALRL QDDYQDVFQL GKFRGSHGDW
KPFLAELIGL DGKLVQESYD LAAEVEAIGR SIAELEPLLV GLADSPDRLE GMILIRTKEV
QELERKLGDF DFKLPDEGIS EELVNRIEAT IAGHNERRYH LRMSLQKLDR TLGQTIQFDI
AGIERLFADA QVYFGDQLKR DYSDLMSFLA SISKERSDLL QEERAETIQE LGEIDAELEK
LNKRRIEALT TLRQAKTLAK YREHTERLVD LKATLALLER QREQMDRLTE LRHTLSDVHQ
KRTEVVQAVE ENITKSTRQE SRYRTIRVEF GAIVKRVLAR PAVLSCKLNS KKNIEFRAEI
LDETDTATSA DDGHTYRKLL CVAFDAAVFL AYLNDSFIHF VFHDGVFESL DDRKKRCLLD
ELRAWGAEGL QQIITVIDSD LPLGEDGSRM AFDDREIVRL LHDQGDGGRL FRMPPW