Gene Hoch_1725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1725 
Symbol 
ID8544107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2344079 
End bp2346424 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content70% 
IMG OID646386432 
ProductFHA domain containing protein 
Protein accessionYP_003266167 
Protein GI262194958 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.184636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.609849 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGAGG AGCCGCTATC GCTCGTGTTC GAGTTCGCTC GCACGGATAC GCCCGAGGAT 
GCGTACGCGT TTCGCTACCG GCCGCAGGAC TACACGCTGC GCACCACCCA CGGCGGACGC
AAGCGGGTGC ATCTCGACTG GAGCGAGGAG TTTCTCGGCC AGCTCGACGC CCTGCACGCG
CCCTACTGCG ATCCGTCCAC CGCGCAGCGC GTGGGCCGCA CCCTCGGCAC CTTTCTCGAA
CCCTCGGGCT GGACCTGGCA CGCCCAGACC ATCGCCCACG CCTGTCAGCA GTCGCGCCCC
GTGCTGCTGA CCATCCGCTC GGCCGCGGCC GAACTCTACG CCCTGCCCTG GGAGCTGTTG
CCGCTCGAGG CCTCGGGCCA ATGCATCGGC GAGTTGCCGG GCGCGCTGGT GCGTTACGAA
TGGCCTGAGA CCCACACCGT GCCGGCCAAG CACCTCAGCG AAGAGCGCGC CGGTCGCGTC
CTCGGCGCGT GGACGGCGGC CGGCGGGGAA GTCCCGGCCG CCGAGCACAT CGACGCCCTG
CGCGCCGCCT TCAGCGCCGC CGGGCGCGAA TTCGACAGCG ACAGCGACGT GGTCGCGCAC
GCCAGCGTCG GCAAACTCGC CGACGCCTTC GAACAAGCCC AGACCGAGGG CCGGCCCTTC
ACCGTGCTCC ACCTGCTGTG CCACGGCGGC CGCGCCGGGC GCACCTGGGG CCTGGTGTTC
AACGGCGAAG ACGACGAGGA CGAACCCGTC GCCGTCGACA CCTGGCGGCT GCGGCAACTG
CTAGCGCCGC ACGCCGGCAC CCTGCGCCTG GTGGTCATCT CCGCGTGCGG CAGCGCCTAC
GGGCGCGAGT TCGACAGCGT CGCCCAGGCC CTGCACCGGG CCGGCATCCA GGCCGTGGTG
GCGTCGCGCT TTCCGCTCTC TATCTCCGGC TCGGTGCGCG TCGCCCAGAC CCTGTACGAG
GCCATGCTGG TCGAGCAGCA GCCGCTCGAG GAGGCCTTTT TGCGCACCCG CCGCGCCCTC
GCCCGCGACG CCACGCGGCT CGATTGGGCC GGTCTACAGC TCTACGCCCG CCAGCAGGAC
GGTCACCGCA CCAAACCGCT GCAACTCGGA CATATGGACG GAGAACGCAA CCGCAGAGCC
GCAAGCGAAG ACCCGCGCCC GGCGCTATCC GATCTCGACC TCAGCAGCCT CATGGAGCTG
CAGAACCAGG TCCGCTCGGC CATCGCCACG CGCTTCGAGA CCAAACTGGC GCTGATGTGC
GTCGAGCTGG TCGATGTCGA CTTCCGCGCC GGCCCCTCGG AGGCCGGGGT CCAGAAGCGC
TGCTACGAGC TGCTGGCCGA AGTCGCCGCG CCCATGCAGG GACGCATCTT CGCCACCTTG
GGCGACAGCC TGCGGGTGTG CTTTCCCAAC GTCAAGAGCC TGCTGCGCGC GGTCTTTGAT
TTCGTCGACG CCCTGACCGA ACACAACTAC GCCGCTCCGC GCGAGGACCA ACTCGTGGTC
GGCATCGGCC TGCACTACGG CTCGGCGCTC AGCAACGACC GCATCGTCGT CGGGCCGGCC
GTGGACACCG CCGCGCGCGT GGCCGCGATG GCGGGTGACA GCGAGATCCT GCTCACGCAC
GACACCCTGG TGCACTTTCC CCGCGTCACC CAGGCCATCT GCCGGCCCGT CACCCAGCCC
ACGCATCGCA GCGAGGACGC CGAGGATCTC TCGCTGTACT CGCTGCCGTG GAGCAACGAG
CAGCGCCTGC CGGCCACCAT CATCGTCGAG GAGACCGGCG AGGTCATCCC GCTGCCGCGC
CAGGACATCA TCTCCGTGGG TCGCCTCGAC GCGCTGGCCG ATGGCAGCAA GGCCAACGAC
GTCGTGCTCA CCCACCCCAA CGAGCGCGCC CAGCGGCTGA TCAGCCGCTG GCACTTCGAG
CTGCGGCGCA CCAAGGACGG CTACGTGCTG CGCGCGCTGT CGAACCAGCT CACCGAGGTC
GACGGCCTGG CGGTCGAGTG CGGCAACGAG GTGCCCATCG GCCCCGGCAC CACCGTGTGC
CTGGCGTACG TGATGACCCT GCGCTTTCAC GAATACGAGC GCGCGCAGTC GGTGCGCGGT
GAGGACACCC TCATGCGCCC CGAGAACGCG CAGACCGGCG AGATTCAGAC GCTGCACACG
CGGACGCAGA CGCACACGCG GACGCTCACG CCCATCGGCG CCGAATCCGG CCCGCACCGG
CAGACCGAAG TCGACGCCGC GACGCCGCCG GTGGTGCTCG ATCCCGACAC CGCCGTCGAC
CTTCCCACTG GCAACACGCG GCAAGTCCGC CGCGCAGAAG CAAGCAACCC CAAAAAAGGA
GTATGA
 
Protein sequence
MSEEPLSLVF EFARTDTPED AYAFRYRPQD YTLRTTHGGR KRVHLDWSEE FLGQLDALHA 
PYCDPSTAQR VGRTLGTFLE PSGWTWHAQT IAHACQQSRP VLLTIRSAAA ELYALPWELL
PLEASGQCIG ELPGALVRYE WPETHTVPAK HLSEERAGRV LGAWTAAGGE VPAAEHIDAL
RAAFSAAGRE FDSDSDVVAH ASVGKLADAF EQAQTEGRPF TVLHLLCHGG RAGRTWGLVF
NGEDDEDEPV AVDTWRLRQL LAPHAGTLRL VVISACGSAY GREFDSVAQA LHRAGIQAVV
ASRFPLSISG SVRVAQTLYE AMLVEQQPLE EAFLRTRRAL ARDATRLDWA GLQLYARQQD
GHRTKPLQLG HMDGERNRRA ASEDPRPALS DLDLSSLMEL QNQVRSAIAT RFETKLALMC
VELVDVDFRA GPSEAGVQKR CYELLAEVAA PMQGRIFATL GDSLRVCFPN VKSLLRAVFD
FVDALTEHNY AAPREDQLVV GIGLHYGSAL SNDRIVVGPA VDTAARVAAM AGDSEILLTH
DTLVHFPRVT QAICRPVTQP THRSEDAEDL SLYSLPWSNE QRLPATIIVE ETGEVIPLPR
QDIISVGRLD ALADGSKAND VVLTHPNERA QRLISRWHFE LRRTKDGYVL RALSNQLTEV
DGLAVECGNE VPIGPGTTVC LAYVMTLRFH EYERAQSVRG EDTLMRPENA QTGEIQTLHT
RTQTHTRTLT PIGAESGPHR QTEVDAATPP VVLDPDTAVD LPTGNTRQVR RAEASNPKKG
V