Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_1725 |
Symbol | |
ID | 8544107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 2344079 |
End bp | 2346424 |
Gene Length | 2346 bp |
Protein Length | 781 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646386432 |
Product | FHA domain containing protein |
Protein accession | YP_003266167 |
Protein GI | 262194958 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.184636 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.609849 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGAGG AGCCGCTATC GCTCGTGTTC GAGTTCGCTC GCACGGATAC GCCCGAGGAT GCGTACGCGT TTCGCTACCG GCCGCAGGAC TACACGCTGC GCACCACCCA CGGCGGACGC AAGCGGGTGC ATCTCGACTG GAGCGAGGAG TTTCTCGGCC AGCTCGACGC CCTGCACGCG CCCTACTGCG ATCCGTCCAC CGCGCAGCGC GTGGGCCGCA CCCTCGGCAC CTTTCTCGAA CCCTCGGGCT GGACCTGGCA CGCCCAGACC ATCGCCCACG CCTGTCAGCA GTCGCGCCCC GTGCTGCTGA CCATCCGCTC GGCCGCGGCC GAACTCTACG CCCTGCCCTG GGAGCTGTTG CCGCTCGAGG CCTCGGGCCA ATGCATCGGC GAGTTGCCGG GCGCGCTGGT GCGTTACGAA TGGCCTGAGA CCCACACCGT GCCGGCCAAG CACCTCAGCG AAGAGCGCGC CGGTCGCGTC CTCGGCGCGT GGACGGCGGC CGGCGGGGAA GTCCCGGCCG CCGAGCACAT CGACGCCCTG CGCGCCGCCT TCAGCGCCGC CGGGCGCGAA TTCGACAGCG ACAGCGACGT GGTCGCGCAC GCCAGCGTCG GCAAACTCGC CGACGCCTTC GAACAAGCCC AGACCGAGGG CCGGCCCTTC ACCGTGCTCC ACCTGCTGTG CCACGGCGGC CGCGCCGGGC GCACCTGGGG CCTGGTGTTC AACGGCGAAG ACGACGAGGA CGAACCCGTC GCCGTCGACA CCTGGCGGCT GCGGCAACTG CTAGCGCCGC ACGCCGGCAC CCTGCGCCTG GTGGTCATCT CCGCGTGCGG CAGCGCCTAC GGGCGCGAGT TCGACAGCGT CGCCCAGGCC CTGCACCGGG CCGGCATCCA GGCCGTGGTG GCGTCGCGCT TTCCGCTCTC TATCTCCGGC TCGGTGCGCG TCGCCCAGAC CCTGTACGAG GCCATGCTGG TCGAGCAGCA GCCGCTCGAG GAGGCCTTTT TGCGCACCCG CCGCGCCCTC GCCCGCGACG CCACGCGGCT CGATTGGGCC GGTCTACAGC TCTACGCCCG CCAGCAGGAC GGTCACCGCA CCAAACCGCT GCAACTCGGA CATATGGACG GAGAACGCAA CCGCAGAGCC GCAAGCGAAG ACCCGCGCCC GGCGCTATCC GATCTCGACC TCAGCAGCCT CATGGAGCTG CAGAACCAGG TCCGCTCGGC CATCGCCACG CGCTTCGAGA CCAAACTGGC GCTGATGTGC GTCGAGCTGG TCGATGTCGA CTTCCGCGCC GGCCCCTCGG AGGCCGGGGT CCAGAAGCGC TGCTACGAGC TGCTGGCCGA AGTCGCCGCG CCCATGCAGG GACGCATCTT CGCCACCTTG GGCGACAGCC TGCGGGTGTG CTTTCCCAAC GTCAAGAGCC TGCTGCGCGC GGTCTTTGAT TTCGTCGACG CCCTGACCGA ACACAACTAC GCCGCTCCGC GCGAGGACCA ACTCGTGGTC GGCATCGGCC TGCACTACGG CTCGGCGCTC AGCAACGACC GCATCGTCGT CGGGCCGGCC GTGGACACCG CCGCGCGCGT GGCCGCGATG GCGGGTGACA GCGAGATCCT GCTCACGCAC GACACCCTGG TGCACTTTCC CCGCGTCACC CAGGCCATCT GCCGGCCCGT CACCCAGCCC ACGCATCGCA GCGAGGACGC CGAGGATCTC TCGCTGTACT CGCTGCCGTG GAGCAACGAG CAGCGCCTGC CGGCCACCAT CATCGTCGAG GAGACCGGCG AGGTCATCCC GCTGCCGCGC CAGGACATCA TCTCCGTGGG TCGCCTCGAC GCGCTGGCCG ATGGCAGCAA GGCCAACGAC GTCGTGCTCA CCCACCCCAA CGAGCGCGCC CAGCGGCTGA TCAGCCGCTG GCACTTCGAG CTGCGGCGCA CCAAGGACGG CTACGTGCTG CGCGCGCTGT CGAACCAGCT CACCGAGGTC GACGGCCTGG CGGTCGAGTG CGGCAACGAG GTGCCCATCG GCCCCGGCAC CACCGTGTGC CTGGCGTACG TGATGACCCT GCGCTTTCAC GAATACGAGC GCGCGCAGTC GGTGCGCGGT GAGGACACCC TCATGCGCCC CGAGAACGCG CAGACCGGCG AGATTCAGAC GCTGCACACG CGGACGCAGA CGCACACGCG GACGCTCACG CCCATCGGCG CCGAATCCGG CCCGCACCGG CAGACCGAAG TCGACGCCGC GACGCCGCCG GTGGTGCTCG ATCCCGACAC CGCCGTCGAC CTTCCCACTG GCAACACGCG GCAAGTCCGC CGCGCAGAAG CAAGCAACCC CAAAAAAGGA GTATGA
|
Protein sequence | MSEEPLSLVF EFARTDTPED AYAFRYRPQD YTLRTTHGGR KRVHLDWSEE FLGQLDALHA PYCDPSTAQR VGRTLGTFLE PSGWTWHAQT IAHACQQSRP VLLTIRSAAA ELYALPWELL PLEASGQCIG ELPGALVRYE WPETHTVPAK HLSEERAGRV LGAWTAAGGE VPAAEHIDAL RAAFSAAGRE FDSDSDVVAH ASVGKLADAF EQAQTEGRPF TVLHLLCHGG RAGRTWGLVF NGEDDEDEPV AVDTWRLRQL LAPHAGTLRL VVISACGSAY GREFDSVAQA LHRAGIQAVV ASRFPLSISG SVRVAQTLYE AMLVEQQPLE EAFLRTRRAL ARDATRLDWA GLQLYARQQD GHRTKPLQLG HMDGERNRRA ASEDPRPALS DLDLSSLMEL QNQVRSAIAT RFETKLALMC VELVDVDFRA GPSEAGVQKR CYELLAEVAA PMQGRIFATL GDSLRVCFPN VKSLLRAVFD FVDALTEHNY AAPREDQLVV GIGLHYGSAL SNDRIVVGPA VDTAARVAAM AGDSEILLTH DTLVHFPRVT QAICRPVTQP THRSEDAEDL SLYSLPWSNE QRLPATIIVE ETGEVIPLPR QDIISVGRLD ALADGSKAND VVLTHPNERA QRLISRWHFE LRRTKDGYVL RALSNQLTEV DGLAVECGNE VPIGPGTTVC LAYVMTLRFH EYERAQSVRG EDTLMRPENA QTGEIQTLHT RTQTHTRTLT PIGAESGPHR QTEVDAATPP VVLDPDTAVD LPTGNTRQVR RAEASNPKKG V
|
| |