Gene Hoch_5478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5478 
Symbol 
ID8547891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7517169 
End bp7519190 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content69% 
IMG OID646390151 
ProductFHA domain containing protein 
Protein accessionYP_003269854 
Protein GI262198645 
COG category[T] Signal transduction mechanisms 
COG ID[COG1716] FOG: FHA domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.366631 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.328506 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCAAGC TGGTCATCCA AGATGACGAG GGCAAGACTA CGGTCGTTCC CTTGATCCGC 
GATGAAATTA CGGTCGGTCG GAAGGAAGGC AATACCATCC GTCTCACCGA GCGTAATGTC
TCGCGGCGTC ACGCACGAAT TCTGCGCGCC AACGGCGAGG ATATCGCCAT CGAGGACCTC
GAGAGTTACA ACGGGGTTCG CGTCAACGGC AGTCGCATCA AAGGGCGGCA GCCGCTATCG
CTCAGCGATC GCGTACAGAT CGGCGACTAT CTGATCGAGA TCAAAGCCGA CGATGAGGCC
GTCGCCGCCG AGGTGCCGGT CGCGCCGCCG GACAGCGAGC CCACGCAGCC GATCGATGCG
CTCGCCCGGC CCCCGGGCAG CCCCGAAGCC CGCGCCGCCG CCGCCCCGGC CGCCGCCGCC
GCCGCCGCCG CGGTCGAGAG CGGGGTCCCG GTCCCGCAAG TCGGACCCTA CGCCGAGACG
ACCAAGATGC CGTCTCAGCT CTCGCTCGAG ACGACCAAGA TGCCGTCTCA GCTCGTCCCC
GAGGCGCTCA AGCAGGTCGC CGAGGGCGAG GCACCGCCGG CCCAGCCCGA GCTCTCGGAC
GAGACGCCGA CGCCCGACGT CGGCCCCAAA GTAGAGACCC GCCCGGCTCG CTTCATCGTG
CTCAGCAGCA ACCTGGCGAG CCTGGAGTAC GAGATCGGCG AGGGCACCAC GGTCCTCGGG
CGCACCGAGG ACAACGACGT CGTCATCAAT CACCGCTCGA TCTCGCGCAA CCACGCCAAG
GTCATCCACG AGGGCGGTCG CTACACCATC GTCGACCTCG AGTCCTCGAA CGGCGTGCGC
ATCAACAACG AGGAGTTCGA CAAGGTCGAG CTGCGCCGCG GTGATCTCGT CGATCTCGGA
CACGTCCGGC TGCGCTTCGC CGATCCCGAG GACGACTACG TTTTCACCAA AGACGACATC
AGCGATGTCG CCACCGGCGG CAACAAAGGG CTGTGGTACG CGCTGCTCGC GGTGCTGGTG
GTGCTGGTCG GCGTCGGCGT GTTCGCCCTG CTGAGCGACG GCGACGACAG CGGCCCGGCC
ACGGCCAACG AGAACGGCGA CAACGGCTCG CTCGTCGCCG GGACCACTGC CGGTAGCAGC
GATGAGAAGA CCGGCGAGGA CAACGGCCAG GACAGCGGCG AGAACAGCGG CGAGGGCGAC
GCCCTGGCCG AGGCCGAGGC CGCGGCGGCC GCGCGCGACT GGGATGCGGC CAGCGCCAAG
GCGCGGACCG CGCTCACGGC CGCCGCTGAT GACGCGGACA AGCAGCAGGC CGCGCAGGCG
CTGATCGACA CCGCCGAGCG CGAGCGCGAG TTCGCCACCC ACTACGCGGC CTTGCAGCAG
GCGGCCGAGG CCGGCAAGGC GGCCGCGGTG CGCGAGCACC TCGACGCCAT CGACGACAAC
TCGGTGTACC GAGACGACGC CGAAGCCGCG CGCGACAGCG CCCGCGATGC CTATATCGAA
TCGGTGCTCG AGCGCGCCGA GGCGCTGCGC GACGCGCGCA AATGCGCTCA GCTCGACGCG
CTCAGGGGCG AGGCCGCCGG GGCCTGGCCC GCGGCCGGCG ACGCCGTCGG CGAGATCGCC
TCGACCTGCC AGGCCGCCGT GGCCCAGAAC ACGCGTCGCC CGCCGCCGCG CGACCCGCCG
CCGCCGCGCG ATCCGCCGCC GCAGAACACC GATAACGGCA GCAGCGGCAA CAACAACAAC
AACAGCCGCG GTGGCAAGTC CTTTGACGAG CTCCTGGCCG AGGCCGAGGA CGCGCAGAAA
GCCGAACTTT ACGGAAAGGC GCGGCGGCTG TGCGGGCAGG CCCTCGAGAT CAAACCGCGC
GATCCGCAGG CGCTGTTCAT CTGTGGTGTT GCATCCTGTC GCCAGGGCAA CGCGGCGCGC
GCCAAGCGCT ACTACAACCT GGCATCGGGG GAGAGAAAGC GGGCGATCTA CCAGCTCTGC
TACGTCGAGG AGGGCGGTCA GACCCGCAAT TTGCTGGATT GA
 
Protein sequence
MFKLVIQDDE GKTTVVPLIR DEITVGRKEG NTIRLTERNV SRRHARILRA NGEDIAIEDL 
ESYNGVRVNG SRIKGRQPLS LSDRVQIGDY LIEIKADDEA VAAEVPVAPP DSEPTQPIDA
LARPPGSPEA RAAAAPAAAA AAAAVESGVP VPQVGPYAET TKMPSQLSLE TTKMPSQLVP
EALKQVAEGE APPAQPELSD ETPTPDVGPK VETRPARFIV LSSNLASLEY EIGEGTTVLG
RTEDNDVVIN HRSISRNHAK VIHEGGRYTI VDLESSNGVR INNEEFDKVE LRRGDLVDLG
HVRLRFADPE DDYVFTKDDI SDVATGGNKG LWYALLAVLV VLVGVGVFAL LSDGDDSGPA
TANENGDNGS LVAGTTAGSS DEKTGEDNGQ DSGENSGEGD ALAEAEAAAA ARDWDAASAK
ARTALTAAAD DADKQQAAQA LIDTAERERE FATHYAALQQ AAEAGKAAAV REHLDAIDDN
SVYRDDAEAA RDSARDAYIE SVLERAEALR DARKCAQLDA LRGEAAGAWP AAGDAVGEIA
STCQAAVAQN TRRPPPRDPP PPRDPPPQNT DNGSSGNNNN NSRGGKSFDE LLAEAEDAQK
AELYGKARRL CGQALEIKPR DPQALFICGV ASCRQGNAAR AKRYYNLASG ERKRAIYQLC
YVEEGGQTRN LLD