Gene Hoch_5107 details


Gene Information

Locus tag: Hoch_5107
Symbol:
ID: 8547518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7038024
End bp: 7039784
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 75%
IMG OID: 646389783
Product: response regulator receiver protein
Protein accession: YP_003269488
Protein GI: 262198279
COG category:
COG ID:
TIGRFAM ID:
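
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (the function names are illustrative, not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1
```

With the values from this record, gene_length(7038024, 7039784) gives 1761 and protein_length(1761) gives 586, matching the listed Gene Length and Protein Length.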


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.298176
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGTCT TCTCGAGAGT GCTAGTGGTC TGCGCAGAGG CGCGTTTGGC AGAGGCCGTG 
CGCTTCGGCT TCGAACGCGC CGGCGCGAGG GTGTCCACCT GCGCCAGCGC CGACGAAGCC
GCGGCCGCGT TTGCGGACGC CAGCGCGGTG CCCCGGGGCG GGCTCGAGGG CGAGCACGAG
GATGGCGCCG CCGGGTCCGA ATCCGCGGAC GAGACGCGGC CGCCGGCCCT GATCGTCACC
GGCGCGACCG AGGTCGCGGA CGCGCTCGCG CTGCTGCGTT CGCTGCGCGC GGGGCTGGCC
GAGCACGCGC CCTCGGTGCC GGTGCTGTGT CTGGCGCCGC GCATGCTGTC GCGCGCCGAT
GCGCTGGCCG CGGGCGCGAG CGAGCGGCTG CCGCAGCCCG CGCACCTGCG CGACGCGGTC
ACGCTCGGCC GCATGCTGCT CGGCCGCAAC CGGCGCGGCG ACCGCGCGCA CGTGGTGCAA
GGCCACCTCA GCGACTACGA GGGCGTGTTC TTCCTCGCCC GCGCGCTGGC CGCGGTGGGC
CGCAGCGCGG TGCTCAGTCT GGTGCGCGGG CTGCGCCGCG GCGAGCTGCG CTTCCACCGC
GGCGCGATCA CCTCGGCCCA GGTCGGCGCG CTGCACGGCA TGGCGGCGCT GCATCAGCTC
CTGCTGTGGT CGCGCGCGCA CTTCGAGCTG CGCGACGAGG AGGTGGTGCC GCGCCGGCAG
ATTCCGCTGT CGGCCAGCGA GATCCTGATC GACGCGGAGC GCTTTCTCTA CGATATGCGC
TCGCTGCTGG GCGAGCTGAC GCCGGCGGGC GTGTACGCGC CCGCGCACGA GCGCTACCGC
GGTCAGGAGC TGCCGCCGCA AGTGCGCCGC GTGCTGCGCC TGATCGACGG CTATCGGACC
ATGGCCGACG TGGTCGAGGA TTCGCCGTAT CGCGTGCTGG AGACGCTGGC GATCTGTAAC
CGCCTGCTGG CGCTCGAGTG GATCGTCCGC GTCGACGACA TGGATGGCGA GCGCGGCGCG
TCGGTCGGCG GCGGCGTATC GGTCGAGCGC ATGCTGAGCG ACCGGCTGTC CGAAAGCTCG
TCGGCCGCGA TCGGCTCGGT GGAGCTCGAG TCGCTCTCGG GCGATACCAA GCCCATGCGC
ATGCTGCGCC GCGAGGCCAG CGCCTTCGAC GACGAGGACA CCGCCGACGG CCGCGGCGGG
CGCGCCAGCG AGGTCGCCGG CGAGCCGGTG GATTGGTCAG ACGTGCTGCC CACGGAGATG
GCGTCCGGGT TCTCGCCCGT GGTCCCGGCG ACCGCGGCCG CCGGCGAGAT CGAGGCGCAG
CAGGATGACG GCGTGGTGCG CGCGTTCGCC CCCGAGCACA GCGGCGACGA GGAAGACGAC
TCGAGCGCGC TCGCCCGGGG ACTGCGCGCG CGCACGCGCT CGACCCGCAT GCAGACCGCA
CCGCATCCCG ATGACGGCGC CGAGGGCGAG GTCGAGCTGG CCTGGACGCC GGCGGCCAAG
GCCGGCAAGG CCAGCAAGGC CGGTAAGGCT GGCAGAGGCG GCACCACCGG GCGCGGCGAG
GGCGCGTCCA AGCGATCGCC GAGCGCGGCC CCGACGCGTC CGGCCGACGG CGCCGGCGAT
GACGACCAGC GGCCCTCGCT GTGGCGGCGT ATTTTCGGAC GCGCCGAGGC ATCCGCGAGC
GCGGACGATG AGGCGGCGCC CAAGGCCGGC GGCAAGCCCA ATCGCCGCGG CGCGAGCCGG
CGACGCCGCA AGCGCCGCTG A
 
Protein sequence
MSVFSRVLVV CAEARLAEAV RFGFERAGAR VSTCASADEA AAAFADASAV PRGGLEGEHE 
DGAAGSESAD ETRPPALIVT GATEVADALA LLRSLRAGLA EHAPSVPVLC LAPRMLSRAD
ALAAGASERL PQPAHLRDAV TLGRMLLGRN RRGDRAHVVQ GHLSDYEGVF FLARALAAVG
RSAVLSLVRG LRRGELRFHR GAITSAQVGA LHGMAALHQL LLWSRAHFEL RDEEVVPRRQ
IPLSASEILI DAERFLYDMR SLLGELTPAG VYAPAHERYR GQELPPQVRR VLRLIDGYRT
MADVVEDSPY RVLETLAICN RLLALEWIVR VDDMDGERGA SVGGGVSVER MLSDRLSESS
SAAIGSVELE SLSGDTKPMR MLRREASAFD DEDTADGRGG RASEVAGEPV DWSDVLPTEM
ASGFSPVVPA TAAAGEIEAQ QDDGVVRAFA PEHSGDEEDD SSALARGLRA RTRSTRMQTA
PHPDDGAEGE VELAWTPAAK AGKASKAGKA GRGGTTGRGE GASKRSPSAA PTRPADGAGD
DDQRPSLWRR IFGRAEASAS ADDEAAPKAG GKPNRRGASR RRRKRR
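
The protein sequence above corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, not the database's own tooling. It relies on the fact that translation table 11 assigns amino acids to codons the same way as the standard code (the two tables differ in the permitted start codons). The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Amino acids for all 64 codons in NCBI order (bases T, C, A, G).
# Table 11 uses the same codon-to-residue mapping as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)
```

For example, translate("ATGTCTGTCT TCTCGAGAGT GCTAGTGGTC") returns "MSVFSRVLVV", the first ten residues of the protein sequence, and the final 21 bases of the gene translate to "RRRKRR" followed by the TGA stop codon.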