Gene Hoch_1825 details


Gene Information

Locus tag: Hoch_1825
Symbol:
ID: 8544207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2516798
End bp: 2518045
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 66%
IMG OID: 646386531
Product: HipA N-terminal domain protein
Protein accession: YP_003266266
Protein GI: 262195057
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID: [TIGR03071] HipA N-terminal domain
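
The Start bp, End bp, Gene Length, and Protein Length fields above appear to be mutually consistent. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2516798
end_bp = 2518045

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1248 415
```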


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.401045
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.66095
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAACGTG CCAGCGAAGC ATCTCGGCGG GGCCTGGACG TCCGGGTCTC AAACGTCCGG 
GTGGGCTCCC TGGCTCGAGA CGCAGGCGGC GACCTGCGCT TTACCGCCGA CCGCGCATGG
CTCGAAGACG GGCAGCACCC GCCTCTGGGT CTGACGTTCT TGCAAGACCC TGCGCCGCGC
GTGCAACGCG GCCTCGTGCC CGTATGGTTT GAAAACCTCC TGCCCGAGCG CGGGACTCCG
ATGCATCGCT GGATCTGCCA ACAACACGGA CTCCGCGAGC GCGACGAAGC AGCCCTGCTG
CAGGTACTCG GTCATGATCT GCCAGGCGCC GTAGAAGTCA GCGGCGACAT CGACGAGCGC
GAAGACGAAG CCACCGAAGC GCCAGAAGAT GGCCGATTTC GTTTCTCGCT GGCGGGCATG
CAGCTCAAGT TCTCCATGCT ACTCGAAGGG GATCGCCTGT CGCTGCCGCT CCGCGGAGAG
ACCGGACACT GGATCGTAAA GGTTCCCGGA AACGAGCTTC CTCAGGTCCC CGAAGTCGAG
GCAGCAACCC TCACCTGGGC GGAGGCAGCA GGCTTCGCCA CGCCTCGCCA TCGAGTGATG
CCCCTCAAAG CGCTCGCCGG TATCGACGCC GCGCGCCTGG GTCAAGCGCA CTGCGTACTC
GCCGTCGAGC GCTTCGACCG TCGCGCCGAC ACGCGAGTTC ACCAGGAGGA TTTCGCCCAG
GCGCTCGAGA TCCGTCCCTC CGACAAGTAC GGGGCCCGCA ATCGCGCCCC GACCTATGAC
AGCCTCGCCC GCCTGGTACG GGACGCATGT GGCATCGAGG GGCAGAAAGA GTTCATCCGA
CGCGTGGCCT TCGTTGTCGC GTCGGGCAAC AGCGACGCTC ATCTCAAGAA CTGGTCGTTT
CAATGGGGCG CGTCCCACCG CCCCCGGCTC AGCCCTTGTT ACGACCAGGT GGCCACCATC
TCGTGGCCAG AATTTGGCTG GAACGCGGCC GGGGGCGCGG AGTTAGCGCT CACCCTGGGA
CGCTCCAAAC GCTTCGGCGA ACTCGACCGC AGCCGACTGC GCCTGTTCGC AGAGCGCGCC
GGTGCTCCCG ATGGAGAGGC GTGGTTCCTC GATGCGCTCG ATCAAATTCG CAGCGCGTGG
TCGGGACTCG AAGCGCAGGC GCCCGCGCGC ATGCGTGACG CGCTGCTCGA ACACTGGCAA
AAAGTGCCCG TCCTTTGGGA CATGGGCGGT CTCCCCGGTG CGAGATGA
 
Protein sequence
MKRASEASRR GLDVRVSNVR VGSLARDAGG DLRFTADRAW LEDGQHPPLG LTFLQDPAPR 
VQRGLVPVWF ENLLPERGTP MHRWICQQHG LRERDEAALL QVLGHDLPGA VEVSGDIDER
EDEATEAPED GRFRFSLAGM QLKFSMLLEG DRLSLPLRGE TGHWIVKVPG NELPQVPEVE
AATLTWAEAA GFATPRHRVM PLKALAGIDA ARLGQAHCVL AVERFDRRAD TRVHQEDFAQ
ALEIRPSDKY GARNRAPTYD SLARLVRDAC GIEGQKEFIR RVAFVVASGN SDAHLKNWSF
QWGASHRPRL SPCYDQVATI SWPEFGWNAA GGAELALTLG RSKRFGELDR SRLRLFAERA
GAPDGEAWFL DALDQIRSAW SGLEAQAPAR MRDALLEHWQ KVPVLWDMGG LPGAR
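
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 60 bases of the gene sequence, using only the Python standard library. The helper `translate_cds` and the reduced set of initiator codons are assumptions made for illustration. This is not the pipeline that produced the record.

```python
from itertools import product

# Codon-to-amino-acid mapping shared by the standard code and table 11,
# listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial initiators are handled here;
# table 11 permits a few more.
INITIATORS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing used in the listing
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in INITIATORS:
            residues.append("M")  # an initiator codon is read as Met
            continue
        aa = CODON_TABLE.get(codon, "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence in this record (60 bases).
prefix = "GTGAAACGTG CCAGCGAAGC ATCTCGGCGG GGCCTGGACG TCCGGGTCTC AAACGTCCGG"
peptide = translate_cds(prefix)
print(peptide)  # MKRASEASRRGLDVRVSNVR
```

The printed peptide matches the first 20 residues of the protein sequence listed above.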