Gene Hoch_4073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4073 
Symbol 
ID8546474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5594299 
End bp5595690 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content71% 
IMG OID646388749 
Producthypothetical protein 
Protein accessionYP_003268464 
Protein GI262197255 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.446677 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00970707 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAATGATT CGACAATGAT TCATCGCGGT GGTGCCTCGA CCCGAAGGTT GGGCCGCGGC 
TCAGCCCCGC ATCTGCTGGT GAAACCGGAT GCGGACGGGA CGGGCGGACA TGCGGAGGCC
GGATACGCGA AGGCCTCACG GGCGGGCACG GTATTGCGCT GGCTGTGTCT TCTGACGGCC
GTGGCCAACA TGGGTGGCAA CGCGCTGCTC ATGGCCTTTC AGGAGCCGCT GTACGAACTG
CTCGGGCTGG CGCCGCCCGC CGACCCCTAC GCGTTGGCGA GCGTCCTCGG CTTCTCCTTT
ACGGCCGGCG CCCTGGCGCT GGCCATTTTC CTGCGCCCGG CGCACGCGGT GCCGATGCTG
GTCGTGGCCA TGCTCGGCAA GGCCAGCTTC GCGCTGGTCA CCTTGAATGC CCACCTGAGC
GGCGGTGTGG CCTGGCCGTG GCTGGCCTTC GCCGCCTGGG ATGCCGTGTT CGTGGTGGTG
TTCTGGATGT ATCTCATCCA TCTGCTGCGC CCCGAGCTGG CGAGCTTCAA CCGGGGCGCG
GTGTTGGCGT CGCCGGTGCC GGCCGGCGAG ACGCGCCAGG CGCTGCTCTT GTGCTACTCG
CTGAGCGGCA ACAGCGGCCG CGCGGTCGAG AGCACGCGGC GTGGCCTCGA GGCCGGCGGC
TACCGCTGCG ATGTGGTCCG CATCAAGCCG CTCGAGCGCG AGCTGTTTCA GTTTCCCTTC
ACCTCGGTGA TGCAGTTCGT GCGCATCGCG CTGCGCGCGG TGCTGCGCCG CCCGGCCGAG
ATCGAGCCGC TCGAGCTGCC GGCCGATTGC GATCCCGACC TGATCGTCGT GGCCTCGCAG
ACCTGGATGG TGGGCGCGGC GGCGCCGGTC GAGGAGCTGT TCCGCCGGCC CGATACCCGG
GCGCTGTTCG CCGGTCGCGA CGTCGCCCTG ATCAACGTGG CGCGCGGCCT GTGGCGCCGC
AGCCAGGCCA TCCTGGCGTC GCGGGTGGAA GAGGCGGGCG GCAACCTGGT GGCCGCGCGG
GCGGAGACCA ACCCCGGCCG CGAGCCCACG CGCACGCTGT CGCTGTTCGT GTTCCTGATG
ATGGGCCGCA CCGGCCATCC CTCGTGGCTG CCCAAGGCGC TGCGACAGCC GCAGTACCTG
AGCGAAGATG CGCTCATGGG TTATGAGCGC TGGGGCACGG CGCTGGCGCG GCGTTCCACG
GCCGGGCTGT CGCCGCTGGC GTGGCAACGC GCGGCCGTGC TCGATGCGGC GCTGCGCGAG
CAGAGCAAGC GCAGCTTGCA GGAGTCGTGG CAGGATCTGC TGCCGGAGCG CGAGACCGAG
GCGCGTGAGA GTCGCGACCC GGTGAGCGTG GCCAACGCGG CCATCACCTC CTCGGTGGAG
GTGGGCGCGT GA
 
Protein sequence
MNDSTMIHRG GASTRRLGRG SAPHLLVKPD ADGTGGHAEA GYAKASRAGT VLRWLCLLTA 
VANMGGNALL MAFQEPLYEL LGLAPPADPY ALASVLGFSF TAGALALAIF LRPAHAVPML
VVAMLGKASF ALVTLNAHLS GGVAWPWLAF AAWDAVFVVV FWMYLIHLLR PELASFNRGA
VLASPVPAGE TRQALLLCYS LSGNSGRAVE STRRGLEAGG YRCDVVRIKP LERELFQFPF
TSVMQFVRIA LRAVLRRPAE IEPLELPADC DPDLIVVASQ TWMVGAAAPV EELFRRPDTR
ALFAGRDVAL INVARGLWRR SQAILASRVE EAGGNLVAAR AETNPGREPT RTLSLFVFLM
MGRTGHPSWL PKALRQPQYL SEDALMGYER WGTALARRST AGLSPLAWQR AAVLDAALRE
QSKRSLQESW QDLLPERETE ARESRDPVSV ANAAITSSVE VGA