Gene Hoch_0006 details


Gene Information

Locus tag: Hoch_0006
Symbol:
ID: 8542376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 8473
End bp: 9663
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 66%
IMG OID: 646384794
Product: hypothetical protein
Protein accession: YP_003264541
Protein GI: 262193332
COG category:
COG ID:
TIGRFAM ID:
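
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon (the listed gene sequence ends in TGA). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 8473
END_BP = 9663
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both checks agree with the record: 9663 - 8473 + 1 = 1191 bp, and 1191 / 3 - 1 = 396 aa.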


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGCTC AGTATCAAGA CCTAGAGCAG CAGAACCACG AGCTGCGCGA AGAGAATCAG 
GCGCTCCGCG CCGAGCTCGA GCGCATCGAA GCGCGCGCGG CCAAAGCCGC CGAAGCGCGC
ACGCGCCTCA TGCGCGGCTC GTGGCGAATC CTGGTTCCGC TCATCGATCG CCAGCGCGTG
GCGCGCTCGT TTGGCAAGCT GGCGCAGACC GCGTCCGAGT TCGCCAACCC GCCCGCGCAT
TGGCCGACGA AGGAGCAGAT CCTCGCCGAG GCCCGCGACT TCATGGAGTC GTGCGTGCGC
TTCACCATCC GCAGGCGCAC GCTGCTATTG GTGTTCTCGC TCCTCGCCGC CGCCATTCCG
GCGTTTCAGA TGTATCTGGT GGTGCAGCAA AACGAGATGA TCGAGAATCA AAACGAGTTC
TTCGGCATTC AGGTATACGA CATCGTCTCG CGCACCATGA CCGAGGGCGA CCGCAACGCG
CGCCAGATGA CGGGCGCTCT GCTGGCCAAC GCCAAAGTCG AGTTTCTCAG CGGCGTGGTC
GAAGAAGCCT TTGGCGCCGG CGGACTCGGC TTTCAGTGGG GCGCGTATCG CCGCGACGAT
ATCGACGCGC AGCAGCGGCG GCTCGAGGAC GCGGCCTTTC GCGGTCACTT GATCCGCTCG
GTGGTGCGCG CGGTGCAGCA TCGCGGTGAC GGCGGAAAGC ATGAGATGGA TGCCGATGAG
CTGCACGCGG CCATCGTGCC CAGCATCCGC CAGATCCTGC GCGACACCGC CGACCGCATG
CCGCAGGTGC TGCGTCTCGG ACGCCAGGAC GGAGACATCG ATCCGGCCTT GCTCGAACAG
GTGGACTATT ACCTGATCCA GGTCGGCGAG CTGCTGCGCG TCTACGGTCG CATCGCGCGC
TCGGCCGATG AAGAGGCCGC GTTCTTCGAC GATATCCGGC CGCTGTTCCA GCGCATCGGC
GGCCGCCGCG ACCTCGAGGA GAGTCGCTTT GCCGAGGTCT ACCGGCCGGT GCTGCAGGAC
TTCCTGTTCG AGCTGGCGCT GCAACCGAAG CTCGATTCGC CGCCCGTGAA CCTCGAGACC
GCCGGCACCT CGCCCGACGA GGCGCTCAAT ACCGGCATCG AGCGCCTGCG CAAAGGACTC
GGCGAGCAGG CCCTCAACTG GAACCTCTTC AAACGACAGG TGGCGCAATG A
 
Protein sequence
MSAQYQDLEQ QNHELREENQ ALRAELERIE ARAAKAAEAR TRLMRGSWRI LVPLIDRQRV 
ARSFGKLAQT ASEFANPPAH WPTKEQILAE ARDFMESCVR FTIRRRTLLL VFSLLAAAIP
AFQMYLVVQQ NEMIENQNEF FGIQVYDIVS RTMTEGDRNA RQMTGALLAN AKVEFLSGVV
EEAFGAGGLG FQWGAYRRDD IDAQQRRLED AAFRGHLIRS VVRAVQHRGD GGKHEMDADE
LHAAIVPSIR QILRDTADRM PQVLRLGRQD GDIDPALLEQ VDYYLIQVGE LLRVYGRIAR
SADEEAAFFD DIRPLFQRIG GRRDLEESRF AEVYRPVLQD FLFELALQPK LDSPPVNLET
AGTSPDEALN TGIERLRKGL GEQALNWNLF KRQVAQ
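
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It translates only the first 60 nucleotides of the gene sequence above and compares the result with the first 20 residues of the protein sequence. The compact codon table is the standard amino acid assignment, which translation table 11 shares for sense and stop codons; the `translate` function is a hypothetical helper written for this example, not a library call.

```python
# Compact codon table: codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        index = (BASES.index(codon[0]) * 16
                 + BASES.index(codon[1]) * 4
                 + BASES.index(codon[2]))
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


# First line of the gene sequence and first 20 residues of the protein,
# both copied from this record.
FIRST_60_NT = ("ATGTCGGCTC AGTATCAAGA CCTAGAGCAG "
               "CAGAACCACG AGCTGCGCGA AGAGAATCAG")
FIRST_20_AA = "MSAQYQDLEQQNHELREENQ"

if __name__ == "__main__":
    print(translate(FIRST_60_NT) == FIRST_20_AA)
```

The same function applied to the last codon of the gene sequence, TGA, yields `*`, which is why the 1191 bp gene encodes 396 residues rather than 397.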