Gene Hoch_1266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1266 
Symbol 
ID8543648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp1668731 
End bp1669705 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content70% 
IMG OID646385984 
ProductRNA polymerase, sigma-24 subunit, ECF subfamily 
Protein accessionYP_003265719 
Protein GI262194510 
COG category[K] Transcription 
COG ID[COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.68316 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.012517 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG AAGCTGTCGC GCCGCCCGCC GATCTCGCCG TCGCCCGCGA GCAATTCCTG 
GCCCTGGTCG ACACGGTACG GCCCGATCTG CACCGCTACG CCTCGCGCCT CGTGGGCTCG
GCCATCGACG GCGAAGACGT GGTGCAAGAC ACCCTGGCCA AGGCCCTGTA CGCGCTCAGC
CAGTCGTCCG AGATACCGCC GCTGCGGCCG TGGTTGTTTC GCATCGCCCG CAACACGGCC
ATCGACCTGC TGCGCCGCTA CGAGCGCAAA CACGCCACCC CGCTACCGCC CGAGGAGCAC
ATGCCGAGCG ACGAGGAACC CCGCGATCCC GAGGTACTGC GCGCCGCCAT CGCCACCTTC
ATGACCCTGC CGCTGAGCCA GCGCAGCGCC GTCATCCTCA AGGACGTGCT CGGCGAGTCG
GTGGACGACA TCGCCGGCCA CCTCGATACC AGCGCGGCCG CGGTCAAGTC GCTGCTGGTG
CGCGGACGCC AGGCGCTCAA AGCGCGCCTG GCCGAGGCCG AGTCGGGGGC CGAGGCGGAC
GCTGGGGCTG CGGCCGCGGA CGCGATGTCG CCCGAGCACC GCGCGCTGGT GCATCGCTAC
GTGGATCTGT TCAATCAGCG CGACTGGGAC GGTGTGCAGG CGCTGTTGCT CGACGAGTGT
CGGCTCGACC TGGTGTCCAA GTCGCAGCGC AAGGGCAAAG CCGTGGGCAT GTATTTCGGA
CGCTATGCCC AGGAGCGCGA GCTGCTGTTT CGCGCCGGAC GCTGCGAGGG ACGCAGCGCC
ATCGGCGTGT TTCGCCGGGC CGAGGCCAGC GCCGAGAGCG GGGCCGCGAG CCGGGCCGCT
AGCGGCGCCG AGACCCTGGT CTACATCATC CTGCTCGAGA GCGAGGACGG CCGCGTGAGC
TTCATCCGCG ATTTCCGCTA CGTGCCCTAT CTCGCCGGCG AGCTCGACTT CGTGGCCGAC
GCCGCTCACT CGTAG
 
Protein sequence
MSTEAVAPPA DLAVAREQFL ALVDTVRPDL HRYASRLVGS AIDGEDVVQD TLAKALYALS 
QSSEIPPLRP WLFRIARNTA IDLLRRYERK HATPLPPEEH MPSDEEPRDP EVLRAAIATF
MTLPLSQRSA VILKDVLGES VDDIAGHLDT SAAAVKSLLV RGRQALKARL AEAESGAEAD
AGAAAADAMS PEHRALVHRY VDLFNQRDWD GVQALLLDEC RLDLVSKSQR KGKAVGMYFG
RYAQERELLF RAGRCEGRSA IGVFRRAEAS AESGAASRAA SGAETLVYII LLESEDGRVS
FIRDFRYVPY LAGELDFVAD AAHS