Gene Hoch_3979 details


Gene Information

Locus tag: Hoch_3979
Symbol:
ID: 8546375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5483513
End bp: 5484547
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 62%
IMG OID: 646388651
Product: DNA-directed RNA polymerase, alpha subunit
Protein accession: YP_003268371
Protein GI: 262197162
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
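
The coordinate and length fields in this record are mutually consistent, which a short Python sketch can confirm. It assumes (the record does not state this explicitly) that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5483513
end_bp = 5484547

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1035 bp
print(protein_length, "aa")   # 344 aa
```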


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.254038
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGAGCG CTACCGCCAC CACGACTTCT ACGACCGAGC AGACGCCCTT CATGGCGAAG 
AACTGGCGTG ACCTGATCCG GCCGCGCATG CTCGAGACCG AGCAGAAGAC CGATAATTAC
GGCAAGTTCA GCTGTGAGCC GCTGGAGCGC GGCTTCGGCA CCACGCTCGG CAACTCCTTG
CGCCGCGTCC TGCTCTCGTC GCTGCAGGGT GCCGCCTTCA CGCACGTGAA GATCGAGCAC
GCGCTGCACG AGTTCTCGTC GCTTCCCGGG GTGGTGGAGG ATGTCACTGA CATCATCCTC
AACCTCAAGG AGACGATGCT CAAGGTCGAG GAAGACCGCG TCTACACCGT GCGCCTGGAG
AAGGAGGGCG AGGGCCCCTT CACCGCCGGC GACATCAGCA CGGTGACCGG CGTGAACATT
CTCAACCCCG ATCACGTCAT CGGACACCTG GCCCGCGACG GCAAGATCTC GATGGAGCTG
ATCATCGGCA CCGGCCGCGG CTACGTCTCG GCCGAGCGCC ACACCACCAG CCCGGGCGTC
GGCTACGTGC CGATCGACGC GCTGTACTCG CCGATCCGCA AGGCCAACTT CACGGTCACC
AACGCCCGCG TTGGACAGCA GACCGACTAC GATAAACTCA CCCTCGAGGT CTGGACCAAC
GGCGGCGTCA CCCCCGACGA CGCCGTCGCG TTCGCGGCCA AGATCCTCAA AGAGCAGCTC
AACATCTTCA TCAACTTCGA GGAGCAGGCC GAGCCGGTCG AGCAACACGT CGATGAGGAG
CAGGAGAAGC TCAACGAGAA TCTCTGGCGG ACCGTGGACG AGCTCGAGCT GTCGGTGCGC
TCGGCGAACT GCTTGCAGAA CGCCAACATC AAGTACATCG GCGAGCTGGT CCAGAAGTCC
GAGTCCGAGA TGCTCAAGAC CAAGAACTTC GGCCGCAAGT CGCTCAAAGA GATCAAGGAG
ATCCTCGCCG AGATGGGGCT CTCCTTGGGT ATGAAGCTCG ACAACTGGCC GGGGACGAAT
CCGCTCAAGA AATAG
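
The GC content field can be recomputed from the gene sequence. The following is a minimal Python sketch, assuming the sequence is pasted as displayed here (blocks of 10 bases separated by spaces); `gc_percent` is a hypothetical helper name, not part of any database tool. It is applied to the first row only; passing all rows joined together would give the gene-wide value.

```python
def gc_percent(sequence_text):
    """Return the percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(sequence_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)

# First row of the gene sequence, copied from this record.
first_row = "ATGGAGAGCG CTACCGCCAC CACGACTTCT ACGACCGAGC AGACGCCCTT CATGGCGAAG"
print(f"{gc_percent(first_row):.1f}% GC")
```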
 
Protein sequence
MESATATTTS TTEQTPFMAK NWRDLIRPRM LETEQKTDNY GKFSCEPLER GFGTTLGNSL 
RRVLLSSLQG AAFTHVKIEH ALHEFSSLPG VVEDVTDIIL NLKETMLKVE EDRVYTVRLE
KEGEGPFTAG DISTVTGVNI LNPDHVIGHL ARDGKISMEL IIGTGRGYVS AERHTTSPGV
GYVPIDALYS PIRKANFTVT NARVGQQTDY DKLTLEVWTN GGVTPDDAVA FAAKILKEQL
NIFINFEEQA EPVEQHVDEE QEKLNENLWR TVDELELSVR SANCLQNANI KYIGELVQKS
ESEMLKTKNF GRKSLKEIKE ILAEMGLSLG MKLDNWPGTN PLKK
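
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a hedged illustration in Python: it builds the standard codon-to-amino-acid mapping, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons), and translates the first and last rows of the gene sequence. The names `CODON_TABLE` and `translate` are illustrative, not taken from any library.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna_text):
    """Translate DNA in frame 1, ignoring whitespace; '*' marks a stop codon."""
    seq = "".join(dna_text.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First and last rows of the gene sequence, copied from this record.
first_row = "ATGGAGAGCG CTACCGCCAC CACGACTTCT ACGACCGAGC AGACGCCCTT CATGGCGAAG"
last_row = "CCGCTCAAGA AATAG"

print(translate(first_row))  # MESATATTTSTTEQTPFMAK
print(translate(last_row))   # PLKK*
```

Both results agree with the start (MESATATTTS TTEQTPFMAK) and end (PLKK) of the protein sequence listed above, with the trailing `*` corresponding to the TAG stop codon.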