Gene Haur_4940 details


Gene Information

Locus tag: Haur_4940
Symbol:
ID: 5736776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 6268003
End bp: 6268998
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 53%
IMG OID: 641282107
Product: DNA-directed RNA polymerase subunit alpha
Protein accession: YP_001547698
Protein GI: 159901451
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
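
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 6268003
END_BP = 6268998
GENE_LENGTH_BP = 996
PROTEIN_LENGTH_AA = 331


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 996
    print(coding_codons(GENE_LENGTH_BP))   # 331
```

Under these assumptions, End bp - Start bp + 1 reproduces the listed gene length, and 996 / 3 - 1 reproduces the listed protein length.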


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.193797
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTAGACA TTGCAATGCC CAAACTCGAA TGTGTCGCAG CGGCGGAGAA CTATGGGCGG 
TTCAAAATTG AGCCTCTGGA TCCCGGGTAC GGGCACACAT TAGGGAACGC GCTGCGACGG
GTGTTGTTGT CGTCCATCCC TGGGGCAGCG GTCATCAAGA TTAAAATTGA TGGGATTTTC
CACGAGTTTT CGCCAATTCA AGGCGTGCGT GAAGATGCTA CCGAGATTGT GTTAAATGTG
AAAGGCATTC GTTTGCGCTC CTACGCCGAA CGCCCAGTCA AGATGATTTT ATCCAAGACT
GGCCCAGGCG TGGTGCGGGC AAGCGATATT GAATGCCCTT CAAATGTGGA GATCGTTAAC
CCCGATCACT ACATCGCAAC CCTCGATAGC TCCGATGCTC GCTTGGACAT GGAGTTGACC
GTGGAACGTC ACCGCGGCTA TCTGCCAGCT GAAAATCGTG ACCCGGTGCC GATTGGCGAA
ATCCCGGTGG ATGCGATCTT CACGCCGGTG CACAAGGTTA ACTACGTTGT GGAACACACC
CGCATTGGCG GGATGACCGA CTTCGACCGT TTGTTGCTTG AAATTTGGAC TGATGGCACG
GTTAAGCCAG GCGATGCTCT CAGCTACGCC GCGCAAGTGC TGGTTCAATA TTCATCGATC
ATCGCCGATT TCAATCGCTT TGACGATGAA CAAGAGCAAG TTGGCGATGC CAATGGTCTG
GTTATTCCCA GCGAGATCTA CGATATGCCT ATCGAAGATC TTGACCTTTC AACCCGCACC
TACAACTGTC TCAAGCGTGC CGACATCACC AAGGTTGGGC AAGTGCTGGA GATGGACGAG
AAGCAACTGC TGGCTGTGCG GAACTTGGGG CAAAAATCCA TGGACGAAAT TCGCGAGAAA
TTAATCGAGC GCAATTTACT GCCAACCTTG CCTTTCAACT CCGCTATTCT GAACACCAAT
GTCGCTGCCC GTCTGAACGA CGGTAGCGCT GAATAG
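
A GC content figure such as the one listed under Gene Information is conventionally the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the plain (G + C) / length definition; how the page itself rounds or computes the value is not stated. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string.

    Whitespace (the 10-base grouping used in the listing above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First 10-base group of the gene sequence: 3 G + 2 C out of 10 bases.
    print(gc_percent("GTGCTAGACA"))  # 50.0
```

Passing the full gene sequence, with or without the spaces and line breaks, would give the gene-wide value.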
 
Protein sequence
MLDIAMPKLE CVAAAENYGR FKIEPLDPGY GHTLGNALRR VLLSSIPGAA VIKIKIDGIF 
HEFSPIQGVR EDATEIVLNV KGIRLRSYAE RPVKMILSKT GPGVVRASDI ECPSNVEIVN
PDHYIATLDS SDARLDMELT VERHRGYLPA ENRDPVPIGE IPVDAIFTPV HKVNYVVEHT
RIGGMTDFDR LLLEIWTDGT VKPGDALSYA AQVLVQYSSI IADFNRFDDE QEQVGDANGL
VIPSEIYDMP IEDLDLSTRT YNCLKRADIT KVGQVLEMDE KQLLAVRNLG QKSMDEIREK
LIERNLLPTL PFNSAILNTN VAARLNDGSA E
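
The gene sequence starts with GTG while the protein sequence starts with M. Under NCBI translation table 11 (the table listed for this gene), GTG is one of the permitted alternative start codons, and an initiating codon is read as methionine. The sketch below illustrates this with a small, dependency-free translator. The function `translate_cds` is a hypothetical helper, and the start-codon set is the table 11 set as I understand it from the NCBI genetic code listing; treat it as an assumption to verify against that listing.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for table 11 in NCBI codon order: TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed start codons for table 11 (to be checked against the NCBI listing).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate_cds("GTGCTAGACA TTGCAATGCC CAAACTCGAA"))  # MLDIAMPKLE
```

The output for the first 30 bases matches the first ten residues of the protein sequence, and the final two codons of the gene (GAA TAG) translate to E followed by a stop, which matches the last residue.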