Gene Haur_4058 details


Gene Information

Locus tag: Haur_4058
Symbol:
ID: 5735916
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5181528
End bp: 5182640
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 48%
IMG OID: 641281209
Product: anti-sigma-factor antagonist
Protein accession: YP_001546818
Protein GI: 159900571
COG category: [T] Signal transduction mechanisms
COG ID: [COG1366] Anti-anti-sigma regulatory factor (antagonist of anti-sigma factor)
TIGRFAM ID: [TIGR00377] anti-anti-sigma factor
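
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5181528, 5182640)
print(length, protein_length_aa(length))
```

With the values from this record, the result agrees with the listed 1113 bp and 370 aa.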


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGAAA CCATCGCACC GCCCCGTCAG ATTGAAGATA TGCTGCAAAC CTCGAATGAG 
CAACGGATTA AGTTGTTTAG TTTGGTTGAA GGTTTAGCAT TGTTGGCCGT TATGCTGATT
TTGCTGGTTG CTGGCAACGG CCAACAAGTG CAAATTGGCT TGAGCATTTG TGCGATTGTG
GGCACGATGT TGGCCGTTAC CTTTGCCTTG CGGCGTTCGC GCTATGTCGA ATGGCTGATT
TACGTTAATC TAGCCTCGAT TACCTTGATG CTTTCGTTAA CTGGGCCAAT TATTGGCGAA
GTTGATGGCT CGGTCTGGGT TTTGTTTCAA GTTCCACCAT TAATTGCCAC AATTGTGTTA
AACACTTCGC GGGCAACCAC TGTTATTTGC GTTGCATCAT CAATTATTCT TTCGATTATT
ATTGGTGGAG AGTTAGCTGG TTTTATTCCA ATTAAATTTT TGGTGCCTAA ATCGGCCTTA
TTGATCAACT TCTTTTGCCA ATTGCTTGTA CTGGGAATTA TTGCCACGAC CGTGCATTTA
TTGGTTGGGC GGGCCAAACG TGCGTTTGCC GTGGTGGCCA AAACTGAGGC CAAGTTGGCC
CAGCAATTAG CCAACGAGCG TGAATTGGCG CTGCAACGTG AGCAACTGAA TGCCCAATTA
AGCCAAAGCC TGGCTGAAAT TAGCCAGCGC GATAGCCAAA TTCAAGCCGA ACAAGCGGCT
CAGGCGGCTT TGCGCGACCA ATTGCGTCAA CTGAGTTTGC CTGTGATTCC AGTACTCAAA
CAAACTGTGG TGATGCCATT AATCGGTGAG CAATTAGCCA ACTCCAGCGA GGGGATTGAA
GAAACCTTGC TGAATGGGAT TAGCCAGCAT CGCGCCAAAA TTGCCATTTT AGATGTAACC
GGGGTTCCTA CGATTGATAC CGAGCTTGGG CGGCGCTTAA TTCAAGCCAC GGCGGCGGCG
CGTTTGCTTG GAGTTCAGAC GATTATCGCC GGAATTCGGC CTGAGGTTGC CCAAACCTTG
GTGAGTTTGG GGATTGATTT TAGCAGCGTG ACCACTGTTG CCAGCTTGCA AGATGGCGTT
GCAATGGCGA TTAAACGTTT AGGCCTTGGC TAA
 
Protein sequence
MAETIAPPRQ IEDMLQTSNE QRIKLFSLVE GLALLAVMLI LLVAGNGQQV QIGLSICAIV 
GTMLAVTFAL RRSRYVEWLI YVNLASITLM LSLTGPIIGE VDGSVWVLFQ VPPLIATIVL
NTSRATTVIC VASSIILSII IGGELAGFIP IKFLVPKSAL LINFFCQLLV LGIIATTVHL
LVGRAKRAFA VVAKTEAKLA QQLANERELA LQREQLNAQL SQSLAEISQR DSQIQAEQAA
QAALRDQLRQ LSLPVIPVLK QTVVMPLIGE QLANSSEGIE ETLLNGISQH RAKIAILDVT
GVPTIDTELG RRLIQATAAA RLLGVQTIIA GIRPEVAQTL VSLGIDFSSV TTVASLQDGV
AMAIKRLGLG
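
The protein sequence can be checked against the gene sequence by translating the CDS codon by codon. The following is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is given 5' to 3' on the coding strand and begins with ATG. Translation table 11 shares its codon assignments with the standard genetic code and differs only in the permitted start codons, so a plain standard-code lookup suffices for this record. The name `translate_cds` is hypothetical.

```python
from itertools import product

# Standard codon assignments in the conventional TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base grouping and line breaks used in the listing)
    is removed before translation.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGGCAGAAA CCATCGCACC GCCCCGTCAG ATTGAAGATA TGCTGCAAAC CTCGAATGAG"
print(translate_cds(first_line))
```

Applied to the first line of the gene sequence, this gives the first 20 residues of the listed protein; passing the full gene sequence allows the whole protein to be compared the same way.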