Gene Haur_3404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3404 
Symbol 
ID5735265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4286733 
End bp4287818 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content51% 
IMG OID641280551 
Productanti-sigma-factor antagonist 
Protein accessionYP_001546168 
Protein GI159899921 
COG category[T] Signal transduction mechanisms 
COG ID[COG1366] Anti-anti-sigma regulatory factor (antagonist of anti-sigma factor) 
TIGRFAM ID[TIGR00377] anti-anti-sigma factor 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCAC TACTGGCATG GTTAGCTCAG GTTGATCATC CGCTTGAGGA TGTGCGGCGA 
CGTGGCCAAA TTATCATCAG CGTGATGTGT GCGCTCCTCG TTATTGTTGT GATTGCCATT
CCTTCCCTGC TCTTTAATCC GAAACCATTG ACCTCGGCTG TGGCTTTGGG CTTAGGCTTG
GTTTTGGCGG TGGTGGTGAT TCCGTTGGCG CGGCGTGGCA AAGTAACGTT TGCTGGCTGG
ATTGTGGTGA TTACCACCTC CGTGATCGTC TCGATTCCGA TGATGTTGCG CGGCGAAACC
AGCTATACCC TAGCCTATTT ACTGGTTCCA GTCTTGATTG CTGGGGTCGT GCTGCGCCCA
TGGCAAATTT TAATGGTCTT GGTTGGGGCA TGGGTGATTA TTGGCACATT GGCGTTTGTC
TATCCCACAA CCGATCAGGT TGCAACAACT GGTGGGGTGG TTACCCATGC GATCTTGATT
ACCATGATTG GTTCGATTAT TAGTTTTGTG AATAGCCGAA TTACGGTTGG GGCTTTTCAT
GCGGTTTCCG AAGGCCAGCG CACGATCGAG CAAAATGCCA AACAATTAGT CGAATTAAAC
TCCTCGTTGG AAGCCCAAGT CCATGAGCGG ACTGCCGACC TTGAAAACGC TCTCATGAAA
TTGCAAGATC GGGCTAGCAC CCAAGAACGC TTGCTTGATG AGATTGAACA GCAGCGTGAA
GTGATTCGCG AGATGAGCGT GCCAGTTTTG CCAGTGGCGG CCAAAGTATT GGTGATGCCA
TTGATTGGAG CGCTCGACAG TGAACGCCTG ACCCGCCTGC AAGAGAATGC CTTACAAGCG
GTGCAGCGTC AGTCGATCAA ACATTTGGTG CTGGATATTA CCGGAGTTGT GGTGGTTGAT
AGCCAAGTTG CCCAAGGCTT TATTAGCGTG GTGCGCTCAG TACGCTTGCT TGGCGCTGAA
ACCATGTTGG TCGGGATTCG ACCTGAAGTT GCCCAAGCCA TGGTTTCGTT AGGGCTTGAA
TTGGATTCAA TTAGTACCTC GGCGACCTTG CAAGAAGGCT TGCAACGCAT CCCCAACTAT
AATTAA
 
Protein sequence
MKALLAWLAQ VDHPLEDVRR RGQIIISVMC ALLVIVVIAI PSLLFNPKPL TSAVALGLGL 
VLAVVVIPLA RRGKVTFAGW IVVITTSVIV SIPMMLRGET SYTLAYLLVP VLIAGVVLRP
WQILMVLVGA WVIIGTLAFV YPTTDQVATT GGVVTHAILI TMIGSIISFV NSRITVGAFH
AVSEGQRTIE QNAKQLVELN SSLEAQVHER TADLENALMK LQDRASTQER LLDEIEQQRE
VIREMSVPVL PVAAKVLVMP LIGALDSERL TRLQENALQA VQRQSIKHLV LDITGVVVVD
SQVAQGFISV VRSVRLLGAE TMLVGIRPEV AQAMVSLGLE LDSISTSATL QEGLQRIPNY
N