Gene Haur_4375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4375 
Symbol 
ID5736932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5589214 
End bp5590764 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content53% 
IMG OID641281537 
Productprotein serine phosphatase with GAF(s) sensor(s) 
Protein accessionYP_001547135 
Protein GI159900888 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGTG CTCGCCGCCA TCGCGATGTG ATTAGCATCA GCCTGTTGCT CTCGGTCATT 
ACGATTGTGA TGTACCTTGG TGAGGGGCGT TGGGCGGCTG CGCCTGCGTT ACGCTACCTC
TATTTGATTC CAATCGCCCA AGCTGCGATG GGCTTTGGCT TGATGGGCAG CATGGCCGTG
GCGATTTTGG CTGATCTGCT GTTTGCGCCC TTGGTTGCCA CGGCCTTGGC CAAATATGGG
ATGTTTGGTG CTCCTACAGT TGAAATTATT GTGACCCTCG TTTTGATGCC GGTCTTGGCC
TATTTTGCGG GCAGCGGTTG GGGTCGGCTT AGTCGCCAGC GCGAGCTTTA TCAATTTTTG
AGCCGCATGG GCGATTTATT TGGCCGTTCG CTACCCCGCG ATCAACTGCT CGCCGAGATT
TTGCAAGAGG GTGGCCTGCT GATCGATGCT CAGGGCGGCG AAATTATTTT GCTTGAGCAA
GGCCAAGCGC GAATTGCCGC TAGCTGGGGA ATTGAAGCCC AAGCTACTGC CGCCTACCAA
ACCAGCCTCG CCGCCTATAT TTTAAAACGC AATGAGCCAT GGTCAGCCAC CAGCCTCGAA
AATAACAGCG ATTTTCAGCG TGTCGGTTTT GGTCAACGGA TTGACGCTGC CCTAGCTGTG
CCATTACGCT TAGAAGGTAA GCCGATTGGC CTGTTGGCGT TTTATAATCG GCCTGGCGGG
TTTAGCAAAC AAGAGCAAGC CACAGTCGAG GCTATGGGCA GCAAAGTCGA AGTTGTGCTA
GAGAATTTTC GCCAAGTCGA GGAGCGCTCT GAACGCGCCC GCTTGCAGCG CGAGTTTGAT
TTGGCGGCTG AGGTGCAGCA ACGCTTTTTG CCCCAGCAAT TACCAGCGAT CAGCGGCTAT
GAAATTGCTG GTTTTACTCA GCCCGCCCGT GAGGTTGGCG GCGATTTTTT CAATGTGCTG
AGTTTGCCCG ATGGCCGTTG GTATATCGCG GTTGGCGATG TGTCGGGCAA GGGCGTGGTT
GGCGCATTTT TCATGGCCAT CGCCATGAGC GTCATCGATT TACATTTGCA AGAGGGCCAA
TCAACCACCC AACTAAGCCT GGCCAATCGG CTTAATCCGT TGTTTTATCG GCGGATGGCC
CAGCAAAAAA TCAATACAGG TTTGGCCTAT GCCTTGCTTG ATGCTAAAAC TGGCCATATG
CAGTTAGGCA ACGCTGGCTT GATTGCGCCG TTGCATGTGC GCAAAAATGG CGAATGCGAT
TATCTCGATT TGACCGGCTT TCCCTTAGGT GCGGTTGCTC AGGCTGAATA TAGCGAATCG
GTGCTAGAGC TTCAGGCTGG TGAAAGTTTG ATTTTTATCA GCGATGGTGT GGTTGAAGCC
GCCAACCATG ATCGCGAATT ATTTGGCTTG AATCGACTGC GCAACTTGAT TAGTATGCTG
AGCCAGCGTC CGGCCTCGCA GTTGGTGAGC GAAATTATGA ACGCCGCCAA TCGCTGGAGC
GACGGCCAAT ACCAAGATGA TATGACCGTC GTGGTACTCC GTCGGATCTA A
 
Protein sequence
MNRARRHRDV ISISLLLSVI TIVMYLGEGR WAAAPALRYL YLIPIAQAAM GFGLMGSMAV 
AILADLLFAP LVATALAKYG MFGAPTVEII VTLVLMPVLA YFAGSGWGRL SRQRELYQFL
SRMGDLFGRS LPRDQLLAEI LQEGGLLIDA QGGEIILLEQ GQARIAASWG IEAQATAAYQ
TSLAAYILKR NEPWSATSLE NNSDFQRVGF GQRIDAALAV PLRLEGKPIG LLAFYNRPGG
FSKQEQATVE AMGSKVEVVL ENFRQVEERS ERARLQREFD LAAEVQQRFL PQQLPAISGY
EIAGFTQPAR EVGGDFFNVL SLPDGRWYIA VGDVSGKGVV GAFFMAIAMS VIDLHLQEGQ
STTQLSLANR LNPLFYRRMA QQKINTGLAY ALLDAKTGHM QLGNAGLIAP LHVRKNGECD
YLDLTGFPLG AVAQAEYSES VLELQAGESL IFISDGVVEA ANHDRELFGL NRLRNLISML
SQRPASQLVS EIMNAANRWS DGQYQDDMTV VVLRRI