Gene Haur_3681 details


Gene Information

Locus tag: Haur_3681
Symbol:
ID: 5735560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4629680
End bp: 4633333
Gene Length: 3654 bp
Protein Length: 1217 aa
Translation table: 11
GC content: 52%
IMG OID: 641280833
Product: SARP family transcriptional regulator
Protein accession: YP_001546445
Protein GI: 159900198
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
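
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The record states neither, although both assumptions reproduce the listed lengths. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4629680
END_BP = 4633333
GENE_LENGTH_BP = 3654
PROTEIN_LENGTH_AA = 1217


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```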


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.134161
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGATT GCTCGCTCTA CCTATTTGGC TCGCCACGCC TCCAATATGA AACCCAACAG 
ATTGCGCTTG GCAATGGCCC GCTAGCCGCA TTATTGGTAT TTTTAGCCTT AAATCGCCAG
CAGCCCATTA GTCGTTTGCG CTTGGCGGCG ACAATCTGGC CCGATTTGCC CGAACCTCAG
GCGCGACGAG CACTTTCGGC GACACTCTAT CGGCTACGCC AAAATAGCCC AAGTTGTGAT
GAATGGCTCT TGGCGAACGC CCAAACCTTG CAATTAGGCT CGATTTGGTG CGATCTCCAA
GCGTTTCAGC AAGGTGCGCA AAGCAATCAA AGCATCGATT GGCAATTGGC ATGTCAATTA
TATCGTGATG AATTATTGCC TGATGTTGAT GCCGAATGGC TCGATGGCCC CCGCGCTGAT
TTGCAAGCAC TGTTTGCCGC CACAATTGCC AAATTGGCGC ACCATGCCTT TGCCCAAGGC
GATTATAGTG TGGCGCTAAC AAGTGCTGAG CGTTGGCAAG CGTTCGATCC CTTCGATGAA
CAAGCCTGTA TGCTCCTGAT GCGTTGCCAT GTAAGCCTTG GCCGTCCGCA CTTGGCATTA
ACCGCCTATC GTCATTTGTG CGAACGTTTG GCTGCTGATT TGGGCGTTGA GCCAGTTGCC
GAAACCAGTG CCCTCGCCGA GCAAATTCGG GCTGAACAAA CCTTGCGTCA AACCACGATT
GGCGGTAGTT GGCAGAAAAT ACCATTTGTG GGTCGCACGC ATGAACGTTC GTTGTTGCTT
GATGGCTTGG ATGCTGCCTT GAGTCATAGT GGTGGCTTGG TGCTGCTCGA TGGCGCTGCT
GGAATTGGCA AAACCCGTTT ATTGCACGAA TTGAGCCAAG CAGCCCAAGC GCGTGGTTTT
CGGGTGGCTT GGGGTGCGGG TCAAATTAAT GGTGGCAATT CGCCCTACGC TCCACTTGAT
CAAGCTCTGA CCCAAATCCT TGATCAAAGC CTGCTTGACC AATTTGATCC GATTACACGG
CTTGGCTTGT CGGCGCTTGT GCCAAATTTA CCCCAATTGA GTCAGCACGA TCACGCGAAT
CGTCCGAGTT TGCCTGCGGC GTTGTTGAAT GCCTTACTCG CAGCGACCCA GCAAGCACCA
TTATTATTAG TACTAGATGA TATGCATTGG GCGCAACGGA TTGTATTTAA TGTGCTGCAA
TGGTCGCCGA GCCAAATTCA GGCACTCTTA CGCTCCCGTT TATTGCTGAT CTTGGCCTAT
CGCGGTAGCG AATCGAGCCA TATGTTGCAA GCCTTGCGCC GCATGCGTCA AGAATTGCCA
GTTCAATCAG TTAAATTAAA TGGCTTGGCG CTGCACGATT TTCAACATCT GGTGGGGCGC
TTGTGGCCGA ATCATGTGCC GATTCCGAGC ATGGCCGAAA TTGCCGCTTT ACATGCCCTG
ACTGCGGGCA ATCCACTCTT TTTGCAAGAG CAATTATTAC ATCGTGGCGA TGCTCATGAG
CACTCATTTC AAGCATTGGT GGCGCAGCGA GTGCAGGCCT TGCCAATGTT GGCGCATCGG
GCTTTGGCAG CGGCCAATGC CTTGGGTCGT AGCTGGACAT TAGCGGCTTG GCGGTTTGTC
GCAGGCACTG AGGTCGATCA AGCCATCCCT GATCTTTTAG ATGCGCGGCT GGTGGCGCAG
ACACATGCAG GTTTTCAGTT TTACCACGAT CTGATTGCCG TGGCGGTTGA GCAAAGTTTA
GCACTGGAAT TATGCCAAGC AGCGGTGCAA CAGGCCGCCA ATTATTTTGA GCAACAGCCA
CTTTGTCGGC CCGAAACAAT TGCTTGGGCC TATGAGCGAG CACAACGCTG GGCGGCGGCG
ATTGGAGCCT ATCAACAAGC TGGCGAACAA GCCTTGCAAG CCTATGCCTA CACGACAGCC
CTCGATTATG CCAATCGGGC TTTGGAGTTA TATCAGCAAA TACCAGTTGA TGCCCGTTTG
GAATTGACCT TATTGCGCTT GCGTCAACGG GTTTTGGTGT TTTTGGGCCA GCTAGAAGTT
TGGCGGGCCG ACGTTGAGCG TCTAGAAACT TTGGCTTTGA GTTTGGGCGA TCAAGCGGCA
TTGCTCGAAG TGTATGAAAG CCGGATTGTG CTCTCTTCAG TTGATTCTAA CCCAACTGAA
ATGGCGAGTA TTGCCGAGCG AGCTTTAGAT TTGGCTCAAG CTCAGCAATT GCCAGCGGTA
GAAGCCCGCA TTTTAAATAC CTATGGCTTT CATTTAATTA GTAGCGCCGC AGTCCAGCCA
CGCAATAGTT TGCCATTGCT TGAACGAGCA GTTGCCCTAG CCCGTAGCAG CCACGATGAT
ACGGTCTTGG TGGCCGCGCT CTGCTCATTA GCCTTTGCCT ACCGGATGCT GGGCGATACC
AGCATAGCTC AGCGGATTGC CGCCGAAGCC TTGACCTTGA CCGAATTACA CCCCTATTTG
TATCCAGCGC GGGGCAATGT GTTGCGGGTA TTGAGCGAAG TTGGCATTAG CTATGCCGAT
TGGGAAACCG CGCTTAGCAC GATGAACAAA TCAATTGAAT TGCTCGAAGC GCTTGATGAT
ATTTGGCTTT TGGGTGTGGG TTTGTTCATG TCAACCTTCA TCACGACTGC GCTTGGCTTG
ACCGAAATGG CCCAAGCGAC GACTCAGCGT ATTCGCCAGA TGATTCGTGA TTCCAAAATG
CCGCCCAACT CCAATTGGTC GTTTTTTGCC CATAGCGTCA CAATTTTGGT GGCCTTAGAA
GCTGGCGATT TTGAGCAAGC CGAAACGGTG GTGCAAGAAG TGCAGCCTTG GCTTGATCAG
GCGCATCAAC AAGGCGCAGG GTTATATCTA CTCTCGGCAA TTGGCGCAAT GGAGATTTTT
CGCAATCGGC CCGACCAAGC CTTGCCTTTG TTGCGGCGGG CTACAGCCAT GTGGCAACAA
GCCCGATCAG CCTTTTTGCA ACCAATTTTG ATGCATGCCT TAGCTGCTCA ATTATGTGGT
TTTGCTGCCG AAGCCCAAGC GATGTTGGCC GAAGCGGAAG CTATGTACGA CCCCAAAGAG
CTGTTTTATG CCGATGTATT GCTGCATTTC ACGCGGTTTT GGGTGTATGG TGATCGGCTA
CATTTGCAAC ATGCCTACAG CTCAATTCAC AATCAGGCCA ATCGTTTCCG TGACCCAACC
CTGCGTGATT CATTTATAAA CAATGTTAAG TTGCATGGCA TGGTGCTGCA ATTGCAACGA
GTTGCGCCGC TGGCTGGGGC GATTCGCTCG ATGGCTGGCC TGTGGATGCG CCTGACGCGG
GTGTATGGCC GTACTCAAAT GCTTTCCGAG GGCGGCTATA TTCAACGCAA AGTGTTGTTA
GTTCGCGCTG ATGTGCCCTT GGGCAAATCA CTGAGCCACA CCGATCGGGT TGAGGTAATT
TGGACTTTGC ATGCGCCTGA GGATAATCGC TTTAACGATC GCAGCGAATT GCGCATCCAT
CGTTTGCAGC GCCTACTTGA TGAAGCCGAC GATGCTGGAG CCGCCCCAAC CGACGACGAT
TTAGCTGATG CTTTGGCCGT TAGTCGGCGC ACAATCATTC GTGATATGGC CTTGTTGCAA
TCCCAAGGCT CGGCTGTGAG CACCCGCCGC CGTCGCGCAG TGGGCGAGGA GTGA
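
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be removed before the sequence can be analysed. The sketch below shows one way to do that and to compute a GC fraction like the one reported in the GC content field. To stay short it runs on the first row of the sequence only, so its result describes that row and not the whole gene. The helper names are illustrative.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGCAAGATT GCTCGCTCTA CCTATTTGGC TCGCCACGCC TCCAATATGA AACCCAACAG"
seq = clean_sequence(first_row)
print(len(seq), gc_fraction(seq))
```

Passing the full sequence text to the same two functions would give the gene-wide value to compare with the GC content field.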
 
Protein sequence
MQDCSLYLFG SPRLQYETQQ IALGNGPLAA LLVFLALNRQ QPISRLRLAA TIWPDLPEPQ 
ARRALSATLY RLRQNSPSCD EWLLANAQTL QLGSIWCDLQ AFQQGAQSNQ SIDWQLACQL
YRDELLPDVD AEWLDGPRAD LQALFAATIA KLAHHAFAQG DYSVALTSAE RWQAFDPFDE
QACMLLMRCH VSLGRPHLAL TAYRHLCERL AADLGVEPVA ETSALAEQIR AEQTLRQTTI
GGSWQKIPFV GRTHERSLLL DGLDAALSHS GGLVLLDGAA GIGKTRLLHE LSQAAQARGF
RVAWGAGQIN GGNSPYAPLD QALTQILDQS LLDQFDPITR LGLSALVPNL PQLSQHDHAN
RPSLPAALLN ALLAATQQAP LLLVLDDMHW AQRIVFNVLQ WSPSQIQALL RSRLLLILAY
RGSESSHMLQ ALRRMRQELP VQSVKLNGLA LHDFQHLVGR LWPNHVPIPS MAEIAALHAL
TAGNPLFLQE QLLHRGDAHE HSFQALVAQR VQALPMLAHR ALAAANALGR SWTLAAWRFV
AGTEVDQAIP DLLDARLVAQ THAGFQFYHD LIAVAVEQSL ALELCQAAVQ QAANYFEQQP
LCRPETIAWA YERAQRWAAA IGAYQQAGEQ ALQAYAYTTA LDYANRALEL YQQIPVDARL
ELTLLRLRQR VLVFLGQLEV WRADVERLET LALSLGDQAA LLEVYESRIV LSSVDSNPTE
MASIAERALD LAQAQQLPAV EARILNTYGF HLISSAAVQP RNSLPLLERA VALARSSHDD
TVLVAALCSL AFAYRMLGDT SIAQRIAAEA LTLTELHPYL YPARGNVLRV LSEVGISYAD
WETALSTMNK SIELLEALDD IWLLGVGLFM STFITTALGL TEMAQATTQR IRQMIRDSKM
PPNSNWSFFA HSVTILVALE AGDFEQAETV VQEVQPWLDQ AHQQGAGLYL LSAIGAMEIF
RNRPDQALPL LRRATAMWQQ ARSAFLQPIL MHALAAQLCG FAAEAQAMLA EAEAMYDPKE
LFYADVLLHF TRFWVYGDRL HLQHAYSSIH NQANRFRDPT LRDSFINNVK LHGMVLQLQR
VAPLAGAIRS MAGLWMRLTR VYGRTQMLSE GGYIQRKVLL VRADVPLGKS LSHTDRVEVI
WTLHAPEDNR FNDRSELRIH RLQRLLDEAD DAGAAPTDDD LADALAVSRR TIIRDMALLQ
SQGSAVSTRR RRAVGEE
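
The protein sequence above is the translation of the gene sequence under translation table 11. The sketch below builds a codon table and translates the first and last rows of the gene sequence, which can then be compared with the start and end of the protein sequence. It relies on table 11 assigning the same amino acids to codons as the standard code and differing only in which start codons are allowed. That is enough here because this gene starts with ATG, but the sketch does not handle alternative start codons. The names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate dna codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGCAAGATT GCTCGCTCTA CCTATTTGGC TCGCCACGCC TCCAATATGA AACCCAACAG"
last_row = "TCCCAAGGCT CGGCTGTGAG CACCCGCCGC CGTCGCGCAG TGGGCGAGGA GTGA"

print(translate(clean_sequence(first_row)))
print(translate(clean_sequence(last_row)))
```

Translating rows separately works here only because each full row holds 60 bases, a multiple of 3, so every row starts in frame.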