Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3681 |
Symbol | |
ID | 5735560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4629680 |
End bp | 4633333 |
Gene Length | 3654 bp |
Protein Length | 1217 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280833 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001546445 |
Protein GI | 159900198 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.134161 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGATT GCTCGCTCTA CCTATTTGGC TCGCCACGCC TCCAATATGA AACCCAACAG ATTGCGCTTG GCAATGGCCC GCTAGCCGCA TTATTGGTAT TTTTAGCCTT AAATCGCCAG CAGCCCATTA GTCGTTTGCG CTTGGCGGCG ACAATCTGGC CCGATTTGCC CGAACCTCAG GCGCGACGAG CACTTTCGGC GACACTCTAT CGGCTACGCC AAAATAGCCC AAGTTGTGAT GAATGGCTCT TGGCGAACGC CCAAACCTTG CAATTAGGCT CGATTTGGTG CGATCTCCAA GCGTTTCAGC AAGGTGCGCA AAGCAATCAA AGCATCGATT GGCAATTGGC ATGTCAATTA TATCGTGATG AATTATTGCC TGATGTTGAT GCCGAATGGC TCGATGGCCC CCGCGCTGAT TTGCAAGCAC TGTTTGCCGC CACAATTGCC AAATTGGCGC ACCATGCCTT TGCCCAAGGC GATTATAGTG TGGCGCTAAC AAGTGCTGAG CGTTGGCAAG CGTTCGATCC CTTCGATGAA CAAGCCTGTA TGCTCCTGAT GCGTTGCCAT GTAAGCCTTG GCCGTCCGCA CTTGGCATTA ACCGCCTATC GTCATTTGTG CGAACGTTTG GCTGCTGATT TGGGCGTTGA GCCAGTTGCC GAAACCAGTG CCCTCGCCGA GCAAATTCGG GCTGAACAAA CCTTGCGTCA AACCACGATT GGCGGTAGTT GGCAGAAAAT ACCATTTGTG GGTCGCACGC ATGAACGTTC GTTGTTGCTT GATGGCTTGG ATGCTGCCTT GAGTCATAGT GGTGGCTTGG TGCTGCTCGA TGGCGCTGCT GGAATTGGCA AAACCCGTTT ATTGCACGAA TTGAGCCAAG CAGCCCAAGC GCGTGGTTTT CGGGTGGCTT GGGGTGCGGG TCAAATTAAT GGTGGCAATT CGCCCTACGC TCCACTTGAT CAAGCTCTGA CCCAAATCCT TGATCAAAGC CTGCTTGACC AATTTGATCC GATTACACGG CTTGGCTTGT CGGCGCTTGT GCCAAATTTA CCCCAATTGA GTCAGCACGA TCACGCGAAT CGTCCGAGTT TGCCTGCGGC GTTGTTGAAT GCCTTACTCG CAGCGACCCA GCAAGCACCA TTATTATTAG TACTAGATGA TATGCATTGG GCGCAACGGA TTGTATTTAA TGTGCTGCAA TGGTCGCCGA GCCAAATTCA GGCACTCTTA CGCTCCCGTT TATTGCTGAT CTTGGCCTAT CGCGGTAGCG AATCGAGCCA TATGTTGCAA GCCTTGCGCC GCATGCGTCA AGAATTGCCA GTTCAATCAG TTAAATTAAA TGGCTTGGCG CTGCACGATT TTCAACATCT GGTGGGGCGC TTGTGGCCGA ATCATGTGCC GATTCCGAGC ATGGCCGAAA TTGCCGCTTT ACATGCCCTG ACTGCGGGCA ATCCACTCTT TTTGCAAGAG CAATTATTAC ATCGTGGCGA TGCTCATGAG CACTCATTTC AAGCATTGGT GGCGCAGCGA GTGCAGGCCT TGCCAATGTT GGCGCATCGG GCTTTGGCAG CGGCCAATGC CTTGGGTCGT AGCTGGACAT TAGCGGCTTG GCGGTTTGTC GCAGGCACTG AGGTCGATCA AGCCATCCCT GATCTTTTAG ATGCGCGGCT GGTGGCGCAG ACACATGCAG GTTTTCAGTT TTACCACGAT CTGATTGCCG TGGCGGTTGA GCAAAGTTTA GCACTGGAAT TATGCCAAGC AGCGGTGCAA CAGGCCGCCA ATTATTTTGA GCAACAGCCA CTTTGTCGGC CCGAAACAAT TGCTTGGGCC TATGAGCGAG CACAACGCTG GGCGGCGGCG ATTGGAGCCT ATCAACAAGC TGGCGAACAA GCCTTGCAAG CCTATGCCTA CACGACAGCC CTCGATTATG CCAATCGGGC TTTGGAGTTA TATCAGCAAA TACCAGTTGA TGCCCGTTTG GAATTGACCT TATTGCGCTT GCGTCAACGG GTTTTGGTGT TTTTGGGCCA GCTAGAAGTT TGGCGGGCCG ACGTTGAGCG TCTAGAAACT TTGGCTTTGA GTTTGGGCGA TCAAGCGGCA TTGCTCGAAG TGTATGAAAG CCGGATTGTG CTCTCTTCAG TTGATTCTAA CCCAACTGAA ATGGCGAGTA TTGCCGAGCG AGCTTTAGAT TTGGCTCAAG CTCAGCAATT GCCAGCGGTA GAAGCCCGCA TTTTAAATAC CTATGGCTTT CATTTAATTA GTAGCGCCGC AGTCCAGCCA CGCAATAGTT TGCCATTGCT TGAACGAGCA GTTGCCCTAG CCCGTAGCAG CCACGATGAT ACGGTCTTGG TGGCCGCGCT CTGCTCATTA GCCTTTGCCT ACCGGATGCT GGGCGATACC AGCATAGCTC AGCGGATTGC CGCCGAAGCC TTGACCTTGA CCGAATTACA CCCCTATTTG TATCCAGCGC GGGGCAATGT GTTGCGGGTA TTGAGCGAAG TTGGCATTAG CTATGCCGAT TGGGAAACCG CGCTTAGCAC GATGAACAAA TCAATTGAAT TGCTCGAAGC GCTTGATGAT ATTTGGCTTT TGGGTGTGGG TTTGTTCATG TCAACCTTCA TCACGACTGC GCTTGGCTTG ACCGAAATGG CCCAAGCGAC GACTCAGCGT ATTCGCCAGA TGATTCGTGA TTCCAAAATG CCGCCCAACT CCAATTGGTC GTTTTTTGCC CATAGCGTCA CAATTTTGGT GGCCTTAGAA GCTGGCGATT TTGAGCAAGC CGAAACGGTG GTGCAAGAAG TGCAGCCTTG GCTTGATCAG GCGCATCAAC AAGGCGCAGG GTTATATCTA CTCTCGGCAA TTGGCGCAAT GGAGATTTTT CGCAATCGGC CCGACCAAGC CTTGCCTTTG TTGCGGCGGG CTACAGCCAT GTGGCAACAA GCCCGATCAG CCTTTTTGCA ACCAATTTTG ATGCATGCCT TAGCTGCTCA ATTATGTGGT TTTGCTGCCG AAGCCCAAGC GATGTTGGCC GAAGCGGAAG CTATGTACGA CCCCAAAGAG CTGTTTTATG CCGATGTATT GCTGCATTTC ACGCGGTTTT GGGTGTATGG TGATCGGCTA CATTTGCAAC ATGCCTACAG CTCAATTCAC AATCAGGCCA ATCGTTTCCG TGACCCAACC CTGCGTGATT CATTTATAAA CAATGTTAAG TTGCATGGCA TGGTGCTGCA ATTGCAACGA GTTGCGCCGC TGGCTGGGGC GATTCGCTCG ATGGCTGGCC TGTGGATGCG CCTGACGCGG GTGTATGGCC GTACTCAAAT GCTTTCCGAG GGCGGCTATA TTCAACGCAA AGTGTTGTTA GTTCGCGCTG ATGTGCCCTT GGGCAAATCA CTGAGCCACA CCGATCGGGT TGAGGTAATT TGGACTTTGC ATGCGCCTGA GGATAATCGC TTTAACGATC GCAGCGAATT GCGCATCCAT CGTTTGCAGC GCCTACTTGA TGAAGCCGAC GATGCTGGAG CCGCCCCAAC CGACGACGAT TTAGCTGATG CTTTGGCCGT TAGTCGGCGC ACAATCATTC GTGATATGGC CTTGTTGCAA TCCCAAGGCT CGGCTGTGAG CACCCGCCGC CGTCGCGCAG TGGGCGAGGA GTGA
|
Protein sequence | MQDCSLYLFG SPRLQYETQQ IALGNGPLAA LLVFLALNRQ QPISRLRLAA TIWPDLPEPQ ARRALSATLY RLRQNSPSCD EWLLANAQTL QLGSIWCDLQ AFQQGAQSNQ SIDWQLACQL YRDELLPDVD AEWLDGPRAD LQALFAATIA KLAHHAFAQG DYSVALTSAE RWQAFDPFDE QACMLLMRCH VSLGRPHLAL TAYRHLCERL AADLGVEPVA ETSALAEQIR AEQTLRQTTI GGSWQKIPFV GRTHERSLLL DGLDAALSHS GGLVLLDGAA GIGKTRLLHE LSQAAQARGF RVAWGAGQIN GGNSPYAPLD QALTQILDQS LLDQFDPITR LGLSALVPNL PQLSQHDHAN RPSLPAALLN ALLAATQQAP LLLVLDDMHW AQRIVFNVLQ WSPSQIQALL RSRLLLILAY RGSESSHMLQ ALRRMRQELP VQSVKLNGLA LHDFQHLVGR LWPNHVPIPS MAEIAALHAL TAGNPLFLQE QLLHRGDAHE HSFQALVAQR VQALPMLAHR ALAAANALGR SWTLAAWRFV AGTEVDQAIP DLLDARLVAQ THAGFQFYHD LIAVAVEQSL ALELCQAAVQ QAANYFEQQP LCRPETIAWA YERAQRWAAA IGAYQQAGEQ ALQAYAYTTA LDYANRALEL YQQIPVDARL ELTLLRLRQR VLVFLGQLEV WRADVERLET LALSLGDQAA LLEVYESRIV LSSVDSNPTE MASIAERALD LAQAQQLPAV EARILNTYGF HLISSAAVQP RNSLPLLERA VALARSSHDD TVLVAALCSL AFAYRMLGDT SIAQRIAAEA LTLTELHPYL YPARGNVLRV LSEVGISYAD WETALSTMNK SIELLEALDD IWLLGVGLFM STFITTALGL TEMAQATTQR IRQMIRDSKM PPNSNWSFFA HSVTILVALE AGDFEQAETV VQEVQPWLDQ AHQQGAGLYL LSAIGAMEIF RNRPDQALPL LRRATAMWQQ ARSAFLQPIL MHALAAQLCG FAAEAQAMLA EAEAMYDPKE LFYADVLLHF TRFWVYGDRL HLQHAYSSIH NQANRFRDPT LRDSFINNVK LHGMVLQLQR VAPLAGAIRS MAGLWMRLTR VYGRTQMLSE GGYIQRKVLL VRADVPLGKS LSHTDRVEVI WTLHAPEDNR FNDRSELRIH RLQRLLDEAD DAGAAPTDDD LADALAVSRR TIIRDMALLQ SQGSAVSTRR RRAVGEE
|
| |