Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4375 |
Symbol | |
ID | 5736932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5589214 |
End bp | 5590764 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281537 |
Product | protein serine phosphatase with GAF(s) sensor(s) |
Protein accession | YP_001547135 |
Protein GI | 159900888 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGTG CTCGCCGCCA TCGCGATGTG ATTAGCATCA GCCTGTTGCT CTCGGTCATT ACGATTGTGA TGTACCTTGG TGAGGGGCGT TGGGCGGCTG CGCCTGCGTT ACGCTACCTC TATTTGATTC CAATCGCCCA AGCTGCGATG GGCTTTGGCT TGATGGGCAG CATGGCCGTG GCGATTTTGG CTGATCTGCT GTTTGCGCCC TTGGTTGCCA CGGCCTTGGC CAAATATGGG ATGTTTGGTG CTCCTACAGT TGAAATTATT GTGACCCTCG TTTTGATGCC GGTCTTGGCC TATTTTGCGG GCAGCGGTTG GGGTCGGCTT AGTCGCCAGC GCGAGCTTTA TCAATTTTTG AGCCGCATGG GCGATTTATT TGGCCGTTCG CTACCCCGCG ATCAACTGCT CGCCGAGATT TTGCAAGAGG GTGGCCTGCT GATCGATGCT CAGGGCGGCG AAATTATTTT GCTTGAGCAA GGCCAAGCGC GAATTGCCGC TAGCTGGGGA ATTGAAGCCC AAGCTACTGC CGCCTACCAA ACCAGCCTCG CCGCCTATAT TTTAAAACGC AATGAGCCAT GGTCAGCCAC CAGCCTCGAA AATAACAGCG ATTTTCAGCG TGTCGGTTTT GGTCAACGGA TTGACGCTGC CCTAGCTGTG CCATTACGCT TAGAAGGTAA GCCGATTGGC CTGTTGGCGT TTTATAATCG GCCTGGCGGG TTTAGCAAAC AAGAGCAAGC CACAGTCGAG GCTATGGGCA GCAAAGTCGA AGTTGTGCTA GAGAATTTTC GCCAAGTCGA GGAGCGCTCT GAACGCGCCC GCTTGCAGCG CGAGTTTGAT TTGGCGGCTG AGGTGCAGCA ACGCTTTTTG CCCCAGCAAT TACCAGCGAT CAGCGGCTAT GAAATTGCTG GTTTTACTCA GCCCGCCCGT GAGGTTGGCG GCGATTTTTT CAATGTGCTG AGTTTGCCCG ATGGCCGTTG GTATATCGCG GTTGGCGATG TGTCGGGCAA GGGCGTGGTT GGCGCATTTT TCATGGCCAT CGCCATGAGC GTCATCGATT TACATTTGCA AGAGGGCCAA TCAACCACCC AACTAAGCCT GGCCAATCGG CTTAATCCGT TGTTTTATCG GCGGATGGCC CAGCAAAAAA TCAATACAGG TTTGGCCTAT GCCTTGCTTG ATGCTAAAAC TGGCCATATG CAGTTAGGCA ACGCTGGCTT GATTGCGCCG TTGCATGTGC GCAAAAATGG CGAATGCGAT TATCTCGATT TGACCGGCTT TCCCTTAGGT GCGGTTGCTC AGGCTGAATA TAGCGAATCG GTGCTAGAGC TTCAGGCTGG TGAAAGTTTG ATTTTTATCA GCGATGGTGT GGTTGAAGCC GCCAACCATG ATCGCGAATT ATTTGGCTTG AATCGACTGC GCAACTTGAT TAGTATGCTG AGCCAGCGTC CGGCCTCGCA GTTGGTGAGC GAAATTATGA ACGCCGCCAA TCGCTGGAGC GACGGCCAAT ACCAAGATGA TATGACCGTC GTGGTACTCC GTCGGATCTA A
|
Protein sequence | MNRARRHRDV ISISLLLSVI TIVMYLGEGR WAAAPALRYL YLIPIAQAAM GFGLMGSMAV AILADLLFAP LVATALAKYG MFGAPTVEII VTLVLMPVLA YFAGSGWGRL SRQRELYQFL SRMGDLFGRS LPRDQLLAEI LQEGGLLIDA QGGEIILLEQ GQARIAASWG IEAQATAAYQ TSLAAYILKR NEPWSATSLE NNSDFQRVGF GQRIDAALAV PLRLEGKPIG LLAFYNRPGG FSKQEQATVE AMGSKVEVVL ENFRQVEERS ERARLQREFD LAAEVQQRFL PQQLPAISGY EIAGFTQPAR EVGGDFFNVL SLPDGRWYIA VGDVSGKGVV GAFFMAIAMS VIDLHLQEGQ STTQLSLANR LNPLFYRRMA QQKINTGLAY ALLDAKTGHM QLGNAGLIAP LHVRKNGECD YLDLTGFPLG AVAQAEYSES VLELQAGESL IFISDGVVEA ANHDRELFGL NRLRNLISML SQRPASQLVS EIMNAANRWS DGQYQDDMTV VVLRRI
|
| |