Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1866 |
Symbol | |
ID | 5733755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2201175 |
End bp | 2203613 |
Gene Length | 2439 bp |
Protein Length | 812 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641279010 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001544637 |
Protein GI | 159898390 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAAA GCTTCGGTTA CTGGCTTAAA CAGCGGCGTA AAGAGCTTAA TTTTACCCAA GAATATTTAG CCGAGTTGGT AAGCTGTTCA ACCATTACTA TTCGCAAAAT CGAGTCGAAT GAACGGCGAC CTTCGCGCCA GATTGCGGCT CGGATCGCTA AATTTTGTCA AGTAGAAGCC AATCGGGCCT TTGTTGATGC AGCATGGGCT GGGCAATCGC CTAGCCCATC TGATGGTGGC TCGCCACCTG AGCCAGCTCC TTCGAACTTA CTGCCCCCAT TTAGTTCAAT CATTGGCCGC GATTCGGCGA TTGAATCAAT TTGTGTCCAA TTTCAAGCCC AAAAAGCCCG TTTAGTTACG ATTGTCGGCT CACCTGGCGT TGGCAAAACC CGCCTGGCCC AAGCCATTGG CCAACAGTTA CTCACACATT TTAGCGATGG CGTTTTTTGG ATTAGCCTCG ATCCAATCGT CAATGCTAGC CTTGTCCCAT CGTTAATTAC GCGGGTACTC GGCATTCACG AAAACCCCAA TCAATCGATC GAAGAAACAA TTTTCAACTG GCTCAAAAAT CGCCATTTGC TCCTCATTCT CGATAATTGT GAGCATATTA TTGAGTTGCG CCAGTTTGTA AACCAACTGT TAAGTTATTG TCCAACCCTC TCGATTCTTG CGACCAGCCG CGAAGTGTTG CATTTGCGCT GGGAACAGCG CTTTCCATTG CGCCCGCTGA CGGTTCCAGT ACGCGGTATG CAGCTTGATC TCGCGCAACT GGCCCAAATT CCAGCGATTG CGCTATTTTT AGAGCGCAGT CGGGCGATCA ATCCTCAGGC CGAGTTGAAT GCATCGAATG CCCGAGCAAT TAGCACGATT TGTATGCAGC TTGAAGGCTT GCCGCTAAGT ATTGAGTTAA TTGCTGCGCG TAGCGCCATG CTCAGCCCTC AAATGCTGGT GCATCGGCTG AATAATCAAT TGAATGTACT GACCCAAGGC TCGCGCGATT TGCCTCATCG CCAACAAACC TTGCGCAATG CCATCCAATG GAGTATCGAT TTGCTCGATA GTGCCGAGCA ATTTCTGCTG GTAGCGCTGG CATTAGCTCC CGAAAGTTGT ACCCTCCTGA GCCTAGAAGC GCTTGCTGAT TGCTATAGCC CGTGGCCGTG GTCGATTTTC GATGGCCTAA CCAACTTGTT CGATAAAAGC CTAATTTGGA TTCAGCAGCA GCAAACTGAT GAGCCACGCT TTGGGATGTT GCGGGTTTTG CGTGAATATG TGCTTGAGCA GCTTGCTGAG CCAACAACCA TTCAGCAATT ACGCCAAAGT TTTGCCAGCT ACTACCTGAA TATTGCCGAA ACCATTTATC AAAAGATGCT CAACTCGCGT ACCAATAGCC TTTTTCAAGA GATTGCCGCT GAGTATTATA ATTTTCACAC CGTTATCACA TGGTGCCTTG AACCACCATA TGATCTGGAA AATGCGATTA AGCTAATTGC GACGTTAATC GATTTTCTAC ATCTCTATGG CTATCAACGC GAGGGGATTA GTTGGTTACA ACACATTTTG GGCCTGATTG AACAACAAAC AGTCACGCTT AGCCCAGCGA TTCTGGCCGA TGCCTATAAC GCCTTAGGCT TTTTATACTA CCATCAGGGC AATATTAACC AAGCCCAACA CTTTTTTGAG CGCGTATTGG AGCTTATTGG CGGCCACACA TCGTTTAAAC ATGCACGAAT TTTGTATAAT TTAGGTTTAG TTAAAAAGAA CAAAGGCGAA TTTCTTCAGG CCGAGGCCGA TTTACAAGCC AGTTTAGCAA GTTGGCGCAC CCTTGGTTTA CAGCCAGGCG AAGCCTATTC GCTCTGGGGG TTGGGCAGTT TAGCCCTCGA CCAAGGCCTC TATACTCATG GGTTAACCTA CCTGCAACAA AGCTTGGCAA TTTGGCAAAC GCTTGAATCA ACTCATGGAC AAGTGATGGT GTTAAGTGAT TTGGCCGAGT TAGCCTTACT ACAAGCCAAT CCGCATGAGG CTGAGCAAAT ATTAGCCCAG ATTAAAACGA TTGTTGAGGC CAGCAATTAT ACAATCACAA GTTCACGTAT AGCCTTGCTC GAAGGTAAAT GTGCGATGCA ACGCCACGAT TTTAGCCATG CCCAAACCTG CTTCGAAGAA GCCGAGGAGA TCGCTGAAGA ACAGCAATCA ACCGCCTATT TAGCCAAAAT CCACCTCGAA CAGGCTAAAC TGGCTTTGGT GCAGGCACAC TATCATCAGG CCAGTTATCA TGGCTATGAA GGGTTGCGCC TAGCGACCAT GCTTGAACAT CAGACTGGGA TTGCCAAGGC CCACCACGTG CTGGCCCAAG TCTATCAGCA GTTGGCCAAT CCGAGTCAAG CCGAGCAACA TTGGCAAGCT TATGCAGCAA TTTATCAACA CGTTGGTTTA GTGCCATAA
|
Protein sequence | MSESFGYWLK QRRKELNFTQ EYLAELVSCS TITIRKIESN ERRPSRQIAA RIAKFCQVEA NRAFVDAAWA GQSPSPSDGG SPPEPAPSNL LPPFSSIIGR DSAIESICVQ FQAQKARLVT IVGSPGVGKT RLAQAIGQQL LTHFSDGVFW ISLDPIVNAS LVPSLITRVL GIHENPNQSI EETIFNWLKN RHLLLILDNC EHIIELRQFV NQLLSYCPTL SILATSREVL HLRWEQRFPL RPLTVPVRGM QLDLAQLAQI PAIALFLERS RAINPQAELN ASNARAISTI CMQLEGLPLS IELIAARSAM LSPQMLVHRL NNQLNVLTQG SRDLPHRQQT LRNAIQWSID LLDSAEQFLL VALALAPESC TLLSLEALAD CYSPWPWSIF DGLTNLFDKS LIWIQQQQTD EPRFGMLRVL REYVLEQLAE PTTIQQLRQS FASYYLNIAE TIYQKMLNSR TNSLFQEIAA EYYNFHTVIT WCLEPPYDLE NAIKLIATLI DFLHLYGYQR EGISWLQHIL GLIEQQTVTL SPAILADAYN ALGFLYYHQG NINQAQHFFE RVLELIGGHT SFKHARILYN LGLVKKNKGE FLQAEADLQA SLASWRTLGL QPGEAYSLWG LGSLALDQGL YTHGLTYLQQ SLAIWQTLES THGQVMVLSD LAELALLQAN PHEAEQILAQ IKTIVEASNY TITSSRIALL EGKCAMQRHD FSHAQTCFEE AEEIAEEQQS TAYLAKIHLE QAKLALVQAH YHQASYHGYE GLRLATMLEH QTGIAKAHHV LAQVYQQLAN PSQAEQHWQA YAAIYQHVGL VP
|
| |