Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4702 |
Symbol | |
ID | 5736549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6006995 |
End bp | 6009832 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281866 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001547461 |
Protein GI | 159901214 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGCAG TTACCCTCGA ATCTTTCAGC ACGTTTGGAG CGCTGCTGCA TTTTCTGCGA CGCAGAGCAC GCCTAACCCA GCGCGATTTA GCGATTGCCA CGGGCTATAG CGAGGCCCAT ATCTCGCGCT TGGAAAACGA CCAGCGTTTG CCCGACCTCA CAACCTTAGT AGCATTAATG GTTCCAGCGC TCGATCTCAG TGATGATCCC GCAAGCGTCG CGCGTTTGTT GGAGCTAGCA GCGGCGGCTC GTGGCGAGTC ATTGGTTGGC ACGAGCGTCA CGGTCACCAA AAAAATCGAG CAGCAACAAC ACAGCGAACT TGGTTTGCTG GATTTGCCTC CACCCTTGCC CCCATTTTTA ATCGAACGCC AGGCCACCAA CCACGTGCAA CAACGCTTAG CCAACGAACG ATGTTTGTGC ATCAATGGGC TAGCTGGAGT TGGCAAAACC GTGATCGCCA GCCAAATCGC CCAAAGTTGG GGCGAGCGCT GCTTTTGGCT GAGTTTTACG CCCACGCTGA GCCTTTCGAG CGAAATTTTG ATTCGCCAGT TGGCGCTATT TTTGTTGAGC CACGGCGACG ATCAGGTTGA GCCGTTATTG CACTTGCCGC GTGATGGCGA AGCTGGTTTG AGTTTTGAAC GTCAGCTAGG CTTGTTGATC AACGGTTTGC AGCACATTCC GGCTTTGCTC TGTTTTGATA ATGCCCAATT GCTGATTGAT CAACCGCAAC TGCGTTTGAT GCTTGAACAG CTAGCCCAAA AAACCACCAG CCAAATCTTG CTACTGAGCC GCGAACAATT CAATTTACAA GGCTTTAGTT ATTGGTCGTT GCATGGATTA GAATTAATTG AAGCTCAACG CTTGCTCAAG CATTATGGAA CGCAACTTAG CCCAACCCAC AGCCAACAAT TAATCGAACG CACTCAAGCG AATCCGGCAT TATTGCGCTT AACTATTGGT TTATTGGGCG ATCGCGGAGC CGATGAAGCT TTGATGCAGC ATTTGATCGA CGAGCCACAT ATTGCCAGTT TTGTGCTCGA CCAGATGCTT GGTCAATTGC CGAGCAGCAG CGAACGCTTG CTAGCCTTGT TGGCGGTTGC TGCTCAGCCA CTAAACTTGC ATGCTGAATG GCCGATGGAA TGTAGTTTTG CCGTCGATGG GCCATATCAA TGGCAACAGG CCATGAGCGA ACTGACCCGC CGCCAGTTGA TTGATGCCCC CAGCCATGCC TATGTGCTGC CGCTGGTACA AGAATATATT TATGCCCAAT TGCGCAGTCA GCCCAGTCGC CGCAAAGCAC TGCACCAGCA ACTTGCGAGA GTTTTCGAAG ATCAACGAAT TGACCCAATT CGGGCCGCCC AGCATTATCT TGCGGCGGGC GATGTGCCTG CGGCGCTCAA CAGCCTGCAA CAACAACTTG ATCCCTTGAT GAATCAGGGT TTATCGAGCG CGGCGGCGGC AATTTTGCAA AGCATTCGCC CAACGATTGA GCAACAGCAT CCTGAATTGT TGTTTGCGTG GTTGCAAAGT TATGGTGAAT TTTTGATGGC TACCAGCCAA GCTAGCGAAG CCGAAGCCAT GTATCGCGAA GCGTTGGCTT TGGCCTGGCA ACCAAATCAA CGGGCACATT TGGTCTGGCG CTTAACTGGG GCCATGTTGC ATCGTAATCA AGCTACTGCC GCCCAAAGCT TGCTCGAACA AACCATTGCC ACGCTTGATT CGCGTGAGGT GCGCTTGCAT GGCTTGTTAC AAATGGCACT GAGCAAAGCG ATTTTGATGC AATCGCAATT TGCGCTTGCT CGTCAAGCCG CCGAACAAGC GATTGCCCTG ACCAGCCAAC TTGATCCTGA GGCAATTATG ACGATCGCTG AAATTCGGGC GCGGGCTGGC GGAACCTTGG CAATTGTGCA GCAATATACT GGCGAAGTCG ATGCCAGCAT TCAAACATGG CTCGATGTGA TTGCCCAAAC CCGCATTGCC CGCTTGGAGC GGGTGCGACC ACGGGCCTTT GTAAATTTGG CTAATCTCTA TTACACCAAA GGCGATTTAA ACCGCGCCGA AGCCACGATT AACGATGCGG TTACAGGCTT GCGTCGAATT GGTGATGTCT ATGCTCAAGC CCGCATGCAA CACACCCAAG CGATTATTCA AATGATGCGC GGCCAGCCCC AAGCGGCGTT ACAAACCCTC GAACAAGCCT GCGCGATCAA ACAACAGATC GACGATCAGC AAGGTTGGTA CAATTCGCGC AACCAAATTG CCATGACCTT GTTGGCCTTG GGTCAAACCG AGCAAGCTGA AGCCATCGCC CAAAATTTAT TGCATATGCT TGGCGAGGGT GGCGAGCCAT TTTTCCGAGG CGTAATTTTG GATACTTTGG CGATTGGGCA GTTGTTGCGT GGAGATCTAC GGGCAGCGCG GCAAAGCCTT GATCGGATTG CAGCCATGCC CATTGCCCAA AGCAACAATT TATTGCGCAT GGCTTGGGAG CGGCGTTCAG TCTTGTTGAT GTTGCTCACT GAAGGGGCTG AGCATGCTCG CACGGTGTTT AGCCATAGTT TGCCGCTCGC CGGCAACGGC GAAATTGCCC TCGATCATGC ATGCCTTGAT GCCTTGATTA CGCAAGCTGC TGGTGATGAG CTAGCTGCCC AACGCCAATG GCAACAACTG GCCCAACGGG CAGCCAGCAA TGGCTACGAA TATTATCGAA TTGTGGCCGA AGCACAGTTA CAAGCACCAA GCCATGTTAG CTTGCAACAA CGCGTGCTCC AAATGCACCA ACCATTCGCT CCCAATTGTT GGTATCAACC AGCCTTGGCA CAGGCAGTCA ATGATTAA
|
Protein sequence | MPAVTLESFS TFGALLHFLR RRARLTQRDL AIATGYSEAH ISRLENDQRL PDLTTLVALM VPALDLSDDP ASVARLLELA AAARGESLVG TSVTVTKKIE QQQHSELGLL DLPPPLPPFL IERQATNHVQ QRLANERCLC INGLAGVGKT VIASQIAQSW GERCFWLSFT PTLSLSSEIL IRQLALFLLS HGDDQVEPLL HLPRDGEAGL SFERQLGLLI NGLQHIPALL CFDNAQLLID QPQLRLMLEQ LAQKTTSQIL LLSREQFNLQ GFSYWSLHGL ELIEAQRLLK HYGTQLSPTH SQQLIERTQA NPALLRLTIG LLGDRGADEA LMQHLIDEPH IASFVLDQML GQLPSSSERL LALLAVAAQP LNLHAEWPME CSFAVDGPYQ WQQAMSELTR RQLIDAPSHA YVLPLVQEYI YAQLRSQPSR RKALHQQLAR VFEDQRIDPI RAAQHYLAAG DVPAALNSLQ QQLDPLMNQG LSSAAAAILQ SIRPTIEQQH PELLFAWLQS YGEFLMATSQ ASEAEAMYRE ALALAWQPNQ RAHLVWRLTG AMLHRNQATA AQSLLEQTIA TLDSREVRLH GLLQMALSKA ILMQSQFALA RQAAEQAIAL TSQLDPEAIM TIAEIRARAG GTLAIVQQYT GEVDASIQTW LDVIAQTRIA RLERVRPRAF VNLANLYYTK GDLNRAEATI NDAVTGLRRI GDVYAQARMQ HTQAIIQMMR GQPQAALQTL EQACAIKQQI DDQQGWYNSR NQIAMTLLAL GQTEQAEAIA QNLLHMLGEG GEPFFRGVIL DTLAIGQLLR GDLRAARQSL DRIAAMPIAQ SNNLLRMAWE RRSVLLMLLT EGAEHARTVF SHSLPLAGNG EIALDHACLD ALITQAAGDE LAAQRQWQQL AQRAASNGYE YYRIVAEAQL QAPSHVSLQQ RVLQMHQPFA PNCWYQPALA QAVND
|
| |