Gene Haur_4702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4702 
Symbol 
ID5736549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6006995 
End bp6009832 
Gene Length2838 bp 
Protein Length945 aa 
Translation table11 
GC content52% 
IMG OID641281866 
ProductXRE family transcriptional regulator 
Protein accessionYP_001547461 
Protein GI159901214 
COG category[K] Transcription 
COG ID[COG2909] ATP-dependent transcriptional regulator 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGCAG TTACCCTCGA ATCTTTCAGC ACGTTTGGAG CGCTGCTGCA TTTTCTGCGA 
CGCAGAGCAC GCCTAACCCA GCGCGATTTA GCGATTGCCA CGGGCTATAG CGAGGCCCAT
ATCTCGCGCT TGGAAAACGA CCAGCGTTTG CCCGACCTCA CAACCTTAGT AGCATTAATG
GTTCCAGCGC TCGATCTCAG TGATGATCCC GCAAGCGTCG CGCGTTTGTT GGAGCTAGCA
GCGGCGGCTC GTGGCGAGTC ATTGGTTGGC ACGAGCGTCA CGGTCACCAA AAAAATCGAG
CAGCAACAAC ACAGCGAACT TGGTTTGCTG GATTTGCCTC CACCCTTGCC CCCATTTTTA
ATCGAACGCC AGGCCACCAA CCACGTGCAA CAACGCTTAG CCAACGAACG ATGTTTGTGC
ATCAATGGGC TAGCTGGAGT TGGCAAAACC GTGATCGCCA GCCAAATCGC CCAAAGTTGG
GGCGAGCGCT GCTTTTGGCT GAGTTTTACG CCCACGCTGA GCCTTTCGAG CGAAATTTTG
ATTCGCCAGT TGGCGCTATT TTTGTTGAGC CACGGCGACG ATCAGGTTGA GCCGTTATTG
CACTTGCCGC GTGATGGCGA AGCTGGTTTG AGTTTTGAAC GTCAGCTAGG CTTGTTGATC
AACGGTTTGC AGCACATTCC GGCTTTGCTC TGTTTTGATA ATGCCCAATT GCTGATTGAT
CAACCGCAAC TGCGTTTGAT GCTTGAACAG CTAGCCCAAA AAACCACCAG CCAAATCTTG
CTACTGAGCC GCGAACAATT CAATTTACAA GGCTTTAGTT ATTGGTCGTT GCATGGATTA
GAATTAATTG AAGCTCAACG CTTGCTCAAG CATTATGGAA CGCAACTTAG CCCAACCCAC
AGCCAACAAT TAATCGAACG CACTCAAGCG AATCCGGCAT TATTGCGCTT AACTATTGGT
TTATTGGGCG ATCGCGGAGC CGATGAAGCT TTGATGCAGC ATTTGATCGA CGAGCCACAT
ATTGCCAGTT TTGTGCTCGA CCAGATGCTT GGTCAATTGC CGAGCAGCAG CGAACGCTTG
CTAGCCTTGT TGGCGGTTGC TGCTCAGCCA CTAAACTTGC ATGCTGAATG GCCGATGGAA
TGTAGTTTTG CCGTCGATGG GCCATATCAA TGGCAACAGG CCATGAGCGA ACTGACCCGC
CGCCAGTTGA TTGATGCCCC CAGCCATGCC TATGTGCTGC CGCTGGTACA AGAATATATT
TATGCCCAAT TGCGCAGTCA GCCCAGTCGC CGCAAAGCAC TGCACCAGCA ACTTGCGAGA
GTTTTCGAAG ATCAACGAAT TGACCCAATT CGGGCCGCCC AGCATTATCT TGCGGCGGGC
GATGTGCCTG CGGCGCTCAA CAGCCTGCAA CAACAACTTG ATCCCTTGAT GAATCAGGGT
TTATCGAGCG CGGCGGCGGC AATTTTGCAA AGCATTCGCC CAACGATTGA GCAACAGCAT
CCTGAATTGT TGTTTGCGTG GTTGCAAAGT TATGGTGAAT TTTTGATGGC TACCAGCCAA
GCTAGCGAAG CCGAAGCCAT GTATCGCGAA GCGTTGGCTT TGGCCTGGCA ACCAAATCAA
CGGGCACATT TGGTCTGGCG CTTAACTGGG GCCATGTTGC ATCGTAATCA AGCTACTGCC
GCCCAAAGCT TGCTCGAACA AACCATTGCC ACGCTTGATT CGCGTGAGGT GCGCTTGCAT
GGCTTGTTAC AAATGGCACT GAGCAAAGCG ATTTTGATGC AATCGCAATT TGCGCTTGCT
CGTCAAGCCG CCGAACAAGC GATTGCCCTG ACCAGCCAAC TTGATCCTGA GGCAATTATG
ACGATCGCTG AAATTCGGGC GCGGGCTGGC GGAACCTTGG CAATTGTGCA GCAATATACT
GGCGAAGTCG ATGCCAGCAT TCAAACATGG CTCGATGTGA TTGCCCAAAC CCGCATTGCC
CGCTTGGAGC GGGTGCGACC ACGGGCCTTT GTAAATTTGG CTAATCTCTA TTACACCAAA
GGCGATTTAA ACCGCGCCGA AGCCACGATT AACGATGCGG TTACAGGCTT GCGTCGAATT
GGTGATGTCT ATGCTCAAGC CCGCATGCAA CACACCCAAG CGATTATTCA AATGATGCGC
GGCCAGCCCC AAGCGGCGTT ACAAACCCTC GAACAAGCCT GCGCGATCAA ACAACAGATC
GACGATCAGC AAGGTTGGTA CAATTCGCGC AACCAAATTG CCATGACCTT GTTGGCCTTG
GGTCAAACCG AGCAAGCTGA AGCCATCGCC CAAAATTTAT TGCATATGCT TGGCGAGGGT
GGCGAGCCAT TTTTCCGAGG CGTAATTTTG GATACTTTGG CGATTGGGCA GTTGTTGCGT
GGAGATCTAC GGGCAGCGCG GCAAAGCCTT GATCGGATTG CAGCCATGCC CATTGCCCAA
AGCAACAATT TATTGCGCAT GGCTTGGGAG CGGCGTTCAG TCTTGTTGAT GTTGCTCACT
GAAGGGGCTG AGCATGCTCG CACGGTGTTT AGCCATAGTT TGCCGCTCGC CGGCAACGGC
GAAATTGCCC TCGATCATGC ATGCCTTGAT GCCTTGATTA CGCAAGCTGC TGGTGATGAG
CTAGCTGCCC AACGCCAATG GCAACAACTG GCCCAACGGG CAGCCAGCAA TGGCTACGAA
TATTATCGAA TTGTGGCCGA AGCACAGTTA CAAGCACCAA GCCATGTTAG CTTGCAACAA
CGCGTGCTCC AAATGCACCA ACCATTCGCT CCCAATTGTT GGTATCAACC AGCCTTGGCA
CAGGCAGTCA ATGATTAA
 
Protein sequence
MPAVTLESFS TFGALLHFLR RRARLTQRDL AIATGYSEAH ISRLENDQRL PDLTTLVALM 
VPALDLSDDP ASVARLLELA AAARGESLVG TSVTVTKKIE QQQHSELGLL DLPPPLPPFL
IERQATNHVQ QRLANERCLC INGLAGVGKT VIASQIAQSW GERCFWLSFT PTLSLSSEIL
IRQLALFLLS HGDDQVEPLL HLPRDGEAGL SFERQLGLLI NGLQHIPALL CFDNAQLLID
QPQLRLMLEQ LAQKTTSQIL LLSREQFNLQ GFSYWSLHGL ELIEAQRLLK HYGTQLSPTH
SQQLIERTQA NPALLRLTIG LLGDRGADEA LMQHLIDEPH IASFVLDQML GQLPSSSERL
LALLAVAAQP LNLHAEWPME CSFAVDGPYQ WQQAMSELTR RQLIDAPSHA YVLPLVQEYI
YAQLRSQPSR RKALHQQLAR VFEDQRIDPI RAAQHYLAAG DVPAALNSLQ QQLDPLMNQG
LSSAAAAILQ SIRPTIEQQH PELLFAWLQS YGEFLMATSQ ASEAEAMYRE ALALAWQPNQ
RAHLVWRLTG AMLHRNQATA AQSLLEQTIA TLDSREVRLH GLLQMALSKA ILMQSQFALA
RQAAEQAIAL TSQLDPEAIM TIAEIRARAG GTLAIVQQYT GEVDASIQTW LDVIAQTRIA
RLERVRPRAF VNLANLYYTK GDLNRAEATI NDAVTGLRRI GDVYAQARMQ HTQAIIQMMR
GQPQAALQTL EQACAIKQQI DDQQGWYNSR NQIAMTLLAL GQTEQAEAIA QNLLHMLGEG
GEPFFRGVIL DTLAIGQLLR GDLRAARQSL DRIAAMPIAQ SNNLLRMAWE RRSVLLMLLT
EGAEHARTVF SHSLPLAGNG EIALDHACLD ALITQAAGDE LAAQRQWQQL AQRAASNGYE
YYRIVAEAQL QAPSHVSLQQ RVLQMHQPFA PNCWYQPALA QAVND