Gene Haur_1608 details

Gene Information

Locus tag: Haur_1608
Symbol:
ID: 5733510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1866787
End bp: 1869195
Gene Length: 2409 bp
Protein Length: 802 aa
Translation table: 11
GC content: 50%
IMG OID: 641278747
Product: XRE family transcriptional regulator
Protein accession: YP_001544379
Protein GI: 159898132
COG category: [R] General function prediction only
COG ID: [COG3903] Predicted ATPase
TIGRFAM ID:
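
The coordinates and lengths in the table are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; neither is stated on this page, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1866787
END_BP = 1869195
GENE_LENGTH_BP = 2409
PROTEIN_LENGTH_AA = 802


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2409
print(expected_protein_length(GENE_LENGTH_BP))  # 802
```

Under these assumptions both results match the Gene Length and Protein Length fields.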


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAGAAG CACGATCATT TGGTCAACAA TTGCGCGACT ATCGCCATCA ACGCCAACTC 
ACTCAAGCGG CTTTGGCCGA GGAAGTTGGC TGCGCCATCG AGAGTATTCG CAAAATGGAG
GCTAATCGCC AGCGACCATC ACGCAGTTTG GCGGCTCGTT TAGCCAGAAT TTTGCAGTTA
TCAGCCGAGC AAAGCCAGAT TTTTTGCGAC CAAGCCCGAA CGGTTGGCAC TGATAGCGCC
AATTCAGCGC CAAAACCAAG TGGCTTGCCA TTAACGGCGA CCAAGCTGAT CAATCGCCAA
ACTGAGCTGG CAACGCTACA AAACTATCTC AACGCTGAGC ATATTCGGAT GATTACGCTG
ACTGGCCCAG GTGGCGTAGG CAAAACCCGC CTTGCGCTGC AAATTGCCCA GCATAGTCAC
AAGCATTTCC CCGATGGGGT GTATTTTGTC GATTTAGCTC AAGCAAGCAG CTTGGCGGAT
ATTGGTTTAG CCCTCAGTCA AACGCTCAAT CTGCCCAGTA GCAAATACGC TTGGCAACGC
CACATTCAAT TGCACTATCA ACAAGCCCGC ATCTTGTTGA TTCTCGATAA TGTTGAGCAA
TTGGTCAGCG CTGCCGAGCA TTTCCGTGGT TTGCTTGACC ATACCAGCCA GCTCAAATTG
CTCTTGACCA GTCGCACGCT GTTGCATTGC GCTGGTGAAT ATGCGATTCC GCTGACACCG
CTGCGCTTGC CAACTGCCGA GGCCAGCCTT AACGAGCTTA AAACCAATCC CGCCGTTCAG
CTTTTTGTCC AACGAGCGCA AACGCTCAAC CCACAGTTTG CCCTGACCAA CCACAACGCC
GAAGCAATCA AACAGCTTTG TTGGCAAGTT GATGGCTTGC CTTTGGCCTT GGAATTGGCG
GCGGCTCGCA CCCGTTTGCT CACGCCTGAA GCCTTGTTGG CTTATTTGCA ACCGCCCTTG
GCCTTGCTCA GCACCAATGA TCCAACGGCT CCAGCTCGCC ACCAAAGTAT GTACAACGCC
ATTAATTGGA GCTATCAGCA AATTTCGCCC AAGCAGCAAC AGCTTTTGCG CCAACTAGCA
ATTTTTCAGG CTGGATGTAC TTTGGATGCA ATTCAGGCTA TCGTGCCAAA CAATAATCAG
CTTGATCTGC TTGAACAATT GGCAGGCTTA ATTGACCATA GTTTGCTGAA CATGCAGGCT
GAAGCTGAAC AGCCGCAACG TTTTAGCATG CTCAGTTTGA TCCACGAATT TGCCGCGCAG
CAATTGGCCG AACAAGCCGA ATTTCCCGAA CTCGCCCAAC AGCATCTCAA TTATTATGTC
ATGTACTGCG AATCGCTCAG CCAACAAGTT TTCACGGCAC GCCAAGCGCT TTTATCGGAG
CGCGAGAATA TTCGGGCTGC AATTAACTGG GCAATCAGTA CTCAGAATTG GGTTGCAGCC
AGCAGTTGCA TTTTGCCCTT GGCCGAATTT TGGTATCGTT ATGGAGCCGC TGAAGAGTTA
CAAACGTGGC TGGCTTGGCT CCGCAGCCAA CCAATTGATT TAGCAACTCA AGCCCGTTGC
AACGAAATGC AGGGCTATAT TGCAGCCTTT TTGCAAAGCC AATATCGCGC TGGTCAGGCG
TGGTATCAAC AAGCGTTGGC GCAACGCCAA GCCCTGCAAC AAGCGGCGGC CATCGCCGAC
AATCTCGCCA AATTGGGCGA AGTTGCGATG GAGCAAGGCC ATTATGCCCA AGCGCTTGAA
CGCTATCGCC ACGCTTGTAG CATGCATGAA CAACTTGGCG ATCAAGCTTC AGTGTTTGCC
ATGCACGATT GCCAAGCTAT GGTCTTGCTG CGTCAAGGTC AATTTGGCCA TGCCCAACAG
CTGTTACAAC AAAGCTTAGA TTATTGGCAG CAACAACAGA TTTTGCCCAG CCTTGCATTT
AGCCTGAATT ACCTTGGGAT GATTGCCTTT TATCAAATGC GCTTGAGCAA AGCCCAACAG
GCGCATGAGC AAGCCTTGGC AATTTGGCAA ACCCTCGATG ATCAACGCGG GATTGCCTCA
GCCTTGAATG CTTTGGCTCC AGTCTTGTTG CACCAAAACC AAACCGCTGC TGCACTGGCA
GCAATCAAGC AAAGCTTGCA AATTCGCTGG AGCCTGCACG ATTACGATGG CCTCGCTTGG
AATTTAGAGC GGTTTGGTGA AATTTTGAGC AAAGTGCATC AAGCTGAATT GGCGATGCAA
TGTTGGAGCA AAGCCAAGCA ACTCCGCGAT GAACTAGCCT TGCCCTTGTT TGAGGCCGAA
CAAAAACGTT TGCAAATCTA CATTAGGCAA ACTAAGCAAC AATTAACCTC CGCTCAAGTG
CAACAGCTTT GGTTGAGCGG CCACAAGGTA GCGTTAGCGC AGCTAATTCA AACCCTCTTA
ATCACTTAA
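
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs cleaning before its length or GC content can be computed. The following is a minimal sketch; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First line of the gene sequence, exactly as displayed.
first_line = "ATGCCAGAAG CACGATCATT TGGTCAACAA TTGCGCGACT ATCGCCATCA ACGCCAACTC"
seq = clean_sequence(first_line)
print(len(seq))               # 60
print(gc_fraction(seq[:10]))  # 0.5
```

Applying the same two functions to the full gene sequence would allow a comparison with the Gene Length and GC content fields.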
 
Protein sequence
MPEARSFGQQ LRDYRHQRQL TQAALAEEVG CAIESIRKME ANRQRPSRSL AARLARILQL 
SAEQSQIFCD QARTVGTDSA NSAPKPSGLP LTATKLINRQ TELATLQNYL NAEHIRMITL
TGPGGVGKTR LALQIAQHSH KHFPDGVYFV DLAQASSLAD IGLALSQTLN LPSSKYAWQR
HIQLHYQQAR ILLILDNVEQ LVSAAEHFRG LLDHTSQLKL LLTSRTLLHC AGEYAIPLTP
LRLPTAEASL NELKTNPAVQ LFVQRAQTLN PQFALTNHNA EAIKQLCWQV DGLPLALELA
AARTRLLTPE ALLAYLQPPL ALLSTNDPTA PARHQSMYNA INWSYQQISP KQQQLLRQLA
IFQAGCTLDA IQAIVPNNNQ LDLLEQLAGL IDHSLLNMQA EAEQPQRFSM LSLIHEFAAQ
QLAEQAEFPE LAQQHLNYYV MYCESLSQQV FTARQALLSE RENIRAAINW AISTQNWVAA
SSCILPLAEF WYRYGAAEEL QTWLAWLRSQ PIDLATQARC NEMQGYIAAF LQSQYRAGQA
WYQQALAQRQ ALQQAAAIAD NLAKLGEVAM EQGHYAQALE RYRHACSMHE QLGDQASVFA
MHDCQAMVLL RQGQFGHAQQ LLQQSLDYWQ QQQILPSLAF SLNYLGMIAF YQMRLSKAQQ
AHEQALAIWQ TLDDQRGIAS ALNALAPVLL HQNQTAAALA AIKQSLQIRW SLHDYDGLAW
NLERFGEILS KVHQAELAMQ CWSKAKQLRD ELALPLFEAE QKRLQIYIRQ TKQQLTSAQV
QQLWLSGHKV ALAQLIQTLL IT
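
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. The `translate` function is illustrative and does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon and drop the trailing stop symbol."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


# First 30 bases and last 9 bases of the gene sequence.
print(translate("ATGCCAGAAG CACGATCATT TGGTCAACAA"))  # MPEARSFGQQ
print(translate("ATCACTTAA"))                         # IT
```

Both outputs agree with the start and end of the protein sequence listed above.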