Gene Haur_2207 details

Gene Information

Locus tag: Haur_2207
Symbol:
ID: 5734094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2804459
End bp: 2807164
Gene Length: 2706 bp
Protein Length: 901 aa
Translation table: 11
GC content: 51%
IMG OID: 641279348
Product: XRE family transcriptional regulator
Protein accession: YP_001544975
Protein GI: 159898728
COG category: [R] General function prediction only
COG ID: [COG3903] Predicted ATPase
TIGRFAM ID:
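
The coordinates and lengths in this record are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 2804459
END_BP = 2807164


def gene_length(start, end):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(length_bp):
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2706
print(protein_length(length_bp))  # 901
```

Both results match the Gene Length and Protein Length fields of the record.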


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGATC GGCCAATCTT CCCTGAATGG ATCAAACAGG CTCGTAAGCA ATTGGGCCTT 
GGCAGAGCCG AGTTAGCCCA CCAAGTTGGC TGTTCGTTGG CTATGTTACG TCAGCTTGAG
TATGGTACGC GCCGCCCTTC GCGCCAATTG GCCGAACGGC TCGCCAGCTT TTTGCAGATT
CCAGCCGAAC AGCAATCACT CTTTGTCCAA ACTGCGCGGG TGCATCAACC ACCAAGCCCA
ATCAACCCCG TTCCACTCGT GAAACGCCTG CCTGAGCCAA CCCAACCCTT AATTGGTCGC
GAGGAATTAT TGGCCCATGC CAGTCAGCTA TTGCTCCAGT CAAGCACCCG TTTGCTGAGC
ATTGTCGGCC CAGGTGGTGT TGGTAAAACC CATTTTGCAA GCCAGCTTGC CCAGCAGATT
CAAGCACATT TCAGTGATGG GAGCTTTTTT ATTGGCTTAG CTTCGTTGCA CGATGCCGAA
CAGCTACCAA CTCTGATCGC CCAAACACTT GAAATTCCCC AACCAACTCA TCAATCAACC
TTAGAGCAAT TGATCGGCGC AATTGACCAA CGATCGATCT TGCTGATGCT CGATAATCTT
GAGCATGTGA TGATGGTTGT GCCAATTATC AGTCAATTGA TCACCCAATG TAACAAGCTC
AAAATGCTGA TAACCAGTCG TTTTGCCTTG AAATTACACG ATGAACACCT GATTGATTTG
CCACCGCTCG ATGTACCACA ACACCCACCT GAGCACCAAG CTAGCGCCGA AACCAACTAT
TCCGCCGTGG AATTATTTGA GCTTCGCGCC AAAATGGTGC AACCTCAATT TAGCCTAACA
GCCCAAAATC GCGCGATCGT TGGCGAAATT TGCCGGCGAT TAGATGGCTT ACCCTTAGCA
ATCGAGCTTG CTGCGGCGCG AATTCGTGGC CTACCACCCC AAGCAATGCT CGCGAGGCTC
GATCGCCGGC TCGAATTGTT CGATCAAGGC AATAGCGATT TGCCTGAGCG GCATCAAACC
TTGCGCAATT TAATTGCCTG GAGCTACACG CTGCTCACGC CCAACGAGCA AACTATTTTT
CGCACGCTCA GCCTGTTTGC CAACCATTGG ACATTAGGTG CCGCCGAATA TCTGTGTCAA
GCTCAAATTG CCAAACCCCA AGTTCTCACA ATCTTGCTCA ATTTGCTTGA TAAAAGTTTG
GTGCGCGAAG AAAGCTCAAA TGATGGCATC GCCACCTTTG CCATGCTCGA AATTATTCGC
GAATATGGGC TAGAGCAGCT TGAACAAACC AGCGAGGCTC ACGCATTGCG CTGGCGACAT
GCCCATTATT ACATCCAACT GGCCCAACAC GCCGAGCAAC AACTCGGCCA ACAAGAAGCA
TTATGGCTTG AGCGCTTGAC GTGGGAGCGT AGCAACCTTT GGGATGCGCT CAATTGGCTT
GTGGCGCAGC AAGCAGCCGA AGCATTATTG CAGCTGATTG GTTCGCTCTG GAAATTTTGG
CAAATTCGGC ATCTCTGGCG TGAAGGTTTG CACTGGATTG AGTTGGCTTT GGCCTTACCG
CTGGTCAATA GCGAAGCTTA TCAACAAGCC CGTGCCAAAG TGCTTTGGGG TGCTGGTTGG
TTGGCAGTCG ATTTACATCA GCATGATTTG GCACAAGCCA TGTTCGAGCA AAGCCTGCAC
CTCGCCACCA GCCTCGACGA TCAACAGGGA ATTGCCCGAG CGTTGCATGG GGTTGGCTTA
TTGGCCGAGT GGGCGGGGCA GCGCGATTTT GCCATGCGAG CCTATCAAGA AAGTTTAAGT
CTCTTCCGCC AACTCGACGA TCAAGAGGAG ATTGCTTGGT CGTTGTTCCA TCTGGGAGCA
GCCTTACAAT CACAGGGTCA AGCCAAACAA GCGCGGCATT TTTTAGAAGC AGCCTTAGCG
ACCTCACGCC GCTTGGCCCA TTCGTGGAGC ATCGTGCATC AAACCAAAGC ACTCGGCCAA
ATGGCGATTG ATCAAGGGCG GTATGCTGAT GCCGAGTTAT TGCTCAACGA AGGTCTCAAG
CTCGCTCAAA ACCATCAATA TCAACAAATT TATCTTGAGA TTTTGCGGCA TCTTGGGCGT
TCGGCATTAG AGCAAGGCCA CTATCAACAG GCCCGTGAAC GGTTTCAAGC CAGCCTTGAG
CAAGCCCAAC TACTTAAAGA ACCCTCGGCG ATTCGTTGGG CCAGCATTCA CCTCAATTGG
CTGGGTATTC TCGAAGGCGA TTTGACCCAA GCCTATGCGT TTGAGCAACA ATTGGCAATC
TTCGAACGCG ACGAGCCTGC TTGGGCAATT GCCTGGCTCA AGGCGCGTTT GGGCACAATT
GCTTTACTCA AATGCCAGCC TGAAGTAGCC CAAAGCTGGT TTCTGGCTAG TATTCAAATG
TACCAAGCCA ACGATTTGCC ATGGGGCTTG GTCGAATGTC TTGAAGGCTT GGCACATAGC
CTATGGTTGA GCAAGCAAGC CCATCAAGCC GCAGCCTGCT TGCAATTGCT GAGCGCTGCC
GAACAACAAC GCCAAAGCTT ACCACGCCTG CGCTCCGTAC CGGAACAAGC AGCATGGCAA
ACCAGCCTTG CATGGTGTCA ACAACAACTT AGCCCTGAGC AATTTGCCCA CATTTGGCAA
ACTGGGGCGA CCCAAACGCT TGAACAATTG CTCAAGCCTT TTAATCACGA AACTCAATCA
GCTTGA
 
Protein sequence
MDDRPIFPEW IKQARKQLGL GRAELAHQVG CSLAMLRQLE YGTRRPSRQL AERLASFLQI 
PAEQQSLFVQ TARVHQPPSP INPVPLVKRL PEPTQPLIGR EELLAHASQL LLQSSTRLLS
IVGPGGVGKT HFASQLAQQI QAHFSDGSFF IGLASLHDAE QLPTLIAQTL EIPQPTHQST
LEQLIGAIDQ RSILLMLDNL EHVMMVVPII SQLITQCNKL KMLITSRFAL KLHDEHLIDL
PPLDVPQHPP EHQASAETNY SAVELFELRA KMVQPQFSLT AQNRAIVGEI CRRLDGLPLA
IELAAARIRG LPPQAMLARL DRRLELFDQG NSDLPERHQT LRNLIAWSYT LLTPNEQTIF
RTLSLFANHW TLGAAEYLCQ AQIAKPQVLT ILLNLLDKSL VREESSNDGI ATFAMLEIIR
EYGLEQLEQT SEAHALRWRH AHYYIQLAQH AEQQLGQQEA LWLERLTWER SNLWDALNWL
VAQQAAEALL QLIGSLWKFW QIRHLWREGL HWIELALALP LVNSEAYQQA RAKVLWGAGW
LAVDLHQHDL AQAMFEQSLH LATSLDDQQG IARALHGVGL LAEWAGQRDF AMRAYQESLS
LFRQLDDQEE IAWSLFHLGA ALQSQGQAKQ ARHFLEAALA TSRRLAHSWS IVHQTKALGQ
MAIDQGRYAD AELLLNEGLK LAQNHQYQQI YLEILRHLGR SALEQGHYQQ ARERFQASLE
QAQLLKEPSA IRWASIHLNW LGILEGDLTQ AYAFEQQLAI FERDEPAWAI AWLKARLGTI
ALLKCQPEVA QSWFLASIQM YQANDLPWGL VECLEGLAHS LWLSKQAHQA AACLQLLSAA
EQQRQSLPRL RSVPEQAAWQ TSLAWCQQQL SPEQFAHIWQ TGATQTLEQL LKPFNHETQS
A
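
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the sequence is pasted with its space-separated 10-base blocks, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ only in which codons may act as starts, which is not handled here). The names `CODON_TABLE`, `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence and the final block, as listed above.
print(translate("ATGGATGATC GGCCAATCTT CCCTGAATGG"))  # MDDRPIFPEW
print(translate("GCTTGA"))                            # A
```

The two outputs correspond to the first ten residues and the last residue of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the complete protein sequence and the GC content field.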