Gene Haur_3739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3739 
Symbol 
ID5735603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4699281 
End bp4702013 
Gene Length2733 bp 
Protein Length910 aa 
Translation table11 
GC content51% 
IMG OID641280891 
ProductXRE family transcriptional regulator 
Protein accessionYP_001546503 
Protein GI159900256 
COG category[R] General function prediction only 
COG ID[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0768606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAAGC TTGATGCCTC TTTTGGAATG TGGCTTAAAC AACGCCGAAA GACCCTCGAT 
CTGACCCAAG ACGAGTTGGC TCATTTGGTT GGCTGTGCCA CGGTCACGAT TCGTAAAATT
GAAGCTAATA CGCTTAAGCC TTCAACCCAG ATTGCTCAGC GCTTAGCCCA ATGTTGTAAT
GTTCCTGAAG CTGAACATAC AGCATTTGTT CATTTTGCGC GTTCGGAAAC AACCACTGGG
CCAGTATGGT CTGACCAAAC GCCAGGTGCT GCACGGGTCA ATTTTGTGGT CGATCAACCA
CCCCACAATT TGGTTGATTT ACCCAATCCA TTGATTGGTC GTGATGCTGA TGTTGCGACC
ATCAATGAGC GTTTGGCCAA CGAGCATGTG CGTTTATTAA CTTTGGTTGG CCCGCCCGGG
GTTGGCAAAA CTCGTTTAGC GTTGCAAGTA GCCCAACAAC AACTTGAGCG CTTCCGCCAT
GGGGTGTTTG TCGTAGCCCT CGCTCCCGTA ACCAATCCTC AAGATGTGTT GAGCGTGATC
GCGCAAACCC TAGGAATCAA AGAAACTGGC ATTCGTCGCA GCTTCGAAGA TCTCAAAAAT
TTTTTGTACG ACCGTGAATT ATTATTGGTG CTGGATAATT TCGAGCAGGT GTTGCCCGCC
GCTAGCTCAA TCGATCAGCT GATTCAGGCT TGTTATGGGC TGAAGGTGTT GGTCACTAGC
CGCGAGGCTT TGCGCTTACG CCGTGAACGC CGTTTTGCGG TTGCGCCGTT GGCTATCGCA
ACTCCCGTCA GCGATGAGCC AAGCGCCACG TTCTCGCCAG CTGTAGCACT GTTTATTGAG
CGAGCACAGG CCGTCAATCC CGATTTTGAG ATTAACGAAA CCAGTCTGCA CGATATTAGT
GCCGTTTGTC GCCAGCTTGA TGGCCTGCCG TTGAGTATTG AGTTGATTGC TGCCCGCAGT
ATGTTGCTGG CACCCAAAGC GATGTTGCGC CATCTTGAGC ATCAATTAAC CGTCTTAACT
AGCCGATCGT CTGATCATCC GCCGCGCCAA CGCACCCTGC GTGATGCAAT TCGTTGGAGT
GTCGATCTGC TTGAGCCAAG CGATCAGCAG ATGTTTATGC ATGTCGGGGT TTTTCCACAA
AGCTGTACGC TTGAGTCACT TGCCGCCGTC GGGGCTGAGC AAGCTTGGGC ACTCGATTTG
CTCGATGGCT TAAATACCCT CGCCGATAAA AGTTTGCTCT ACGCCAAGGC TGATCAACAG
GGCGAAACGC GCTTTGAGAT GTTGAATGTG TTGCGCGAAT ATGCCCGCGA GATGTTGCAA
CACGCTGGTT TGTTGATCCA AGCTGCCCAA CGCCATGCCC AATATTATTT GCAGTTGGCG
CAAATCCTCC AAGCCGATCT CAGTCAGCAT AACCAACATG TGACCGCTGG CGATCGTTTT
GAGCGCGATT TATTCAATTT TCGGGCGGCC TTAGAATTTT TCTTTACCCA ACGCCAAATC
GAACAAAATG TTCAGCTTGC TACCAGTTTG GCCGATTTGT GGTATTTGCG CGGCTATGCT
GGTGAGGGTC GGCGTTGGCT GGCTCAAGCA ATTGAGCAAG CCCAAACCAC CCAAACTACG
CTTGAGCCAA CCCTCTGGAT CGAGGCACTC AATGCTGCGG GCTACTTGGC CTATCATCAA
GGCGATTATG GCGATGCCGC CCAAACCTTC TCGCAAAGCC GCCAGCTGAT CGAAACTGCC
AATGATCAGG TTGGCATGGC GCGGGTCTAC AATAATTTGG GCTTAATTGC TCATTGGCAA
GGCCAATATG CCCAATCTGA GGCTTTGCTC AACGATAGTT TGCAAATTTG GCGCAAACTT
GATCTATCGA TGGCGATTAG TAGTTTGGTC TGTAATCTTG GGGCGCTTCA GCTTGATCGT
GGTGAATTGA GCCAAGCTGC CGAGTTGCTT GAACAAAGCA AAGTGCTCTG TCAGCAACAG
CATTATGAAA ATCGCATCTC AATGGTTCTG CAACATCAAG GTAAATTGGC GCTCTACCGT
GGCGATTATG CCACCGCCCA GCGTTGTTTC AGCGAAAGTC AGGCCATCGC CGAGCAGATG
AGCAATAAAA CCGTGATTGG TTTTGCCCTG ATGTACCAAG GCTATGCCAC GCTGGCCGCA
GGCGAATTGG AGCAAGCCGC CTGTTATTTG AGCAAAGCTG CCCAGATGAG CCAAGATCTT
GGTAGCAAGC ATATGGTGTG TATGGTGTTG GCGGTGCAGG CACAGTTGGC TGTCGAACAA
GCCCAATACG TGGTAGCCCG CCAGTTATTT ACCCAAGCCT TGGAGTTAGG CCGCAGCATG
CAATTTGGCA CAGGTATTGC CAATGCCTTG CGTGGTTTGG CGCTGGTCGA TGCGATTCAA
GGCCACTATA GCCCAGCGTT AGAGCAAATT AACCAAGCAA TTCAAGGCTA TCGTACCATC
GGCAACCCCG AAGGCTTAAT TCAATCGCTT GAAACCTTGG CATTTTGTTT GACAACCATG
GGCTATGGCA AGACAATCGC GCCTGTTGTC GCTGCGCTCG AACAGTTGCG CCACGAATAT
GGCTTGCGGC GCTGGAACAA TCAACAAGCT CGTTGGCAAC AGATTCAACA GGCTCTAGCA
CAGCCTGATC AATCAAACTT GCCAGCCGAG CCAATTGTTG CCCAATTACT CGACCAGATG
CCTAGCCTGA AACTGCGTGA GCTGCGCACA TGA
 
Protein sequence
MTKLDASFGM WLKQRRKTLD LTQDELAHLV GCATVTIRKI EANTLKPSTQ IAQRLAQCCN 
VPEAEHTAFV HFARSETTTG PVWSDQTPGA ARVNFVVDQP PHNLVDLPNP LIGRDADVAT
INERLANEHV RLLTLVGPPG VGKTRLALQV AQQQLERFRH GVFVVALAPV TNPQDVLSVI
AQTLGIKETG IRRSFEDLKN FLYDRELLLV LDNFEQVLPA ASSIDQLIQA CYGLKVLVTS
REALRLRRER RFAVAPLAIA TPVSDEPSAT FSPAVALFIE RAQAVNPDFE INETSLHDIS
AVCRQLDGLP LSIELIAARS MLLAPKAMLR HLEHQLTVLT SRSSDHPPRQ RTLRDAIRWS
VDLLEPSDQQ MFMHVGVFPQ SCTLESLAAV GAEQAWALDL LDGLNTLADK SLLYAKADQQ
GETRFEMLNV LREYAREMLQ HAGLLIQAAQ RHAQYYLQLA QILQADLSQH NQHVTAGDRF
ERDLFNFRAA LEFFFTQRQI EQNVQLATSL ADLWYLRGYA GEGRRWLAQA IEQAQTTQTT
LEPTLWIEAL NAAGYLAYHQ GDYGDAAQTF SQSRQLIETA NDQVGMARVY NNLGLIAHWQ
GQYAQSEALL NDSLQIWRKL DLSMAISSLV CNLGALQLDR GELSQAAELL EQSKVLCQQQ
HYENRISMVL QHQGKLALYR GDYATAQRCF SESQAIAEQM SNKTVIGFAL MYQGYATLAA
GELEQAACYL SKAAQMSQDL GSKHMVCMVL AVQAQLAVEQ AQYVVARQLF TQALELGRSM
QFGTGIANAL RGLALVDAIQ GHYSPALEQI NQAIQGYRTI GNPEGLIQSL ETLAFCLTTM
GYGKTIAPVV AALEQLRHEY GLRRWNNQQA RWQQIQQALA QPDQSNLPAE PIVAQLLDQM
PSLKLRELRT