Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3739 |
Symbol | |
ID | 5735603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4699281 |
End bp | 4702013 |
Gene Length | 2733 bp |
Protein Length | 910 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280891 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001546503 |
Protein GI | 159900256 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0768606 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTAAGC TTGATGCCTC TTTTGGAATG TGGCTTAAAC AACGCCGAAA GACCCTCGAT CTGACCCAAG ACGAGTTGGC TCATTTGGTT GGCTGTGCCA CGGTCACGAT TCGTAAAATT GAAGCTAATA CGCTTAAGCC TTCAACCCAG ATTGCTCAGC GCTTAGCCCA ATGTTGTAAT GTTCCTGAAG CTGAACATAC AGCATTTGTT CATTTTGCGC GTTCGGAAAC AACCACTGGG CCAGTATGGT CTGACCAAAC GCCAGGTGCT GCACGGGTCA ATTTTGTGGT CGATCAACCA CCCCACAATT TGGTTGATTT ACCCAATCCA TTGATTGGTC GTGATGCTGA TGTTGCGACC ATCAATGAGC GTTTGGCCAA CGAGCATGTG CGTTTATTAA CTTTGGTTGG CCCGCCCGGG GTTGGCAAAA CTCGTTTAGC GTTGCAAGTA GCCCAACAAC AACTTGAGCG CTTCCGCCAT GGGGTGTTTG TCGTAGCCCT CGCTCCCGTA ACCAATCCTC AAGATGTGTT GAGCGTGATC GCGCAAACCC TAGGAATCAA AGAAACTGGC ATTCGTCGCA GCTTCGAAGA TCTCAAAAAT TTTTTGTACG ACCGTGAATT ATTATTGGTG CTGGATAATT TCGAGCAGGT GTTGCCCGCC GCTAGCTCAA TCGATCAGCT GATTCAGGCT TGTTATGGGC TGAAGGTGTT GGTCACTAGC CGCGAGGCTT TGCGCTTACG CCGTGAACGC CGTTTTGCGG TTGCGCCGTT GGCTATCGCA ACTCCCGTCA GCGATGAGCC AAGCGCCACG TTCTCGCCAG CTGTAGCACT GTTTATTGAG CGAGCACAGG CCGTCAATCC CGATTTTGAG ATTAACGAAA CCAGTCTGCA CGATATTAGT GCCGTTTGTC GCCAGCTTGA TGGCCTGCCG TTGAGTATTG AGTTGATTGC TGCCCGCAGT ATGTTGCTGG CACCCAAAGC GATGTTGCGC CATCTTGAGC ATCAATTAAC CGTCTTAACT AGCCGATCGT CTGATCATCC GCCGCGCCAA CGCACCCTGC GTGATGCAAT TCGTTGGAGT GTCGATCTGC TTGAGCCAAG CGATCAGCAG ATGTTTATGC ATGTCGGGGT TTTTCCACAA AGCTGTACGC TTGAGTCACT TGCCGCCGTC GGGGCTGAGC AAGCTTGGGC ACTCGATTTG CTCGATGGCT TAAATACCCT CGCCGATAAA AGTTTGCTCT ACGCCAAGGC TGATCAACAG GGCGAAACGC GCTTTGAGAT GTTGAATGTG TTGCGCGAAT ATGCCCGCGA GATGTTGCAA CACGCTGGTT TGTTGATCCA AGCTGCCCAA CGCCATGCCC AATATTATTT GCAGTTGGCG CAAATCCTCC AAGCCGATCT CAGTCAGCAT AACCAACATG TGACCGCTGG CGATCGTTTT GAGCGCGATT TATTCAATTT TCGGGCGGCC TTAGAATTTT TCTTTACCCA ACGCCAAATC GAACAAAATG TTCAGCTTGC TACCAGTTTG GCCGATTTGT GGTATTTGCG CGGCTATGCT GGTGAGGGTC GGCGTTGGCT GGCTCAAGCA ATTGAGCAAG CCCAAACCAC CCAAACTACG CTTGAGCCAA CCCTCTGGAT CGAGGCACTC AATGCTGCGG GCTACTTGGC CTATCATCAA GGCGATTATG GCGATGCCGC CCAAACCTTC TCGCAAAGCC GCCAGCTGAT CGAAACTGCC AATGATCAGG TTGGCATGGC GCGGGTCTAC AATAATTTGG GCTTAATTGC TCATTGGCAA GGCCAATATG CCCAATCTGA GGCTTTGCTC AACGATAGTT TGCAAATTTG GCGCAAACTT GATCTATCGA TGGCGATTAG TAGTTTGGTC TGTAATCTTG GGGCGCTTCA GCTTGATCGT GGTGAATTGA GCCAAGCTGC CGAGTTGCTT GAACAAAGCA AAGTGCTCTG TCAGCAACAG CATTATGAAA ATCGCATCTC AATGGTTCTG CAACATCAAG GTAAATTGGC GCTCTACCGT GGCGATTATG CCACCGCCCA GCGTTGTTTC AGCGAAAGTC AGGCCATCGC CGAGCAGATG AGCAATAAAA CCGTGATTGG TTTTGCCCTG ATGTACCAAG GCTATGCCAC GCTGGCCGCA GGCGAATTGG AGCAAGCCGC CTGTTATTTG AGCAAAGCTG CCCAGATGAG CCAAGATCTT GGTAGCAAGC ATATGGTGTG TATGGTGTTG GCGGTGCAGG CACAGTTGGC TGTCGAACAA GCCCAATACG TGGTAGCCCG CCAGTTATTT ACCCAAGCCT TGGAGTTAGG CCGCAGCATG CAATTTGGCA CAGGTATTGC CAATGCCTTG CGTGGTTTGG CGCTGGTCGA TGCGATTCAA GGCCACTATA GCCCAGCGTT AGAGCAAATT AACCAAGCAA TTCAAGGCTA TCGTACCATC GGCAACCCCG AAGGCTTAAT TCAATCGCTT GAAACCTTGG CATTTTGTTT GACAACCATG GGCTATGGCA AGACAATCGC GCCTGTTGTC GCTGCGCTCG AACAGTTGCG CCACGAATAT GGCTTGCGGC GCTGGAACAA TCAACAAGCT CGTTGGCAAC AGATTCAACA GGCTCTAGCA CAGCCTGATC AATCAAACTT GCCAGCCGAG CCAATTGTTG CCCAATTACT CGACCAGATG CCTAGCCTGA AACTGCGTGA GCTGCGCACA TGA
|
Protein sequence | MTKLDASFGM WLKQRRKTLD LTQDELAHLV GCATVTIRKI EANTLKPSTQ IAQRLAQCCN VPEAEHTAFV HFARSETTTG PVWSDQTPGA ARVNFVVDQP PHNLVDLPNP LIGRDADVAT INERLANEHV RLLTLVGPPG VGKTRLALQV AQQQLERFRH GVFVVALAPV TNPQDVLSVI AQTLGIKETG IRRSFEDLKN FLYDRELLLV LDNFEQVLPA ASSIDQLIQA CYGLKVLVTS REALRLRRER RFAVAPLAIA TPVSDEPSAT FSPAVALFIE RAQAVNPDFE INETSLHDIS AVCRQLDGLP LSIELIAARS MLLAPKAMLR HLEHQLTVLT SRSSDHPPRQ RTLRDAIRWS VDLLEPSDQQ MFMHVGVFPQ SCTLESLAAV GAEQAWALDL LDGLNTLADK SLLYAKADQQ GETRFEMLNV LREYAREMLQ HAGLLIQAAQ RHAQYYLQLA QILQADLSQH NQHVTAGDRF ERDLFNFRAA LEFFFTQRQI EQNVQLATSL ADLWYLRGYA GEGRRWLAQA IEQAQTTQTT LEPTLWIEAL NAAGYLAYHQ GDYGDAAQTF SQSRQLIETA NDQVGMARVY NNLGLIAHWQ GQYAQSEALL NDSLQIWRKL DLSMAISSLV CNLGALQLDR GELSQAAELL EQSKVLCQQQ HYENRISMVL QHQGKLALYR GDYATAQRCF SESQAIAEQM SNKTVIGFAL MYQGYATLAA GELEQAACYL SKAAQMSQDL GSKHMVCMVL AVQAQLAVEQ AQYVVARQLF TQALELGRSM QFGTGIANAL RGLALVDAIQ GHYSPALEQI NQAIQGYRTI GNPEGLIQSL ETLAFCLTTM GYGKTIAPVV AALEQLRHEY GLRRWNNQQA RWQQIQQALA QPDQSNLPAE PIVAQLLDQM PSLKLRELRT
|
| |