Gene Information |
Locus tag | Haur_2123 |
Symbol | |
ID | 5734011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2666628 |
End bp | 2667872 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641279264 |
Product | two component AraC family transcriptional regulator |
Protein accession | YP_001544891 |
Protein GI | 159898644 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.331776 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTACA AAGTCTTTTT AGTCGAGGAC GAGATCATCG CTCGTGAAGG GATTCGCGAT GCGATTGACT GGGCAGCGGC AGGCTATCAG TTTTGCGGCG AGGCATCGGA TGGCGAAATC GCACTGCCGC TCATCCGTGA GCGACGACCC GATATCCTTA TCACCGATAT TAAGATGCCG TTTATGGATG GCTTACAGCT GTGTCGAATT GTGAAGGAAA CGCTGCCAAC CACGAAGATT ATAATCCTTA GCGGTCATGA TGAGTTTCGC TATGCCCAAG AAGCAATGCA AATCGGGGTT ACGCAGTACT TGCTGAAGCC GATTGTTGCC CAAGATCTGC TGGCGGCATT GCGCAAGATC GCCAGCCAGA TCGATGGGGA GCGTCAAGCC AAGGCACAAT TGGAGACGCT CCAAGCGCAG ATGTTCGATC ACCAACCAAT GTTGCGTGAA CGCTGCCTGC TTGATCTGGT CTCTGGCAGT AGCTCGGCAG CCGATTTCAT GGAGCAAGCC CGCAACCTTG AAATCGACCT GCTGGCACCA TGGTATCAGG TGTTGGTGAT GCACGCCATG CCACCGAGCG CCGCTACAGC GCCGCTGTAT ACGCTCTATC AGCAGGTCGA TGTGACTGTC GCTGCCAGCT TAAACCAATC ACCGTTGGTC GTGGCCTTTA AGCATGGCCT CGAAGATACC ATCTTGATCG TCAGGGGCGA GACTCGCGCT GATATGACCC AGCAGGCGGA GCGACTAGCC ACTGCAATGC GCCAGCGCGT GGCTGAGCAG CTTGGCTGTC GCGCGATTAT TGGGATCGGC GACCCCACCG AGCGACTCAG CCTGATCCCC CAATCGTTTG CTGAGGCATT GGCGCAGATC AGTAGCTTCG AGCGCCCAGC GGAGTCTGAT CCATCCGATC AGGGACAATT CCACGGCGGG GCGATTATGC TGAAAGCCCT CGCCTATATC GATACCAATT ATGCCGATCC TGCGATGTCG TTGGGCCAAG CGGCGGCCCA TGTGTTGCTT AGCCCGACCT ATTTTAGTGC GCTCTTCCGT CGCGAGGTTG GCGAGACCTT TATCGACTAC CTGACCCAAG TTCGCATTCG CAAAGCCATC GAACTGCTAC GCTCGACTTC CCTAACGGCC AGCGAGATCG CTTATCGTAT TGGCTATCAG AACCCGCGCT ACTTCTACTC GGTGTTTCGC AAGGTTGTCG GTCAGCCACC CAACGAGTTC CGTCAGCGCT TCTAG
|
Protein sequence | MTYKVFLVED EIIAREGIRD AIDWAAAGYQ FCGEASDGEI ALPLIRERRP DILITDIKMP FMDGLQLCRI VKETLPTTKI IILSGHDEFR YAQEAMQIGV TQYLLKPIVA QDLLAALRKI ASQIDGERQA KAQLETLQAQ MFDHQPMLRE RCLLDLVSGS SSAADFMEQA RNLEIDLLAP WYQVLVMHAM PPSAATAPLY TLYQQVDVTV AASLNQSPLV VAFKHGLEDT ILIVRGETRA DMTQQAERLA TAMRQRVAEQ LGCRAIIGIG DPTERLSLIP QSFAEALAQI SSFERPAESD PSDQGQFHGG AIMLKALAYI DTNYADPAMS LGQAAAHVLL SPTYFSALFR REVGETFIDY LTQVRIRKAI ELLRSTSLTA SEIAYRIGYQ NPRYFYSVFR KVVGQPPNEF RQRF
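The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The function names (gene_length, protein_length, clean_sequence, gc_fraction) are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected residue count for an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and ends in one stop codon,
    which does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits the sequence into 10-character blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Values taken from the record above.
length = gene_length(2666628, 2667872)
print(length)                  # 1245, matches "Gene Length"
print(protein_length(length))  # 414, matches "Protein Length"
```

To compare against the "GC content" field, pass the "Gene sequence" value through clean_sequence and then gc_fraction; the record reports the figure rounded to a whole percent.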