Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3806 |
Symbol | |
ID | 5735670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4777512 |
End bp | 4778783 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280958 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001546570 |
Protein GI | 159900323 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0119583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAGG TAGTACGTGT TTTTCATCAA CTCTTGCCCG ATTTTGTTGA TCATGTGTCC GAGCAAATTG TTGCTAACCA TGTGCCTGTG TATGCCAGCT TGCCGCAGCA GCATGTCAAA ATGGCGCTCT ACAATGCCAT TCATTCAATC GAGATCGATT TAGCCCAAGG CACAACCTCA ACCTATGCCG ATTATTGGCG TGAGGTGGCG GTGCAACGTG CCCAACAAGG CATTTCACCA GTTCACAGCA TGCTCGTTAC CCATCTTTCA ACCAATGTGA TGACCCAATT TTTGAAGCAA GCCTTGGATC GTGAGCCAGA AGCCTTGGCC TGGTGGCTCG AACGTACCCA CACGATTATT TCGCTGGGCA TGTTGGTAAT GACCGAGGCA CGAATTAATG CCTTGCAACA ACTTGGGCAG CTAGGAAGCG AGCCAGCCAG CAGCGGGATT ATCATCCACG AACAGCCCAA TTTGATTTTG CCACCTAGCC GCTGGCAAAC CCCACAAGTG TTGCATTTGC AAGCCTTTGG CCAAATGCGG GCCTGGCGTG GCTCAAGCGA AGTGCTCAAT TGGGGTCGTA AATCGGCAAT CGCCTTGCTT GGGATTTTAA TTACCCAACG CGGCCAATGG ATTCAGCGTG AGCAAATTTG CGATCTGTTC TGGCCTGATT TAGCCCCAAA TCAAGCCGAA GCCCACTTTA AAGTTGCCCT GAATGCGCTA ACCGCTGTGT TGGAGCCAGA ACGCCCGACG CGCCAAGCCT CAAGCTATAT TCAACGCCGT AATACTGCCT ATCGTTTGGC GTTTGATACT GCGCCAATTC AGCTGGATGT GCTACGCTTT GAGCAATTGC TGCAACGTGC CAACCACGCT AGCAATCCAC TAGAAGCCAT TAATTACTAT CGCCAAGCAC TCAATTTGTA TGCTGGCGAT TTTTTGGGCG ATTGTTTGTA TAGCGATTGG GCGAATGCTG TGCGTGAGCA ATTGCGCCAT CATTTTGTGC AAGCTGCCTG CGAATTAGCC CAATTATTGT TGGCTGAACA GCAACCAACC GAAGCCTTGG AGTGGGCCGA AGCCGCTTTA CAAGCTGATC CCTATCAAGA AAACGCCTAT CAAGCTAGCT TTATGGCCTA TGCCCAACTT GGCAATCGGG TGCAATTGCA ACGCAGCTAT CAACGTTGCC AACAAGTGCT TGAGCACGAT TTGGGCTTAG CTCCAATGCC CACCACCAAA GCCGCCTACC AACGAGCCGA ACAAACGCTG CATCAACTGT AA
|
Protein sequence | MQQVVRVFHQ LLPDFVDHVS EQIVANHVPV YASLPQQHVK MALYNAIHSI EIDLAQGTTS TYADYWREVA VQRAQQGISP VHSMLVTHLS TNVMTQFLKQ ALDREPEALA WWLERTHTII SLGMLVMTEA RINALQQLGQ LGSEPASSGI IIHEQPNLIL PPSRWQTPQV LHLQAFGQMR AWRGSSEVLN WGRKSAIALL GILITQRGQW IQREQICDLF WPDLAPNQAE AHFKVALNAL TAVLEPERPT RQASSYIQRR NTAYRLAFDT APIQLDVLRF EQLLQRANHA SNPLEAINYY RQALNLYAGD FLGDCLYSDW ANAVREQLRH HFVQAACELA QLLLAEQQPT EALEWAEAAL QADPYQENAY QASFMAYAQL GNRVQLQRSY QRCQQVLEHD LGLAPMPTTK AAYQRAEQTL HQL
|
| |