Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1099 |
Symbol | |
ID | 5732990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1259478 |
End bp | 1260557 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641278237 |
Product | ArsR family transcriptional regulator |
Protein accession | YP_001543875 |
Protein GI | 159897628 |
COG category | [K] Transcription |
COG ID | [COG0640] Predicted transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0967799 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAAC TCGTTCGCGC TCAACCGGCA TTTAAAGTTG ATTTTGTCCC ATCACTTGGG TTGGATCTGC TTTCGACAAT GGGTCTGATT GGAATTGTCC ACGATTTTGA AGGCTTGGAT GCATGGCTGG TTGAGGCTGC TGCGAGTGTG CCCCCGCGCT TACGCCACGA TATTCAGCTG GCAATGCGCA TGGGCGTTTA TCCCTATGTG GTGGTCGAAA CCGTCTCTGA CCAAATTTTA CAGCCTGGTG CTGCTGGCCA CGACGATTTT AATGGCTTGA TTGAAGACCT CAAAGCGCTT TCACCGCAAG AATGTGCCGC GATGGTACAT AAAATCGTGC AACGCACCGC CGCCAACGCT GATGTTGAAT TATTGCACAC GCCAGCCGAA ATTATTGCCG ATCAAGAGCA ATTAGAAGAA TTATTGGCTA AAATGCAGTT TCCGGTCGAT ACCGATGAGC TGATTGAATT ATTGCAACAG CCAACCGAAT GGCGCGATTT ATTGGTCTCA ACGATTCAGC GCTTTTGGGA CCGGATTTAT CGTGAGCAAT ATGAACTGCA ACAAGCCCGC CGCGAACGTA ATGCCCATTA TCATCGCACA CATCAATATA GCGTCAACTT CCGCGATTTA TTTGCTGGAG TAACTGGCCG CCGCTTGCCC GACCATATTC ATGAACGACT TGGCACGATT AGCACTGTAC GTTTTGTTCC ATCGCAATAT ATTGGGCCAT ACTTGTCGTT TCTTTTCAAT GGATCATTAC TCACGGTGTT TTATAATAGC AGCACCACAC CAGCTGAAGG CGATGAGCAA ACTGAACGCA CGCAAAGCCT GTATCAGCCA TTAGCAGCCT TGGCCGATAA AACGCGCTTG CAAATTATGA CGTTGTTGCA TGGCCGCGAA TTGTATGCCC AAGAAATTGT CAATTTGCTC GATATTCATC AATCGGCGGT TTCACGCCAT TTGAAGCTGA TGGAAACTTC AGGTGTGCTG AATGTTCGCC GCGACAAGGG TGCAAAATAT TATTCGATCA ATCGCCAACG GATTGAAGAA ATTTCGGCTC GCCTACGCGA ATTTGTCTAA
|
Protein sequence | MTELVRAQPA FKVDFVPSLG LDLLSTMGLI GIVHDFEGLD AWLVEAAASV PPRLRHDIQL AMRMGVYPYV VVETVSDQIL QPGAAGHDDF NGLIEDLKAL SPQECAAMVH KIVQRTAANA DVELLHTPAE IIADQEQLEE LLAKMQFPVD TDELIELLQQ PTEWRDLLVS TIQRFWDRIY REQYELQQAR RERNAHYHRT HQYSVNFRDL FAGVTGRRLP DHIHERLGTI STVRFVPSQY IGPYLSFLFN GSLLTVFYNS STTPAEGDEQ TERTQSLYQP LAALADKTRL QIMTLLHGRE LYAQEIVNLL DIHQSAVSRH LKLMETSGVL NVRRDKGAKY YSINRQRIEE ISARLREFV
|
| |