Gene Information |
Locus tag | Haur_1855 |
Symbol | |
ID | 5733744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2156362 |
End bp | 2158857 |
Gene Length | 2496 bp |
Protein Length | 831 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278999 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001544626 |
Protein GI | 159898379 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
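The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal consistency check, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2156362
end_bp = 2158857
gene_length_bp = 2496
protein_length_aa = 831

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is read in codons of 3 bp.
codon_count, leftover_bp = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and contributes no residue.
expected_protein_length = codon_count - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", leftover_bp == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 2,496 bp is 832 codons, which is 831 residues plus one stop codon.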
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAGC GCCAAGCGTT TGCTGCATGG CTCCGTCATG TACGCCATGA ATTACAATAT AGTCAAGACC AATTTGCCGA ACAACTTAAC TATGCCACGG TGACCTACCG CAAGGTTGAG CGTGGTTTAG CACCATCGGC AGCATTTTTA GAGCGGTTGG CCATGGTGCT TGAGTTACCA GCCAGCGATA TGCGGATCTT GCACGACTTC GCCAACTCCG ACACCGCGCT GCAAGCACTC TCGTTGCCAG ATCATCTGGA GCATCTTCAG CCGCAGGTAG CCCAGGCCAA GCCAGCAATT GCCGCTACGC CGCATACCTT GCCAATGTTG CCCTACCCGT TAATTGGCCG TGAACGCGAG GTTGAAACGC TTACCAAGTT GTTACAGCAC CCGCAACATC GCTTGATTAC CGTGATCGGC CCGCCTGGGG TTGGCAAAAC GCGGGTCGCT CAGGCCGTTG GTTGGGCCAG CCTCGGCCAT TTTTGCGATG GAATTTGGTA TGTTGAAGGC ATTCAATGCA CCACAATTGC CGATTTTTGG GTCGATATTG CCAATATGCT CGGGCGTTCG GCCAATAGCT CCATGACCTT AATCGAGCAA ATTAGTGCAT TAATCGGCCA AAAAAATAGC TTGCTGATTT TAGATAATTG CGAGCATTTG AGCGAAATTA ATCTTGGTTT AGCCCAGTTA TTGGCGCAAT GCAGCGGCTT GAAAATCTTG GTAACCTCAC GGACAAGCCT AAAACTCCGG ATCGAACATC TTTTTTGGCT GCATCCTTTT CCTACGCCCG ACCCGCAAAG CAGCAATCTG AGTGCGATTT GGCAGAACCC AGCGGTACAA CTTTTTTGCC AGCGTGCCCA AGCTAGTAAC CATGAGTGGC AAATCAACGA TAGCCAAGCA GCAACCATCG CCCAAATTTG CCAACATCTT GATGGCTTGC CCTTGGTGAT CGAATTGGCG GCAGTACGCA CCCAATTTGT TACGCCAACC ACGCTACTGG CACGGTTAAG CAATCGATTA GGCATCTTAA CCAACACCAT GCGCGATGCT CCGGCGCATC AAAGCACCCT GCGACGCACA CTTGAATGGA GCTATCAACT GCTTGATAGC AACGAACAAC AGATCTTTGC GCGGCTCAGT GTTTTTGCAA CCGATAGCGA TTTCGAGGCG ATTGTGGCGG TTTGCGCCGA TTTGGCACCA TCGAACGATG ATATTTTTGA TTGTATGGCC AGCCTCGTTG CCAAAAGTCT GGTTATTCAT CGACCCGACC CTCAAGGCAA TTCACGCTTT GGCATGTTGG CGACAATTCG TGAGTTTGCG GCGAGTTTGT TAGCTGAGCA ACAACAAACG CATCATTATA CCCAACGCTA TATTAATTAT TACATTGAAC TCGCCGAAAA AATTGATCGC GAACTGCGCG GCAAAGAGCA AATTCAGCTG CTCGAGCAGC TTGAGTCGGA GTTTCATCAT TGGCAAGCGG TTTTACGTTT ATGCCTCAAT CAACAACAGT ATCATGGCTT TTTACGGCTG TTTGCAGCAC TGAGCCAATT TTGGTATGGT CATGGCCATT TTATGGAGGC TTGGCAATGG CTCAGCGCAG TCGATCAAGC GTTAAATCAC GTCGATTCGC CCATTATTCA AGCGCGGGCG GCTCTAGGCG CAGGCATTGT AACCAATATT CATCATTGCC TTGATCTCCC GTTGGGCTAT CTCGAACGCG CCCTCGATTT ATGTCAACAA TTGAATGATC AACAGGGGAT TGCCACATGT TACTTGTTGC TTGGGTTGAT TATGATGCGC 
AAACATCAAT ATGTGCAAGC AACCCGCTGG CTCAACCAAA GCCTTAATTA TTTTGAGTCG AGCGTTGAGT ATTGGCTTTT GAGCATTAAT CATTTGCTGC TGGCTCAATT AAATATCTAT CTCAACGATC TTGATCAAGC TAGCCGCTAC CTTGATTTGG TGGGGCATTC GCCACAACTC CGGCTTGATC CCTTCCGATC ATCGTGGTAT CAATCATTAC AAGGCCATGT TGCCTTCTAC AAACGCTGCT ACACCGAAGC ACTGACGTGG CATCAACAGA GTTTGGTCGA GCGCCAGCAG CTCGGCATCA AAGGTGATAT TGCGGTTTCT TGGCTGCGGA TTGCTCAAAC CGAGCGAGCT TTAGGCCACT ATCAACCAAC CCGCAACGCC CTCGAACAAA GCCTCAAGCT CTGGCAAATG CACGACAATC AAGAAAACGT CTTGCATTGT TTAGAAGAAT TTGCCGCGCT ACTGGCCTAT GATCAACAGC ATCAGACCGC CACCTATCTG CTGAGCTATG CATGGTTTCA ACGTGAACAA CGTCAATTGC CCCATCCACC AATCGATCAG GCGCGATCGC AGCAATTTGG CATGTGGCTG CAAAACCAAC AACCAAGCAA TGTCTGGCGC GAAGCTTGGA GTTACGGCCA AACACTCAAA CTTGATCAAG TGATTGGCTT TGTGCTCGCG GGGTAG
|
Protein sequence | MKQRQAFAAW LRHVRHELQY SQDQFAEQLN YATVTYRKVE RGLAPSAAFL ERLAMVLELP ASDMRILHDF ANSDTALQAL SLPDHLEHLQ PQVAQAKPAI AATPHTLPML PYPLIGRERE VETLTKLLQH PQHRLITVIG PPGVGKTRVA QAVGWASLGH FCDGIWYVEG IQCTTIADFW VDIANMLGRS ANSSMTLIEQ ISALIGQKNS LLILDNCEHL SEINLGLAQL LAQCSGLKIL VTSRTSLKLR IEHLFWLHPF PTPDPQSSNL SAIWQNPAVQ LFCQRAQASN HEWQINDSQA ATIAQICQHL DGLPLVIELA AVRTQFVTPT TLLARLSNRL GILTNTMRDA PAHQSTLRRT LEWSYQLLDS NEQQIFARLS VFATDSDFEA IVAVCADLAP SNDDIFDCMA SLVAKSLVIH RPDPQGNSRF GMLATIREFA ASLLAEQQQT HHYTQRYINY YIELAEKIDR ELRGKEQIQL LEQLESEFHH WQAVLRLCLN QQQYHGFLRL FAALSQFWYG HGHFMEAWQW LSAVDQALNH VDSPIIQARA ALGAGIVTNI HHCLDLPLGY LERALDLCQQ LNDQQGIATC YLLLGLIMMR KHQYVQATRW LNQSLNYFES SVEYWLLSIN HLLLAQLNIY LNDLDQASRY LDLVGHSPQL RLDPFRSSWY QSLQGHVAFY KRCYTEALTW HQQSLVERQQ LGIKGDIAVS WLRIAQTERA LGHYQPTRNA LEQSLKLWQM HDNQENVLHC LEEFAALLAY DQQHQTATYL LSYAWFQREQ RQLPHPPIDQ ARSQQFGMWL QNQQPSNVWR EAWSYGQTLK LDQVIGFVLA G
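The relationship between the gene sequence and the protein sequence can be spot-checked by translating a short excerpt. The sketch below is illustrative rather than the database's own method. It builds the codon-to-residue mapping in the usual TCAG order, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). Only the first three 10-base blocks and the final six bases of the gene sequence are used, and the helper name `translate` is hypothetical. The gene sequence as listed begins with ATG, so it is treated here as already being in coding orientation despite the minus-strand annotation.

```python
from itertools import product

# Codon table in TCAG order; residues follow the standard layout,
# which table 11 shares for codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Whitespace (such as the 10-base block spacing used in the record)
    is removed first, and any trailing partial codon is ignored.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Excerpts copied from the Gene sequence field above.
first_blocks = "ATGAAGCAGC GCCAAGCGTT TGCTGCATGG"
last_six_bases = "GGGTAG"

print(translate(first_blocks))    # MKQRQAFAAW
print(translate(last_six_bases))  # G*
```

The first output matches the opening block of the protein sequence, and the second shows the final glycine followed by the TAG stop codon.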