Gene Information |
Locus tag | Haur_1813 |
Symbol | |
ID | 5733671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2106410 |
End bp | 2108857 |
Gene Length | 2448 bp |
Protein Length | 815 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278956 |
Product | ankyrin repeat-containing protein |
Protein accession | YP_001544584 |
Protein GI | 159898337 |
COG category | [R] General function prediction only |
COG ID | [COG0666] FOG: Ankyrin repeat |
TIGRFAM ID | |
|
|
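The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon, and that the gene length includes the terminal stop codon, which is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 2106410
END_BP = 2108857


def gene_length(start: int, end: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2448 815
```

Under these assumptions the result agrees with the Gene Length (2448 bp) and Protein Length (815 aa) fields.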
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000177587 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCTCA ATTCATTAAA AAAACTGCTC AAACACCAAG CCTACCACCA AATACTTGAG CATTTGCAAA CTGGCCCAAC TTGGAAGCAA AAGTCGCTTG ATCATGCCCT CTATCAGCTT GCCAGCCAAC CAGCTGAAAC CCTGATCGCC ACATTACTCA GCGCTGGCGC AAACCCCAAC CAATCACTGC AAGACAACCG CTATATCAGC TTGCATCGGG CAGCAATCTA TAATCGTTAT GCGATCGCTG AGCTTTTGTT GAGCCATGGA GCCAACTGCC ATGCCCAAAG CGCTAAGCAA CAAACGCCAC TACATTTGGC TTGTCTCAAC GGCCATTTCG AGCTAGCCCA ATTGCTTTGG CATAACGGCG CTGATCTGCA TGCCCAAGCT GATTCAAACT TCACACCATG GTTATATGCA GCCAGCAGTG GCAATCTCCA ACTACTCAGG TGGTTGTTAG ACCAAGCGGT TGATATTGAT CAAACGACGG CTAATGGCAT CTCGGCCCTA ACGTTGGCAG CGTGGAATGG GCATCAAGCA GCCGTCGAAT GGTTGTTAGC CAATGACGCA GCAATCGAAG GCCCGCCAAC CAGAACTCGC ACCCCGTTGC ATGCTGCATT AAGTAAAGGC CATATGGCAA TTGCCAATTT GTTGCTTGAT CACGGGGCAG CGGCCAACGC CATAACCAGC GAGGGCAATA GCTGCGTGTG TTTGGCTGCT TGGCACAATG CTACTGATTT AGTTGATCGT TTATTGCAGC TTGGCTCACC CATTAATTGT GCGATGGCAA AACCGCATGC GACCGCCTTG CATGCTGCGG CCCAACTCAA TAACCCTGCG ATGGCCCAGC AACTTTTAGC CAACGGAGCG AAGATTAATG CACTCAATCC ACAAGGCTTG ATGCCATTTC ACACAGCGAT CAGCACTATT TGCGCGACTA AAAGCGCCGA TCTCGCGTTG ATCGAGGTTT TTCTGCGGAT GGGAGCCGAT CCTAACCAGC CAAGTTATGC AATCAAGCAG CTTGGCAAGG AAACGCAGGT GCGCGAACAT TGGCGACCCT TAGGCTATGT GTTGGCCCAA AAACGGCGTG ATTTAGCCGA GTGTTTATTG GCTTATGGTG CTGAGCTTAA TTTGCCAAGC TATGGCAAGT GGGCGATGGA GCAAACACCA CTTGAAGTTG TGGCTAGCGC CGAGGTTATT GATCAAGAGG CTGGCGAATT GTTTAGCTGG CTCCTCAGCC TCAAACCGCA AATTCCCCCA AAGCTGCTGC CAAGCATGCT TATAACGCAA AAATTCGGCT TTGCCCAACA ATTAATCGCG GCTGGAGCCG ATCTGCATGC TCCCGATGTT TTAGGCGCAG CGATTACCGC TAAAAACGAG CACTATGTTC ACGATTTTTT GGCACGAGGC GTAAATGTTG ATAGCCCATA TCGCAATTAC ACAACCGTGT TACAGCTAGC ACTGAGTTCT TATCCCCAAT TCGCGCTGCG ATTAATTGAG GCTGGAGCCG ATTTAACCCA CTTGGTTGAT GAGAACCCAA GCCTCAATTA TCCAATGCTG ATTCGCCAAC AACGCTCAAA TCGGCCAGCG ATCAGCAATT TAATGCTACT CGATTTGATT GCCGAAGCCA TGCTAGCCCA GCTTGAGCCT GCAAACCCAG CTTATCAAGC CTTGCTCGAT CAAGAGTTTG CCGAGCGCGT TTGTACAGCA AACGAAACAA CATGGCAGAT TTGGCTGGCA CGCGGAGCCA ATATCCATGC GCTGAATCAT GCTGGATTGT TGGGGTTCAG CCAGCTTTGC 
GTCCACGGCG ATTTAGCTGG CGCTCAAACA CTCTATGCCA ACGGCGCAAA TATCAATCAA ACTGACCATT TTGGCCGCAC TGCCTTGCAC TGGGCAGTTG AGCGGCAACA GCTGGCAAGC ATACAACAAT TGCTGGCTTG GGCTGCGGAT ATGCATAGTG CAACGCCCTA TGGCTACACG GCGTTGCACT ATGCCGCCTT AGCCAATCGA CTGGATCTGG TTGAATTGTT GTTGCAAGCC AAGGCTGATC CGACGGTGCA ACTCACGACG GGGCGCTTAC AAGGCTGGAC GGCATTACAC TGCGCCTATG CAGTCGATAA TCAATCATTA ATTAAATTGC TGCACCCACT AACGCCCACA ATCACGCCGC CAGAGCCAGG TTCGCAGCAT ATTCAAGGAA CCTACGACGT AACAATGGCG CATAACGGCT GGCACAAACC ACGCCCAATT AGCCAGCAAA CCCAACGTTG CCCCGCTTGC GCCGAGCACA TGCTCTACAA CACTGCTCAC AGCTTCGATG GCTCAGGCCA ACTAGCCGAT CGAATTGAGA TCTATCGCTG TGGGAATTGT CAGGCCGTAT TTTGGGAGAA TAGTATGGCT ACGTGGCGCT CACGTTTGCA GCCATGGTCA AGTTTTGTGC CGGATTAA
|
Protein sequence | MNLNSLKKLL KHQAYHQILE HLQTGPTWKQ KSLDHALYQL ASQPAETLIA TLLSAGANPN QSLQDNRYIS LHRAAIYNRY AIAELLLSHG ANCHAQSAKQ QTPLHLACLN GHFELAQLLW HNGADLHAQA DSNFTPWLYA ASSGNLQLLR WLLDQAVDID QTTANGISAL TLAAWNGHQA AVEWLLANDA AIEGPPTRTR TPLHAALSKG HMAIANLLLD HGAAANAITS EGNSCVCLAA WHNATDLVDR LLQLGSPINC AMAKPHATAL HAAAQLNNPA MAQQLLANGA KINALNPQGL MPFHTAISTI CATKSADLAL IEVFLRMGAD PNQPSYAIKQ LGKETQVREH WRPLGYVLAQ KRRDLAECLL AYGAELNLPS YGKWAMEQTP LEVVASAEVI DQEAGELFSW LLSLKPQIPP KLLPSMLITQ KFGFAQQLIA AGADLHAPDV LGAAITAKNE HYVHDFLARG VNVDSPYRNY TTVLQLALSS YPQFALRLIE AGADLTHLVD ENPSLNYPML IRQQRSNRPA ISNLMLLDLI AEAMLAQLEP ANPAYQALLD QEFAERVCTA NETTWQIWLA RGANIHALNH AGLLGFSQLC VHGDLAGAQT LYANGANINQ TDHFGRTALH WAVERQQLAS IQQLLAWAAD MHSATPYGYT ALHYAALANR LDLVELLLQA KADPTVQLTT GRLQGWTALH CAYAVDNQSL IKLLHPLTPT ITPPEPGSQH IQGTYDVTMA HNGWHKPRPI SQQTQRCPAC AEHMLYNTAH SFDGSGQLAD RIEIYRCGNC QAVFWENSMA TWRSRLQPWS SFVPD
|
| |
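The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is an illustration only, under these assumptions: the gene sequence is listed 5' to 3' in coding orientation (it starts with ATG although the gene lies on the minus strand), and the amino acid assignments of translation table 11 are the same as those of the standard code, so only alternative start codons differ. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
first_blocks = "ATGAATCTCA ATTCATTAAA AAAACTGCTC"
print(translate(first_blocks))  # MNLNSLKKLL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence, with its two lines joined, to the same function would be expected to reproduce the whole 815-residue protein if the assumptions above hold.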