Gene Information |
Locus tag | Haur_1973 |
Symbol | |
ID | 5733862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2421714 |
End bp | 2424518 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279117 |
Product | tail collar domain-containing protein |
Protein accession | YP_001544744 |
Protein GI | 159898497 |
COG category | [S] Function unknown |
COG ID | [COG4675] Microcystin-dependent protein |
TIGRFAM ID | |
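The coordinate and length fields above are mutually consistent, and a short sketch can check this. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(2421714, 2424518)
print(length, protein_length_aa(length))  # prints: 2805 934
```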
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACAA TTGATGATGG GTTAATCGTT TATTTGCAAT TAGATGCGTT AAGTGCGGGC CAAACCGTTT TAGATATTTC GGGCAACAAC AACAATGCTA TCGCCCATGG CTCACTCCAA CTCGTTTCCG ATGACGTGTT TGGCAGTGTA CTGGATTTTG ATGGCAACGG CTATCTCGAA TTGAATCCCG CTAGCATTCC GGCTGGTAAT GCGATCACTG TGAGTTTTTG GGCCTGTGGG GCAGAAAATT TGCCTGGCGC AGGCGTTTCG GCGATTGCTG CCTTGACCGC TGCTGATGAG CGTTCGCTGA ATGTGCATTT GCCATGGAGC GATGGCCAAG TTTATTTCGA TTGTGGAGCC GAAGGTAATA CTTGGGATCG TATGAACCAA GCGGCCAATG CGAGCGATTA CAAAGGCAGC TGGACCTATT GGGCATTTAC CAAAGATGTT GCCACCGGCA CGATGCAAAT TTATGCTAAT GGCACGTTGT GGGCGCAAGC CACTGGCAAA ACCTTGCCGA TTAAGGCTGT GAGCCAAGCC ACCTTTGGTG CATATAGCCC CAATACAGCG CCTTACTACG GCAAACTTTC CAGCCTACGG ATCTACAATC GCGCTTTGTC GGGCGATGAA ATTCTGCAGG TGATGGCGGC TGATCCAGCG GCGGCAGCAA CTTTCGATAG TTTGTATCCG GTTGGTTTTA ATTTGCTTGA TGATGATGGT CATGCGACGA TGTACATCGA CGATAATCCC CAAGGCTACA ACCTCTTTTT GACCATCAGC AATAACTCGC CGCGCACAAT TAATCTGGTT GCGCCGAATG ATCAAACACC CTCGGCGAGC AATCACCATT TTGTGCTGAA TTTTCGGCCT GGCACACTCT CAACCGCGAC GCTGAGCCAA GTTAAGCTGA CCGACCAAAC GTGGCAACTC AGCACAGGGC AAGCGGTTGA TGGCACGACA AGTTTGTATT TGCTGAGCAC TCAGGCACGG AGTTTGACCC CCAATCAAAG CCTGAATTTG GCCTTACAAT CAATCAGTGC CTCGGCGAGC AGCGGCGCAC GCAATACGCG AGTCGAATTA AGCTACCAGC AATTGCAATA TGCCAACAGC CTCACGCCAT TAATTGGCAG CCGTTTGCAA AACATGGTGG TGTTGAATGC TCAAGGTGCA CATACACCGC CGTTGCATGT AGGCTTTGTT GGGCCAAATA TGGTGCTAAA TAATGGCACG ACGGCGAATC AACTGACCTT ACATATTACC AACCTGCTTG ATGATGCAGC GTTGGCGCTT AGCCCAAGCA CCAGCAATAG CCCAACCACC TTCATCGTAG CCTTTGACAG CCAGGATAAC CAAAATAATC GTGATTGGGC CTTGGGCACG ACCAGCCAAG TTAATGCGAT TGCAATCACG CTTGATAATT GGACAGGTGG CAAAGATCCT GAATCGGCAG AATGGATTTT CACCACGACC CAAACCTCGC TGGCAGCAGA AGCCTTTATT CAAATGAACC TGAGCAACAT CGTTTCATCG CTGCCAAGCG GCGCGGCCAA CCTCTACTTG CTCTATGAAA ATTTGCCAGG CTATCGTGAT GGCTACTTTG TGGTAGCGAT TGAAAAAACG CCATTGATCA TTACTAGCAA TCAACGAGTT GGGGTTGGCA CGAATGCCCC AACGCGGCCA TTGGCGATTC GTGGCAGCGG CGCAGGCGAA GAAGCTATTA GTTTTGAAAA TAGCAGTGGC ACAACCAAAT GGCATATTAA CCAAAAAGTT ACGACCCAAG GTGGCTTCAA TATTGCCGAA 
ACAGGCGTGA AAGATGGGCG TTTATTTCTC CAAACAGGTG GTAATACTGG AATTGGTACT GTCACACCAC GGACTGGTTT AGATACTGGC ACTGGCGTGC TGAGTGGTGC TGCCAACGAC TATACCAAAG CCCAATTTGC CTTTACTGGC GGTGGTCGTG TAACCTGGAA CGGGATTAAT AATCGCTTGA AATGGAGCCA ACGCTTTATT GCGATTTCGA TGGAACGATC AGTTTCGTTT GCAGGTGGCC ACGTGAATGT CATTCAACCA ACCAGCGATC TTCCCGCGGC CCAAGTCTAT GACAACCAAG TGCGTTCGTG TACTGCCGAT GGAATTGTGC TCAAGGCATG GGAAGCCTTA TATGCGGTGC ACACGGTTGG GGGCAACGAA AATGCCGTCA GCTATCAGAT CACCTGTTAT ACCACAGCCT CGTTCTTTGT GCCGAGCAAT TGGCTGTTGG TGGCGGTAGT TAATGGCGAT GATGGCACAG TGCGTTTGGC TAATGGGCTT GTGCTCAATC CTGGGGCTAG CTCGCTGAAT GGTAGTTCAA TTCCATCGGG CACAATTAAT ATGTGGTCGG GGGCAGATAA TGCCCTACCT GGTGGTTGGC TCTTGTGTAA TGGCCAAAAT GGCACGCCCG ATTTGCGCAA TCGCTTTGTG GTTGGCGCTG GGGCTGCCTA CCCTGTTGGT ACAACTGGTG GTGCGGATAG CGTGACGCTA GCGGTGAATC AGATGCCTTC CCATAACCAT GCAGCCAGTA CATCAAACGA TGGTCAGCAT AACCATACCC TCTACTTCGA TACTGGTGGT GGTGGTAACG GCCCTGGTGG CGATATGGCA AAAACCAATG ATGGCTTGCA AAAAAATGTG ATTGCTAATT TCAGCGTCAA AACCGATAAG GATGGTAATC ACTCACATAG CGTTACGATC CAAAATAATG GTGGCAACCA AGCCCATGAA AATCGCCCGC CCTTCTACGC GCTTTGCTAC ATTATGAAAC AATAG
|
Protein sequence | MTTIDDGLIV YLQLDALSAG QTVLDISGNN NNAIAHGSLQ LVSDDVFGSV LDFDGNGYLE LNPASIPAGN AITVSFWACG AENLPGAGVS AIAALTAADE RSLNVHLPWS DGQVYFDCGA EGNTWDRMNQ AANASDYKGS WTYWAFTKDV ATGTMQIYAN GTLWAQATGK TLPIKAVSQA TFGAYSPNTA PYYGKLSSLR IYNRALSGDE ILQVMAADPA AAATFDSLYP VGFNLLDDDG HATMYIDDNP QGYNLFLTIS NNSPRTINLV APNDQTPSAS NHHFVLNFRP GTLSTATLSQ VKLTDQTWQL STGQAVDGTT SLYLLSTQAR SLTPNQSLNL ALQSISASAS SGARNTRVEL SYQQLQYANS LTPLIGSRLQ NMVVLNAQGA HTPPLHVGFV GPNMVLNNGT TANQLTLHIT NLLDDAALAL SPSTSNSPTT FIVAFDSQDN QNNRDWALGT TSQVNAIAIT LDNWTGGKDP ESAEWIFTTT QTSLAAEAFI QMNLSNIVSS LPSGAANLYL LYENLPGYRD GYFVVAIEKT PLIITSNQRV GVGTNAPTRP LAIRGSGAGE EAISFENSSG TTKWHINQKV TTQGGFNIAE TGVKDGRLFL QTGGNTGIGT VTPRTGLDTG TGVLSGAAND YTKAQFAFTG GGRVTWNGIN NRLKWSQRFI AISMERSVSF AGGHVNVIQP TSDLPAAQVY DNQVRSCTAD GIVLKAWEAL YAVHTVGGNE NAVSYQITCY TTASFFVPSN WLLVAVVNGD DGTVRLANGL VLNPGASSLN GSSIPSGTIN MWSGADNALP GGWLLCNGQN GTPDLRNRFV VGAGAAYPVG TTGGADSVTL AVNQMPSHNH AASTSNDGQH NHTLYFDTGG GGNGPGGDMA KTNDGLQKNV IANFSVKTDK DGNHSHSVTI QNNGGNQAHE NRPPFYALCY IMKQ
|
| |
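The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two tables differ in their permitted start codons), and it ignores the whitespace that groups the sequence into blocks of ten. The names `CODON_TABLE`, `translate`, and `gc_percent` are hypothetical helpers, not part of the source database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order (third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove whitespace and normalise to upper case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 15 bases of the gene sequence in this record.
print(translate("ATGACAACAA TTGATGATGG GTTAATCGTT"))  # prints: MTTIDDGLIV
print(translate("ATTATGAAAC AATAG"))                  # prints: IMKQ
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` can be compared with the GC content row.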