Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_0033 |
Symbol | |
ID | 9337816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 28730 |
End bp | 31201 |
Gene Length | 2472 bp |
Protein Length | 823 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | |
Product | heterocyst differentiation protein HetF |
Protein accession | YP_003719813 |
Protein GI | 298489636 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCCAGG AATTTCACAT TTCCGTAACC CCAGTAGGGC AGAATGACTA CTTGGTGCGG ACGGAACAAG TCGCACCTGG GGTGCCTTTG GCAGAAGAAC TGGTAACTTG GCCTGTGGCT GAGTGGTTAG CAGCAGCAGG ACATTTGATG AATGACCCTT TACAGTTGGT GCTACAGGGG GATGCGATTG CCAGAAACTC TGTAAACTTG GTGGCACTGG GTCAGCAATT ATACAATGCA CTGTTTCAAG GCACTCTGAG AGATAGTTGG ATTACTGCCC AAGGTATTGC CCAGAATCAG CAACAGGTAT TGCGCTTACG TTTGGGACTC AAAGATACTA AGTTAGCGCG GTTGCCGTGG GAAGTGATGC ACGCAGGCGA TCGCCCTTTG GCAACGGGTC CTTATATCGC TTTTTCCCGC TACCAAAGTG GAATTCCTTC CACATCAAGA CTACCAACTC GCAATCTTCC TACACAACAG GAAGATTACG GGGTAAAAGT GTTGATGATC ATCTCTTCCC CTACTGATCA AGTGCATCTT GATTTGCTCA AACAAGAAGC TATTAACCTG CAAACAGAAC TACATCGCCA AAATCCCCGT GGAGAAAATG ATCATTATCT GCCAGAAATT GACCTGACAG TTCTTGATCA ACCAGGAAGG GAAGAGTTAA CCCAAGCTTT AGAACAAGGA CGATATCAAG TTCTTCATTA CTCTGGTCAT AGTAATGTGG GTGCGAACGG GGGAGAAATT TATTTAGTTA GTAGAAGAAC TGGTTTAACA GAAACGCTTA GCGGTGATGA CTTAGCAGGG TTGCTGATTA ATAATAATAT TCAAATGGCG GTGTTTAACT CCTGTTTGGG CGCATATAGG GCAAAATCTG AGACATCTGG GGATACAGGA GAACGCAACT TAACTGAAAG CTTGGTAAAA AGGGGTCTTA AAAGCGTTTT GGCAATGTCA GAGCGCATTC CTGATGAAGT GGCGCTGACG CTAACACAAT TGTTCTACCG CAACTTAAGT CAGGGCTATC CTTTGGATTT GTGTGTCAGT CGGGTGCGTC AGGGCTTAAT TTCTGCCTAT GGTTCTCACC AGATGTACTG GGCGTTACCG ATACTGTATC TGCATCGAGA ATTTGAAGGT TTTCTATCTC CGCAAATTAG TTTCTCTGGC AATGGGGAGT TGCTAAATGA ATATCAGTCA CCAATGGGAA CATCTGCTAA TTCAATGTAT GCTTCAGCAG ATGATTTAGA AATACCTTTA AATTTCGAGG ATATTATGCC TTCGGATTTG TCTAGGGAGT CTTCTGAATT GGATTGGCTG GGGGATGATA CTTGGGGGGA TCTTTTAGAT GAAGTGGATT ATGAGCACCC TGATTATGAT GAGGATTCGG CGATAGTTGC GGATTTATTC CGGCAGTTAG GTAATCCACA AGTCCTCACT GAAGAAACTT CCATGAAGGC GGAATTAGAA TCATCTGTAC GAGAGAATCA TCGTCAGGAA ATACAGGCTT CAGAGAAGCA AGAGGACATA GATTTTTGGG GAGATAAATC AACTTCAAGA ACTCCAAATT TGTCGGATAC TGAAGGTAAT CAGGTCGTTT CTCCACAGAT TGCATTACTA CCAATTCGCC AACCACGTCG CCTTCGTCGG CGATGGCCGA TGGCTGGGAT TATTGGTGTA AGTGTGTTAG CTGCCATTAT TGGTTTAATT TGGTGGTCTA ATCATCGCCA GTCGAGGGGT GTCAAAATTC CACCTATTCC TGTCCAGAAT GGGCCTCAAA ACAAAGCAGT AGCTGTTGAC TTCAAAAAAT CATCAACAGG TATTGTTACT GCTACGGCAA CAGAACAATT GAGTCAAGGT AATTTACAGC CAGGGTTAGA AGCTGTGGAG GAATTGCTGA ATCGTGGGGC TTTAGCCTCA GCAGATACGG CTTTAGCTTT AATTCCCACC AAGCAAACGG AAGATCCTGC TGTTAACTTT ATGCGGGGAA GATTGGCTTG GCAGTTTGCC CAGACGAGAG AGACAAAATA CAGTATTAAT GATGCCCGTC GTTATTGGGA AATTGCTGTG CGGGATCAAC CTAATTCTGT TTTGTATAAT AACGCAGTGG GATTTGCCTA TTATGCTGAA GGAAATCTGA ATCGGGCTAA TGATGCGTGG TTCAAAGCTG TGAATTTAGC ACTGAAAAAC CAGAATACAA ACTTCCCATT AACACCAAAT GCTAATAGTA TTATGCCACC AGAGGCTTTA ACTGCCTATG CAGGGTTAGC TTTGGGTTTA TATAAATCTG TAAATAATCA ACCTTCTGCT AAACAAGCGC AATATATGAA TGAGGCGATT AAGCTGCGTC AACTGGTAAT TCAGAATGAT CCGGTAAGTT TTACAGTAGA TAAGTTAGCT CAAAATTGGC TGTGGACAGA AAAGGCGATC GCAGATTGGC GATCGCTCCT TCAGCAACAA AAAAAGAGGT GA
|
Protein sequence | MTQEFHISVT PVGQNDYLVR TEQVAPGVPL AEELVTWPVA EWLAAAGHLM NDPLQLVLQG DAIARNSVNL VALGQQLYNA LFQGTLRDSW ITAQGIAQNQ QQVLRLRLGL KDTKLARLPW EVMHAGDRPL ATGPYIAFSR YQSGIPSTSR LPTRNLPTQQ EDYGVKVLMI ISSPTDQVHL DLLKQEAINL QTELHRQNPR GENDHYLPEI DLTVLDQPGR EELTQALEQG RYQVLHYSGH SNVGANGGEI YLVSRRTGLT ETLSGDDLAG LLINNNIQMA VFNSCLGAYR AKSETSGDTG ERNLTESLVK RGLKSVLAMS ERIPDEVALT LTQLFYRNLS QGYPLDLCVS RVRQGLISAY GSHQMYWALP ILYLHREFEG FLSPQISFSG NGELLNEYQS PMGTSANSMY ASADDLEIPL NFEDIMPSDL SRESSELDWL GDDTWGDLLD EVDYEHPDYD EDSAIVADLF RQLGNPQVLT EETSMKAELE SSVRENHRQE IQASEKQEDI DFWGDKSTSR TPNLSDTEGN QVVSPQIALL PIRQPRRLRR RWPMAGIIGV SVLAAIIGLI WWSNHRQSRG VKIPPIPVQN GPQNKAVAVD FKKSSTGIVT ATATEQLSQG NLQPGLEAVE ELLNRGALAS ADTALALIPT KQTEDPAVNF MRGRLAWQFA QTRETKYSIN DARRYWEIAV RDQPNSVLYN NAVGFAYYAE GNLNRANDAW FKAVNLALKN QNTNFPLTPN ANSIMPPEAL TAYAGLALGL YKSVNNQPSA KQAQYMNEAI KLRQLVIQND PVSFTVDKLA QNWLWTEKAI ADWRSLLQQQ KKR
|
| |