Gene Aazo_0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0033 
Symbol 
ID9337816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp28730 
End bp31201 
Gene Length2472 bp 
Protein Length823 aa 
Translation table11 
GC content44% 
IMG OID 
Productheterocyst differentiation protein HetF 
Protein accessionYP_003719813 
Protein GI298489636 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCCAGG AATTTCACAT TTCCGTAACC CCAGTAGGGC AGAATGACTA CTTGGTGCGG 
ACGGAACAAG TCGCACCTGG GGTGCCTTTG GCAGAAGAAC TGGTAACTTG GCCTGTGGCT
GAGTGGTTAG CAGCAGCAGG ACATTTGATG AATGACCCTT TACAGTTGGT GCTACAGGGG
GATGCGATTG CCAGAAACTC TGTAAACTTG GTGGCACTGG GTCAGCAATT ATACAATGCA
CTGTTTCAAG GCACTCTGAG AGATAGTTGG ATTACTGCCC AAGGTATTGC CCAGAATCAG
CAACAGGTAT TGCGCTTACG TTTGGGACTC AAAGATACTA AGTTAGCGCG GTTGCCGTGG
GAAGTGATGC ACGCAGGCGA TCGCCCTTTG GCAACGGGTC CTTATATCGC TTTTTCCCGC
TACCAAAGTG GAATTCCTTC CACATCAAGA CTACCAACTC GCAATCTTCC TACACAACAG
GAAGATTACG GGGTAAAAGT GTTGATGATC ATCTCTTCCC CTACTGATCA AGTGCATCTT
GATTTGCTCA AACAAGAAGC TATTAACCTG CAAACAGAAC TACATCGCCA AAATCCCCGT
GGAGAAAATG ATCATTATCT GCCAGAAATT GACCTGACAG TTCTTGATCA ACCAGGAAGG
GAAGAGTTAA CCCAAGCTTT AGAACAAGGA CGATATCAAG TTCTTCATTA CTCTGGTCAT
AGTAATGTGG GTGCGAACGG GGGAGAAATT TATTTAGTTA GTAGAAGAAC TGGTTTAACA
GAAACGCTTA GCGGTGATGA CTTAGCAGGG TTGCTGATTA ATAATAATAT TCAAATGGCG
GTGTTTAACT CCTGTTTGGG CGCATATAGG GCAAAATCTG AGACATCTGG GGATACAGGA
GAACGCAACT TAACTGAAAG CTTGGTAAAA AGGGGTCTTA AAAGCGTTTT GGCAATGTCA
GAGCGCATTC CTGATGAAGT GGCGCTGACG CTAACACAAT TGTTCTACCG CAACTTAAGT
CAGGGCTATC CTTTGGATTT GTGTGTCAGT CGGGTGCGTC AGGGCTTAAT TTCTGCCTAT
GGTTCTCACC AGATGTACTG GGCGTTACCG ATACTGTATC TGCATCGAGA ATTTGAAGGT
TTTCTATCTC CGCAAATTAG TTTCTCTGGC AATGGGGAGT TGCTAAATGA ATATCAGTCA
CCAATGGGAA CATCTGCTAA TTCAATGTAT GCTTCAGCAG ATGATTTAGA AATACCTTTA
AATTTCGAGG ATATTATGCC TTCGGATTTG TCTAGGGAGT CTTCTGAATT GGATTGGCTG
GGGGATGATA CTTGGGGGGA TCTTTTAGAT GAAGTGGATT ATGAGCACCC TGATTATGAT
GAGGATTCGG CGATAGTTGC GGATTTATTC CGGCAGTTAG GTAATCCACA AGTCCTCACT
GAAGAAACTT CCATGAAGGC GGAATTAGAA TCATCTGTAC GAGAGAATCA TCGTCAGGAA
ATACAGGCTT CAGAGAAGCA AGAGGACATA GATTTTTGGG GAGATAAATC AACTTCAAGA
ACTCCAAATT TGTCGGATAC TGAAGGTAAT CAGGTCGTTT CTCCACAGAT TGCATTACTA
CCAATTCGCC AACCACGTCG CCTTCGTCGG CGATGGCCGA TGGCTGGGAT TATTGGTGTA
AGTGTGTTAG CTGCCATTAT TGGTTTAATT TGGTGGTCTA ATCATCGCCA GTCGAGGGGT
GTCAAAATTC CACCTATTCC TGTCCAGAAT GGGCCTCAAA ACAAAGCAGT AGCTGTTGAC
TTCAAAAAAT CATCAACAGG TATTGTTACT GCTACGGCAA CAGAACAATT GAGTCAAGGT
AATTTACAGC CAGGGTTAGA AGCTGTGGAG GAATTGCTGA ATCGTGGGGC TTTAGCCTCA
GCAGATACGG CTTTAGCTTT AATTCCCACC AAGCAAACGG AAGATCCTGC TGTTAACTTT
ATGCGGGGAA GATTGGCTTG GCAGTTTGCC CAGACGAGAG AGACAAAATA CAGTATTAAT
GATGCCCGTC GTTATTGGGA AATTGCTGTG CGGGATCAAC CTAATTCTGT TTTGTATAAT
AACGCAGTGG GATTTGCCTA TTATGCTGAA GGAAATCTGA ATCGGGCTAA TGATGCGTGG
TTCAAAGCTG TGAATTTAGC ACTGAAAAAC CAGAATACAA ACTTCCCATT AACACCAAAT
GCTAATAGTA TTATGCCACC AGAGGCTTTA ACTGCCTATG CAGGGTTAGC TTTGGGTTTA
TATAAATCTG TAAATAATCA ACCTTCTGCT AAACAAGCGC AATATATGAA TGAGGCGATT
AAGCTGCGTC AACTGGTAAT TCAGAATGAT CCGGTAAGTT TTACAGTAGA TAAGTTAGCT
CAAAATTGGC TGTGGACAGA AAAGGCGATC GCAGATTGGC GATCGCTCCT TCAGCAACAA
AAAAAGAGGT GA
 
Protein sequence
MTQEFHISVT PVGQNDYLVR TEQVAPGVPL AEELVTWPVA EWLAAAGHLM NDPLQLVLQG 
DAIARNSVNL VALGQQLYNA LFQGTLRDSW ITAQGIAQNQ QQVLRLRLGL KDTKLARLPW
EVMHAGDRPL ATGPYIAFSR YQSGIPSTSR LPTRNLPTQQ EDYGVKVLMI ISSPTDQVHL
DLLKQEAINL QTELHRQNPR GENDHYLPEI DLTVLDQPGR EELTQALEQG RYQVLHYSGH
SNVGANGGEI YLVSRRTGLT ETLSGDDLAG LLINNNIQMA VFNSCLGAYR AKSETSGDTG
ERNLTESLVK RGLKSVLAMS ERIPDEVALT LTQLFYRNLS QGYPLDLCVS RVRQGLISAY
GSHQMYWALP ILYLHREFEG FLSPQISFSG NGELLNEYQS PMGTSANSMY ASADDLEIPL
NFEDIMPSDL SRESSELDWL GDDTWGDLLD EVDYEHPDYD EDSAIVADLF RQLGNPQVLT
EETSMKAELE SSVRENHRQE IQASEKQEDI DFWGDKSTSR TPNLSDTEGN QVVSPQIALL
PIRQPRRLRR RWPMAGIIGV SVLAAIIGLI WWSNHRQSRG VKIPPIPVQN GPQNKAVAVD
FKKSSTGIVT ATATEQLSQG NLQPGLEAVE ELLNRGALAS ADTALALIPT KQTEDPAVNF
MRGRLAWQFA QTRETKYSIN DARRYWEIAV RDQPNSVLYN NAVGFAYYAE GNLNRANDAW
FKAVNLALKN QNTNFPLTPN ANSIMPPEAL TAYAGLALGL YKSVNNQPSA KQAQYMNEAI
KLRQLVIQND PVSFTVDKLA QNWLWTEKAI ADWRSLLQQQ KKR