Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4420 |
Symbol | |
ID | 9342222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 4498688 |
End bp | 4501360 |
Gene Length | 2673 bp |
Protein Length | 890 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | |
Product | DSH domain-containing protein |
Protein accession | YP_003722851 |
Protein GI | 298492674 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0111717 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACTATC CCGCCCCGTC TTCAGAAATT AACTTAGGGT CGATTTTTCC CTTTGAACTG GATCAATTCC AACAGGAAGC GATCGCCTCC TTGAATGCTG GCCGCTCAGT AGTTGTCTGT GCACCCACAG GTTCAGGTAA AACATTAATT GGGGAATACG CCATTTATCG AGCCCTAGCG CGAGGGAAAC GAGTATTTTA TACCACACCC CTCAAAGCGT TATCGAATCA AAAATTACGT GACTTTCGAG AGAAATTCGG GTTTGATCAG GTTGGACTGT TAACCGGGGA TGCCTCCATT AACAGGGATG CCCCAATTTT AGTCATGACC ACCGAAATTT TCCGCAATAT GCTCTATGGC ACACCCATAG GACAAATTGG CATCTCCTTG GTAGATGTGG AGGCTGTTGT CCTGGATGAA TGCCACTACA TGAATGATCG CCAACGGGGT ACGGTGTGGG AAGAATCAAT CATCTATTGT CCCCATGAAG TGCAACTTGT CGCCCTTTCC GCTACTGTTG CTAACAGTGA CCAACTCACC GATTGGCTAA ATCGTGTTCA TGGTCCAACA GACCTGATTT ATTCCGATTT TCGCCCAGTT CCCTTAGAAT TTAACTTTTG CAATCCCAAA GGACTCTTCC CCCTCCTCAA TGAAAGTAAA ACCAAAATTA ACCCCAGGCT GATTAAACGG GGTAAAAAAG GACCAGGGGA AAAAGGCAAA GGCGGTAGGC CAGAAGCACT CAGTATCATT TATACCATCA GCCAATTAGA GCAACGGGAT ATGCTGCCAG CCATTTTCTT TATTTTCAGC CGCCGAGGAT GTGATAAAGC CGTGGCAGAA GTAGGGGATT TATGGTTAGT CAATAATGAC GAATCCCAAA TATTACGCAG ACAGATTGAT GAATTTTTAG CTCGTAACCC AGAAGCAGGA CGTTCTGGAC AAATTGCCCC TTTGTACCGA GGAGTTGCAG CCCACCATGC CGGCATTTTA CCCGCCTGGA AAGTTTTGGT AGAGGAACTA TTTCAACAGG GGCTAATTAA AGTCGTCTTT GCAACGGAAA CCTTGGCAGC GGGAATTAAT ATGCCTGCAC GGACAACGGT AATTTCTACT CTTTCCAAAC GCACCGATAA TGGACACCGG TTGTTGAAGG CTTCGGAATT CCTGCAAATG TCAGGTCGGG CAGGTCGTCG GGGCATGGAT TTACAAGGCT ATGTGGTGAC ATTACAAACT CCCTTTGAGG GGGCAAAAGA AGCTGCATAT TTGGCTACAT CTCCAGCAGA TCCCCTAGTT AGTCAGTTTA CACCCAGCTA TGGGATGGTG CTGAATTTGC TGCAAACCCA TACTTTGGAA CAAGCTAGGG AACTGATAGA ACGCAGTTTT GGACAGTATA TGGCAACGTT GTATTTAAAG CCAGAATACG ACGAAATGGG GGAAATAAAA GCAGAATTAG CAAAAATTCA GGCAGAATTC GCAGCAATTG ATGAAAATGA ATTGGCGCTG TATGAAAAAT TGCGACAACG GCTAAAAGTA GAACGTCATA TTCTAAAAAC TCTCCAAGAA CAAGCACAGA CAGATAGACA AGAACAATTG TCGATGATGC TAGACTTTGC CGTGTCTGGG ACGCTTTTGA GTCTCAAAGA TAAAAGTATG ATAGCAACTT TGCCAATTAC AGCAGTTTTG GTTGAGAAAG CCCCTGATGT TGGTCAAGCT TCTTATTTTG TCTGTTTGGG TCAAGATAAC CGTTGGTATG TGGCAACAGT TGCGGATGTG GTAGATTTGT ATGCTGAATT GCCCAGGGTG GAAGTGTCGC ATGATATTTT ACCACCAGCA GAATTGGCAT TAAAAAGGGG TCAGTGTGTG TGTGGTAATC AAGAAACTGC TGCGATTGCT CAGAGTATAC CCGACCCAGG GGAATTTATG TATATGCCAC CAGAAGTTGT GGAACAACTT GCTCGGTTTA ATGCTGTGCA AGCACAATTA GAAAATCACC CCTTACATCA ATCAGGAAAC ATAGCCAAAA TATTTAAAGA CAGAGCGCGT TGTGTAGAAT TGGAAGCCGA ACTGGAAGAG TTACAAGAAC AGGTAGAACA ACAGTCTCAA AGACATTGGG AACAATTTCT GAATTTAATT CAAATCTTGC AGCAGTTTGG CGGTTTAGAT AACTTAGTAC CCACAACACT GGGACAAATG GCTGCTGCGA TTCGGGGTGA AAATGAATTA TGGTTAGGTT TAGCGATCGC CAGTGGTGAA CTAGATAGTT TAGACCCCCA CCATTTAGCT GCTGCGGCTG CGGCCTTAGT GACAGAAACC CCACGTCCAG ATAGTAAAGT ACATTTTGAC CTCAGTAGTG AAGTAGCTGA TGCTTTGGCA AAGTTGCGGG GGATTCGTCG CCAATTGTTT CAAATACAAC GACGCTATAA TGTGGCACTG CCTATTTGGT TAGAATTTGA ATTAATCGCC ATTATCGAAC AGTGGGCTTT AGGTATGGAT TGGGTACAAC TATGTGCAAA TACCACTTTG GATGAAGGTG ATGTAGTCAG GCTTTTACGC CGGACTCTAG ATCTATTATC CCAGATTCCT CATGTTCCAT TAGTCCCGGA CTCTTTGCGG AAAAATGCTC AACGGGCTAT GCAGTTAATT GATAGATTCC CTGTGAATGA GGCTATGGAG TAA
|
Protein sequence | MNYPAPSSEI NLGSIFPFEL DQFQQEAIAS LNAGRSVVVC APTGSGKTLI GEYAIYRALA RGKRVFYTTP LKALSNQKLR DFREKFGFDQ VGLLTGDASI NRDAPILVMT TEIFRNMLYG TPIGQIGISL VDVEAVVLDE CHYMNDRQRG TVWEESIIYC PHEVQLVALS ATVANSDQLT DWLNRVHGPT DLIYSDFRPV PLEFNFCNPK GLFPLLNESK TKINPRLIKR GKKGPGEKGK GGRPEALSII YTISQLEQRD MLPAIFFIFS RRGCDKAVAE VGDLWLVNND ESQILRRQID EFLARNPEAG RSGQIAPLYR GVAAHHAGIL PAWKVLVEEL FQQGLIKVVF ATETLAAGIN MPARTTVIST LSKRTDNGHR LLKASEFLQM SGRAGRRGMD LQGYVVTLQT PFEGAKEAAY LATSPADPLV SQFTPSYGMV LNLLQTHTLE QARELIERSF GQYMATLYLK PEYDEMGEIK AELAKIQAEF AAIDENELAL YEKLRQRLKV ERHILKTLQE QAQTDRQEQL SMMLDFAVSG TLLSLKDKSM IATLPITAVL VEKAPDVGQA SYFVCLGQDN RWYVATVADV VDLYAELPRV EVSHDILPPA ELALKRGQCV CGNQETAAIA QSIPDPGEFM YMPPEVVEQL ARFNAVQAQL ENHPLHQSGN IAKIFKDRAR CVELEAELEE LQEQVEQQSQ RHWEQFLNLI QILQQFGGLD NLVPTTLGQM AAAIRGENEL WLGLAIASGE LDSLDPHHLA AAAAALVTET PRPDSKVHFD LSSEVADALA KLRGIRRQLF QIQRRYNVAL PIWLEFELIA IIEQWALGMD WVQLCANTTL DEGDVVRLLR RTLDLLSQIP HVPLVPDSLR KNAQRAMQLI DRFPVNEAME
|
| |