Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4775 |
Symbol | |
ID | 9342582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 4874969 |
End bp | 4878073 |
Gene Length | 3105 bp |
Protein Length | 1034 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003723076 |
Protein GI | 298492899 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.527155 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGTGG ATTATTTACA CCCCCAACAC CTTGAAGAAC TTGTTGAAGA TAGTAGTATA GATTCATACT TAGCACGATT AAATTTTAGA TCGCTGCAAG GTGTGAATGC ATATCAGTAT CTACTCATAT CCGAACAACT CCCACGCACC AATACTGGCA TGATCAAAAG TTCATGGTTG CAACGTTACA GCCATATTAC AGCAGGTGGT TGGTGGTGTT CTGGAAGAGA CCCCCTGAAT AATTGGCAGA AAATGGAATG GGGATGTTTT AAACCAACCC AACCGCGACA AAACAAAGAT GGTAAGTCTA TTAAATATGA ACATCCCCCA AGCACAGCAA CAAGGGTATT TTGTCTGCGT GTAACTTTGC AAATTTGGCA GCAAGTTTCC CAACGGTATA ATATTCCCAT GCCTGAAAAT ATCATCATTA CTGAAGATGG TGAAGCAGAA GGTTTTTGGC AATGGATAAT GGAATGTAAT ATCTCAATAA TTATTTGTGA AGGTGTGAAA AAGGCTGCCG CTTTGTTAAC ACAAGGTTAT CCAGCTATTG CCATTCCGGG AATTACTAGC GGTTATCGAG TTGTAAAAGA TGAATTTGGT AAAGTCACCC GTCGTCAACT CATTCCTGAT TTAGAAGTTT TTGCTACAAG ACAACGAAGT TTTTATATAT GTTTTGATTT TGAAAACCAA GCCAAAAAGA TGGCTGCTGT TAACAATGCA ATTTCTCAAC TTGGTTGTTT ATTTCAACAA CAAGATTGTC CTGTTAAAGT TGTGGAGTTA CCGGGAATAG AAAAAGGTGT TGATGAGTTT ATTGTTGCTA AAGGTGCAGC TAATTTTGAA ATAATTTATC GTCAAAGTGT ATATTTAGAA ATTTACCTTG CTCAAACTAA ACCTCACGGG GAATTAACAA TTACTCCTGC ACTCACTTTT AACCAGCCTT ATTTAGAAAA AATCCCCTTT CCTACTTCTG GCTTGGTAGG AGTGAAATCC CCTAAAGGTA CAGGAAAAAC CACTGGACTG CAAGCAGTTG TCAATCAAGC TAAAAGTCGT AATCAACCAG TTTTATTAAT TACTCATCGC ATTCTTTTAG GAAGATTTTT ATGTGAGAAG ATTGCCATTC AATGGGGAAT TAGCCATCAA GCATGGAGTA TTGAAGAAGA CCCGACATTA CCAATTAGTA GTTATCAATT ACCAATTACT AAATCCTTCG GTTTATGCAT TGATTCTATT TGGAAACTGA ATCCAGAAGA TTGGCATGGA GGCATAGTTA TATTAGATGA AGTAGAACAA TCTTTATGGC ATTTACTTAA TAGTAATACT TGTAAACAGA AGCGGATTAA GATTCTTAGG ATTTTTCAGC AATTGATTGC TACTGTTCTC ACAACTGGGG GTCTAATAAT TGCCCAGGAT GCAGATTTAA CAGATATTTC TTTAGAATAT TTACAAGGTT TAGCAGAGAC TAAAATTACG CCTTGGGTGG TTCTTAACCA ATGGAAACCA CAGCAGGGTT GGGATGTAAC TTTCTATGAT TCCCCTAACC CAACACCTTT AATTCATCAG TTAGAATTAG ATTTAATTGC TGGACGTAAA TGTTATGTAA CTACTGATAG TCGCACTGGA AGTTATAGTT GTGAAACTAT TGAACATTAC CTCAAAGAAA GATTACTTAA ATTAAGAAAG GAATTTCCTA ATACTTTGGT TGTTAGTAGC CATACTACTA ATACACCTGG TCATGCTGCG GTTGATTTCA TCACGGCTAT TAATCAGAAA ATTACAGATT ATAATACTGT ATTTGTTACT CCTAGTTTGG GGACAGGAAT TAGTATTGAT GTCCAACATT TCGACCGCGT TTATGGAATT TTTCAAGGAG TAATTACTGA CTCAGAAGCC CGTCAAGCAT TAGCACGAGT TCGGGATGAT ATACCCCGTG TTGTTTGGTG TGCCAAACGT GGTATTGGTT TAATTGGTAG TGGTAGTACA AATTATCAGT TGTTATCTCA TTGGTATCAA GAAAATCAAA AAGAAAATTT AGCTTTGCTG AGTCCTTTAC ACAAAATAGA TGTAGATCTA CCTATGGTAT ATGACCCTGT GCATTTACGA ACTTGGGCTA AATTATCAGC TAGGGTAAAT GCTTCTATTC GTCTCTATCG GCAATCATTG CAAGATGGTT TAATTGCTGA TGGACATCAA GTTATGATGC GGAGTAATAC AGTCCAAAAT AATATTATTC GGGATTTACG CTTGGCTTTC TTTGCTACTG ATGCTAGTGA TTTAGAAACT AGAAAGAGGT TAATTGTCGA AATTGTCAAA GTCCAGAAAG ATTGGGTGAA AAGTCGTCAA AAAGCTAAAG ATATTAAGCG CAAAATCCAG GAAATTAAAC AACACAATCA ATTATTAGCT GCAAAGGCTG TAGCTAATGC TAGTGATATT GATTATTGTG AATATAATCA ATTATTAAAT AAACATTCTC TTAGTGATAA GGAACGTAAC CAAATAAATA AATATTTGCT CAGAGATATG TATGGGATTG AAGTAACTCC TATGCTAACA TTGCGCGATG ATAAGGGTTA TTATGGACAA TTATTAACTC ACTATTATCT GACCCATGAA AGTGAATATT TCCATGTCAG AGATCAACAA GAGTGGCATA AACAATTATA TTGGGGTGAA GGAAAGGTCT TTTTACCAGA TTTGAGAACG TATACTCTCA AAGTTGAAGC TATGCGAGCA TTAGGTATGT TGGAATTCTT GGAAAATGGT AGAGTATTTA AGGAAAATGA TGCTGATTTG ATTTGGTTGA AGAATGTGGC TGTGCAAAGT AATAAACATA TTAAACGAGC GTTGGGTATT GATGTGGTAC ATGGGAAGGA AGTAGTTTCT GGAATTAAAA TCCTGGGCAG ACTCCTAAAT TTACTAGGTT TGAAGTTACA TCACGTAAAT GATATTTATC AAATTGATTC GCAAACATTA AATGATGGGA GAGGAAATAT ATTTAGTGTT TGGCAACAAC GTGATGAGTT GCGGTTGTAC CATTTGTATG GTGATAATAC TACAATTTTT GATGATTCTT TAAATTCCCA GTCAATGGCA CTGCAAGTAA TGTAA
|
Protein sequence | MLVDYLHPQH LEELVEDSSI DSYLARLNFR SLQGVNAYQY LLISEQLPRT NTGMIKSSWL QRYSHITAGG WWCSGRDPLN NWQKMEWGCF KPTQPRQNKD GKSIKYEHPP STATRVFCLR VTLQIWQQVS QRYNIPMPEN IIITEDGEAE GFWQWIMECN ISIIICEGVK KAAALLTQGY PAIAIPGITS GYRVVKDEFG KVTRRQLIPD LEVFATRQRS FYICFDFENQ AKKMAAVNNA ISQLGCLFQQ QDCPVKVVEL PGIEKGVDEF IVAKGAANFE IIYRQSVYLE IYLAQTKPHG ELTITPALTF NQPYLEKIPF PTSGLVGVKS PKGTGKTTGL QAVVNQAKSR NQPVLLITHR ILLGRFLCEK IAIQWGISHQ AWSIEEDPTL PISSYQLPIT KSFGLCIDSI WKLNPEDWHG GIVILDEVEQ SLWHLLNSNT CKQKRIKILR IFQQLIATVL TTGGLIIAQD ADLTDISLEY LQGLAETKIT PWVVLNQWKP QQGWDVTFYD SPNPTPLIHQ LELDLIAGRK CYVTTDSRTG SYSCETIEHY LKERLLKLRK EFPNTLVVSS HTTNTPGHAA VDFITAINQK ITDYNTVFVT PSLGTGISID VQHFDRVYGI FQGVITDSEA RQALARVRDD IPRVVWCAKR GIGLIGSGST NYQLLSHWYQ ENQKENLALL SPLHKIDVDL PMVYDPVHLR TWAKLSARVN ASIRLYRQSL QDGLIADGHQ VMMRSNTVQN NIIRDLRLAF FATDASDLET RKRLIVEIVK VQKDWVKSRQ KAKDIKRKIQ EIKQHNQLLA AKAVANASDI DYCEYNQLLN KHSLSDKERN QINKYLLRDM YGIEVTPMLT LRDDKGYYGQ LLTHYYLTHE SEYFHVRDQQ EWHKQLYWGE GKVFLPDLRT YTLKVEAMRA LGMLEFLENG RVFKENDADL IWLKNVAVQS NKHIKRALGI DVVHGKEVVS GIKILGRLLN LLGLKLHHVN DIYQIDSQTL NDGRGNIFSV WQQRDELRLY HLYGDNTTIF DDSLNSQSMA LQVM
|
| |