Gene Information |
Locus tag | Aazo_1951 |
Symbol | |
ID | 9339744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 2030564 |
End bp | 2033530 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003721160 |
Protein GI | 298490983 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
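The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either assumption, though both are consistent with its values. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Start bp and End bp as listed for Aazo_1951.
gene_len, protein_len = cds_lengths(2030564, 2033530)
print(gene_len, protein_len)  # 2967 988
```

Under these assumptions the result matches the listed Gene Length (2967 bp) and Protein Length (988 aa). The strand does not affect the length calculation, only the orientation in which the sequence is read.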
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTGGA AATGGTGCTT TAGAGTTATT ATCGCTTTGT TGGGGCTGTG GCTATTCTGG GATATAGTTT CCCATCTAGG GGCAGAGATT TTTTGGTTTC AGGAAGTTGG CTATTTGCAA ACATTTCTGC TGCGCTTGGT GACTAAGGGT GCTTTATGGG TGGGTGTGGT CAGTGTTAGT TTTACCTATC TGTTTTTTAA TCTGGCTTTA GCACAACGCT GGAAATATCC CCAGTCTCTG AAAACTGAGT TGGTCAGAAA TGAGGAGACA AGACTGAGTA AGCAACTCAC AAAATTTCTC AGTCCACAGT ATGGCACAGT CTATAAACCT CCTTATGTTG AGACTGGATA CAAAGAAATT AGATTGCGCT GGCTGTTACC ACTGACGTTA GGCTTGAGTT TGTTGGTGGG GTTGATGGTA ACCCATTATG GACAAGTCGC TCTAAGTTAC TGGAATAGAG AAGTTAATCA GGTTATTTCA CCTTTTACAA TTCTATTTAG ACCAGAGATT CTTTGGAATT TCGGACACAC AATTATTTTT CAGGACTGGT ATTTTGGTGT GGCTTTAGTA ATGGCGATCG CACTCTTAAT ATATTCTCAC TTTTTGCTGA GAGCGATCGC ACTTATTTTT AGTCTTGGTT TTGGCTGGAT ACTATCTCAA CATTGGTGTA AAATACTACT TTATTTCCAC TCTACTCCCT TCAATACCAA CGACCCTTTA TTTGGCAAAG ATATCGGTTT TTATATTTTT TCACTCCCTC TGTGGGAACT TTTGGCATTT TGGCTACTTG GATTATGTTT ATATGGCTTT GTGTCCGTTG GTCTCACCTA TCTTTTATCA GGAGATAGTC TAAGTCAAGG CATTTTTCCT GGTTTTTCTC TACCACAACA ATGTCATTTA TTCGGTTTGG GTAGCTGTTT GATGTTGGTA GTAGCCTTTA GTTATTGGTT AGGTCGTTAT GAACTGGTTT ATTCTCCCCG TGGGGTGAGT TTTGGCGCTA GTTACACCGA TGCTAAGATC CAGTTGCCAG TTGACACTAT GTTATGTGTT TTAGCTGTAG TGCTCGCATT TTATTTATTA TGGCAAACTT TCTTCTGGAA ACCCAAATCG CAACATCATC GTTGGGCAAT TTATAGTTTA TGTATTTATA TAATTTTAAT AATTACAGGT GATTTTCTTG TACCTGCTGT TGTCCAATCT TTAATAGTTC AACCCAATGA ATTGCAGCGA GAGAAGCCTT ACATCCAACG TACTATTAAC TTTACTCGTC AAGCATTTGA TTTAGAAGCA GTTGATTCTC AAACCTTTAA TCCCCAGGGA AACTTAACCC AAGCTGATCT CAAAGTCAAT GATCTAACAA TTCGTAATAT TCGTCTTTGG GATCAAGAAC CATTATTAAA AACTAACCGC CAATTACAAC AAATTCGTCC CTATTATCAA TTTCCCGATG CTGATATTGA TCGTTACACA ATAAAAATTG AACCCAACCA AGAAGCAACC GCATCAACCG AAAAACGTCA GGTACTAATT GCAGCCAGGG AATTAGACTA TAATTCTGTT CCACAGCAAG CTAAAACATG GGTAAACCGC AATTTAGTTT ACACCCACGG TTATGGCTTT ACCATGAGTC CTGTGAATAC TGTTGCTCCT GGTGGTTTAC CAGAATATTT TGTTAAAGAC ATTAGCGGTA ATGATAGTGC GCTAACTACT TCTAGTGAAG CTATTCGTGA AAGTATTCCT ATTGGTAAAC CGCGAATTTA TTATGGAGAA ATTAGCAATA CTTATGTCAT GACTGGGACG 
AAAGTTAGAG AATTAGATTA TGCCAGTGGC AGAGATAATG TTTACACCAG TTATGATGGC GTGGGTGGAA TTAGAATTGG TTCTCTGGGT CGAAGATGGC TATTTGCTAC ATATTTGAAA GACTGGCAAA TGATATTTAC ACGGAATTTT CTACCAGAGA CAAAAGTATT ATTCCGGCGA AATATTAACC AACGAGTTCG TGCGCTCGCA CCTTTTCTAA AATTTGACAG TGAGCCTTAC TTAGTAGCTG CTGATGCCAA TCCTGACAAC AAAAATGAAC AATTCCCGGC AACAAAGAAT AATCTTTACT GGATTATAGA TGCTTATACA ACAAGTGACC GCTATCCCTA CTCTGACCAA AATGGTGATG GCATCAACTA TATTCGTAAC TCCGTTAAGG TCGTAATTGA CGCTTACAAC GGCACAGTCA GATTTTACGT TGCTGAACCA AAAGAGCCTC TAATTATTGC CTGGTCGAAA ATATTCCCAC AAATGTTCCA ACCGCTGGCA AATATGCCTG TTAATCTCCA GAGTCATATC CGCTATCCAG TGGACTTTTT TAAAATTCAA TCTGAACGGT TAATGGTTTA TCATATCACT AACCCCCAAG TATTTTACAA CCGCGAAGAT CAATGGCAAA TTCCCAATGA AATATATGGA ACGCAAACCC GTCAGGTTGA ACCTTACTAT TTAATTACTA GTCTTCCTAA CGTTCCCTTT GAAGAATTTA TTTTACTGCT TCCCTACACA CCCAAACAAC GAACAAATTT AATTGCTTGG TTAGCAGCGC GTTCAGATGG AGAAAATTAC GGTAAATTGC TGCTGTATAA CTTTCCTAAA GAACGGTTGA TTTATGGGAC AGAACAAATA GAAGCGAGAA TTAACCAAGA TCCAGTAATT TCTCAACAAA TTTCTTTATG GAATCGTCAA GGTTCGAGAG CAATTCAAGG TAATCTTTTG ATTATTCCCA TTGAACAATC TTTGTTATAT GTTGAGCCAA TTTATTTAGA AGCAACTCAG AATAGTTTAC CAACTTTGGT TAGGGTGGTT GTTGCTTATG AAAATCGCAT TATCATGGCA AAAACTTTAG AACAAGCATT ACAAGGAATT TTTCAACCAG AGGTAACACA AGCACCGACA ATTATTCGTC CTGTGGAACA AGAATGA
|
Protein sequence | MSWKWCFRVI IALLGLWLFW DIVSHLGAEI FWFQEVGYLQ TFLLRLVTKG ALWVGVVSVS FTYLFFNLAL AQRWKYPQSL KTELVRNEET RLSKQLTKFL SPQYGTVYKP PYVETGYKEI RLRWLLPLTL GLSLLVGLMV THYGQVALSY WNREVNQVIS PFTILFRPEI LWNFGHTIIF QDWYFGVALV MAIALLIYSH FLLRAIALIF SLGFGWILSQ HWCKILLYFH STPFNTNDPL FGKDIGFYIF SLPLWELLAF WLLGLCLYGF VSVGLTYLLS GDSLSQGIFP GFSLPQQCHL FGLGSCLMLV VAFSYWLGRY ELVYSPRGVS FGASYTDAKI QLPVDTMLCV LAVVLAFYLL WQTFFWKPKS QHHRWAIYSL CIYIILIITG DFLVPAVVQS LIVQPNELQR EKPYIQRTIN FTRQAFDLEA VDSQTFNPQG NLTQADLKVN DLTIRNIRLW DQEPLLKTNR QLQQIRPYYQ FPDADIDRYT IKIEPNQEAT ASTEKRQVLI AARELDYNSV PQQAKTWVNR NLVYTHGYGF TMSPVNTVAP GGLPEYFVKD ISGNDSALTT SSEAIRESIP IGKPRIYYGE ISNTYVMTGT KVRELDYASG RDNVYTSYDG VGGIRIGSLG RRWLFATYLK DWQMIFTRNF LPETKVLFRR NINQRVRALA PFLKFDSEPY LVAADANPDN KNEQFPATKN NLYWIIDAYT TSDRYPYSDQ NGDGINYIRN SVKVVIDAYN GTVRFYVAEP KEPLIIAWSK IFPQMFQPLA NMPVNLQSHI RYPVDFFKIQ SERLMVYHIT NPQVFYNRED QWQIPNEIYG TQTRQVEPYY LITSLPNVPF EEFILLLPYT PKQRTNLIAW LAARSDGENY GKLLLYNFPK ERLIYGTEQI EARINQDPVI SQQISLWNRQ GSRAIQGNLL IIPIEQSLLY VEPIYLEATQ NSLPTLVRVV VAYENRIIMA KTLEQALQGI FQPEVTQAPT IIRPVEQE
|
| |
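The gene sequence above is shown in space-separated blocks of 10 bases and begins with ATG, which suggests it is given in coding orientation even though the gene lies on the minus strand. As a minimal sketch, the code below strips the whitespace and translates a sequence codon by codon. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard NCBI codon ordering is used here. The function names `clean_sequence` and `translate` are hypothetical, and the input in the example is only a toy built from the first 15 bases of the record plus a TGA stop codon.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(text):
    """Translate a coding-orientation DNA sequence up to the first stop codon."""
    seq = clean_sequence(text)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# Toy input: first 15 bases of the gene sequence followed by a stop codon.
print(translate("ATGTCTTGGA AATGGTGA"))  # MSWKW
```

The toy output agrees with the first five residues of the listed protein sequence (MSWKW). Running the same function on the full gene sequence would be one way to check it against the 988-residue protein, provided the two sequence lines are joined first.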