Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4831 |
Symbol | |
ID | 9342638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4941514 |
End bp | 4944450 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | |
Product | excinuclease ABC subunit A |
Protein accession | YP_003723110 |
Protein GI | 298492933 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGAAC ACCTAGCAAC ATCTTTAAAT CATCGTCTTC CTTACCCCGA CAACAGCCAG AATACTATTC GGATTCGGGG TGCTAGGCAG CATAATCTGA AAAATATTGA TTTAGAACTA CCTCGCGATC GCTTGATTGT CTTTACCGGC GTTTCTGGTT CTGGTAAGTC GTCTTTGGCG TTTGATACCA TCTTTGCAGA AGGTCAGCGG CGTTATGTGG AATCTCTCAG CGCCTATGCT CGGCAATTTT TGGGACAGTT GGATAAACCT GATGTGGAAG CGATTGAAGG TTTAAGTCCC GCGATTTCCA TTGATCAAAA ATCAACTTCC CATAATCCTC GTTCGACGGT GGGAACGGTA ACGGAAATTT ATGATTATCT GCGCTTGTTA TATGGTCGTG CTGGTGAACC CCATTGTCCT AAATGCGATC GCTGTATTTC TCCCCAAACC ATTGACGAGA TGTGCGATCG CATCATGGAA CTTCCAGACC GCACCCGTTT CCAAATTCTC GCACCTGTCG TCCGGGGGAA AAAGGGAACT CACGGCAAAT TATTATCTAG TTTAGCCGCA CAAGGATTTG GCAGGATTAA GGTCAATGGA GAGGTTTTAG AACTATCTGA CTTCATTGAG TTAGATAAAA ATATTACACA TAATATAGAA CTGGTTATTG ATAGGTTAGT TAAAAAAGAC GGTATTCAAG AGCGTTTAGT TGATTCTCTC ACCACCTGCC TTAACCAATC CAACGGGATT TCTATAATCG AAGTATTAAA TCATACCTGC CATAAAGTAA AGCAGGAGGT ACCACATAGT AAGTATATAT CAAATGGGGG AGAAAATCAA GATATTTCCG CACAAGAATT AGCATCAGAA ATGGTATTTT CAGAAAACTT TGCTTGTCCA GAACATGGTG CAATTATGGA AGAATTATCA CCACGTTTGT TTTCTTTTAA TTCTCCCTAT GGTGCTTGTT CCCATTGTCA TGGCATTGGG ACTTTAAAAA GATTTTCTCC AGAACTGGTC ATACCTAACC AAGATTCGCC CATGTATGCT GCGCTCGCAC CTTGGTCAGA AAAAGAAAAT TCCTACTATT TAGAATTACT GTATAGTTTA GGACAAGCCT ATGGATTTGA ATTACAGACC CAGTGGAGCA AATTGACACC AGAACAGCAG CAGATAATTT TATATGGAGA AGAAAAACCC GCAGAAGGGA AAAAACAAGC CTTTAAAGGA GTAGTTCCCA TCTTACAAAG ACAATATGAA GGGGGCACAG AATTAGTTAA ACAGAAACTA GAAGAATATT TAATCGATCA ACCTTGTGAA GTTTGTGGAG GAAAGCGGTT AAAACCCGAA GCCTTAGCCG TGAAATTAGG ACAATATGGA ATTTCAGATT TAACCAGCGT CTCCATCCGC GAATGTCGAG AAAGAATAGA CCGATTGAAA TTGACCCCAC GACAAATACA AATTGGCGAT TTAGTTCTCA AAGAAATTAA AGCCAGATTG CAGTTTTTAT TAGATGTCGG GTTAGATTAT CTCACGTTAG ATCGTCCCGC CATGACATTA TCAGGTGGAG AAGGCCAACG AATTCGGTTA GCAACACAAA TTGGTTCTGG ATTAACAGGA GTTCTCTACG TTTTAGACGA ACCAAGCATC GGTTTACATC AAAGAGATAA CGGACGTTTA TTGAAAACTT TAACCAAATT ACGCGACTTA GGAAATACAT TAATAGTCGT CGAACATGAT GAAGAAACCA TACGTGCAGC TAATTATATA GTAGATATTG GTCCTGGTGC AGGAATTCAT GGCGGAAATA TTATTTCTCA GGGAAATTTA GAAAATTTAT TAAATGCAGA AGCTTCTTTA ACCGGTGCTT ACCTATCAGG AAGAAGAGTC ATTAATACCC CAGTCTCTAG AAGAGAAGGA AATGGCAGAA CTTTAATAAT TAGAAACGCT CATCGCAACA ACCTCAGAAA TATAGATGTC GAAATTCCTT TAGGTAAACT TGTCGCTGTC ACCGGTGTTT CTGGTTCAGG AAAATCTACC TTAATCAATG AACTACTTTA TCCAGCATTA CAACATCATC TTTTAAAAAG AATTCCCTTT CCCAAAGATA TTGATGAAAT TAAAGGTTTA AACTGCGTTG ACAAAGCCAT AGTCATCGAC CAATCACCAA TTGGTAGAAC TCCCCGTTCT AACCCAGCAA CCTACACCGG AGTCTTCGAT ATCATTCGAG ATGTGTTTTC CCAAACTATC GAAGCCAAAA CCAGAGGCTA CAAACCGGGA CAATTTTCTT TCAACGTTAA AGGTGGAAGA TGTGAAGCTT GTAGCGGACA AGGTGTAAAC GTCATTGAAA TGAACTTTTT ACCTGACGTT TATGTACAGT GTGAAATTTG CAAAGGTGCG AGATACAACC GGGAAACATT ACAAGTTAAA TATAAAGATA AATCAATTTC CGACGTTCTC AATATGACCG TTGAAGAAGC ATTAGCATTT TGTGAAAACA TTCCCAAAGC TGTCACCAGA TTACAAACAT TAGTGGATGT GGGTTTAGGT TATGTGCAGT TAGGACAACC AGCCACAACT TTATCTGGTG GAGAAGCACA ACGGGTTAAA TTAGCAACAG AATTATCTCG ACGTGCAACA GGGAAGACTC TTTATTTAAT AGATGAACCA ACAACAGGAT TGTCTTTTTA CGACGTACAT AAATTGTTGG ATGTTTTACA AAAATTGGTG GATAAAGGAA ATTCGATTTT AGTCATTGAA CATAATTTAG ATGTGATTCG TTGTTCTGAT TGGGTAATAG ATTTGGGACC AGAAGGAGGA GATCAAGGAG GACAAATTAT TATTGCAGGT ACACCGGAAG ATGTGGCGGA AAATCAACGT TCTTATACTG GGGAATATTT AAGGCAGGTG TTGAAGCAAT ATTCGGCAGT TGTCTAA
|
Protein sequence | MSEHLATSLN HRLPYPDNSQ NTIRIRGARQ HNLKNIDLEL PRDRLIVFTG VSGSGKSSLA FDTIFAEGQR RYVESLSAYA RQFLGQLDKP DVEAIEGLSP AISIDQKSTS HNPRSTVGTV TEIYDYLRLL YGRAGEPHCP KCDRCISPQT IDEMCDRIME LPDRTRFQIL APVVRGKKGT HGKLLSSLAA QGFGRIKVNG EVLELSDFIE LDKNITHNIE LVIDRLVKKD GIQERLVDSL TTCLNQSNGI SIIEVLNHTC HKVKQEVPHS KYISNGGENQ DISAQELASE MVFSENFACP EHGAIMEELS PRLFSFNSPY GACSHCHGIG TLKRFSPELV IPNQDSPMYA ALAPWSEKEN SYYLELLYSL GQAYGFELQT QWSKLTPEQQ QIILYGEEKP AEGKKQAFKG VVPILQRQYE GGTELVKQKL EEYLIDQPCE VCGGKRLKPE ALAVKLGQYG ISDLTSVSIR ECRERIDRLK LTPRQIQIGD LVLKEIKARL QFLLDVGLDY LTLDRPAMTL SGGEGQRIRL ATQIGSGLTG VLYVLDEPSI GLHQRDNGRL LKTLTKLRDL GNTLIVVEHD EETIRAANYI VDIGPGAGIH GGNIISQGNL ENLLNAEASL TGAYLSGRRV INTPVSRREG NGRTLIIRNA HRNNLRNIDV EIPLGKLVAV TGVSGSGKST LINELLYPAL QHHLLKRIPF PKDIDEIKGL NCVDKAIVID QSPIGRTPRS NPATYTGVFD IIRDVFSQTI EAKTRGYKPG QFSFNVKGGR CEACSGQGVN VIEMNFLPDV YVQCEICKGA RYNRETLQVK YKDKSISDVL NMTVEEALAF CENIPKAVTR LQTLVDVGLG YVQLGQPATT LSGGEAQRVK LATELSRRAT GKTLYLIDEP TTGLSFYDVH KLLDVLQKLV DKGNSILVIE HNLDVIRCSD WVIDLGPEGG DQGGQIIIAG TPEDVAENQR SYTGEYLRQV LKQYSAVV
|
| |