Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_2072 |
Symbol | |
ID | 9339866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 2155543 |
End bp | 2157282 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | |
Product | single-stranded nucleic acid-binding R3H domain-containing protein |
Protein accession | YP_003721243 |
Protein GI | 298491066 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGATTA CAGACGATCT CCAAAAGTTA CTGGACATTT TGCCCCAAGA CCTGCGGCAA GAACTAGAAA ATCATCCCAA AAGAGATTAT CTAGTGGAAG TGGTCTTGGA CTTAGGTCGT CGCCCAGAAG CTCGGTTTCC CCATGCAGCT GAGTATCTGA GCGAAACCCC CGTCACTCAA GCACAAATAG ATGATTGCAT TCAACGAGTC GGAACCTTTG GTGGAGATAA TCGGGCAGGA ATTGAGCAAA CTTTACATCG CATCAGTGCT ATCCGTAACC GCACAGGCAA AATTATTGGT TTAACTTGTC GTGTCGGCCG GGCGGTATTT GGCACAATTG GCATGATCCG CGATTTGGTA GAAACCGGCA AATCCATTCT CATGTTAGGT TGTCCTGGTG TGGGTAAAAC CACTGCTTTA CGGGAAATTG CCCGTGTATT GGCAGATGAT TTAAATAAAC GAGTGGTAAT TATTGACACC TCTAATGAAA TCGCTGGTGA TGGTGATATA GCTCACTCTG CCATTGGCCG CGCTCGCCGG ATGCAAGTAG CAAAGCCAGA ATTACAACAT CAAGTGATGA TTGAGGCAGT GGAAAACCAT ATGCCGGAGG TGATTGTCAT TGATGAAATC GGCACAGAAC TGGAAGCTTT AGCAGCCCGT ACAATTGCAG AGAGAGGTGT ACAATTAGTT GGTACTGCTC ACGGAAACCA AATTGAAAAC CTCATCAAAA ATCCCACGCT GTCTGATTTG GTGGGGGGTA TTCAAGCAGT GACGCTGGGA GATGACGAAG CGAGACGACG CGGTTCACAG AAGACCGTTT TGGAACGTAA AGCTCCACCT ACCTTTGAGA TTGCTGTGGA AATGATGGAA CGGCAACGCT GGGTAGTGCA TGAAAGCGTA GCTGATACAG TGGATACTCT GTTAAGAGGT CGTCAAGCTA ACCCACAAAC ACGAAGTATG GATGAAAATG GCAAAGTGGC GATTACCAGA CAATTATCTG TGGTTCAGGG TCGTGGAGGT AACTTAGCTA GTGAGGAAGA ATCTTTACCG GCTGTGAAGC AGGTTAATGG CTGGCGTAGT TCTGGTCAGA TGGTGGCTCT ACCACCTTTA TCTATAGAAC GGGAACGGGT GACAGGACGC AGTGAGTTTG ACCGCTTGCT GGATGAGTCT TTCAATTATT CTTCTGACAG TATTGATTTT AGTCATACTA AATCAGCCGG TCCAAACGGG GAAGATTTAC CTTTGCATAT TTACCCATAT GGGGTGAGTC GTCACCAACT GGAACAGGTG ATTAGTGTGT TAACTTTGCC TGTTGTATTG ACAAAAGACA TAGATAGTGG TGATGCAATT TTGGCGTTGC GATCGCACGT GAAAAACCAC GCTAAATTAA AGCAAATAGC CAAGGCTCGT CATTTACCAA TTCATGTGAT TAAGTCTAGC ACCATACCCC AAATTACTCG CGGTTTGCGG CGGTTGCTGA ATATTGATGA TCCAGAAATC GGTGATGACC GAGAATTACA ACTATTACTC CATAGTGGGA GTGATGACGA AATAGACGCT TTGGAAGAAG CCAGACTTGC AGTTGAGCAA ATTGTCATTC CTAAAGGTCA ACCTGTGGAG TTATTACCCC GTTCTCCGCA AGTAAGAAGA ATGCAGCATG AGTTGGTAGA ACATTATAGG CTCAAGTCCA ATAGTTTTGG GGAAGAACCA AACCGACGGT TAAGAATTTA TCCGGCATAA
|
Protein sequence | MTITDDLQKL LDILPQDLRQ ELENHPKRDY LVEVVLDLGR RPEARFPHAA EYLSETPVTQ AQIDDCIQRV GTFGGDNRAG IEQTLHRISA IRNRTGKIIG LTCRVGRAVF GTIGMIRDLV ETGKSILMLG CPGVGKTTAL REIARVLADD LNKRVVIIDT SNEIAGDGDI AHSAIGRARR MQVAKPELQH QVMIEAVENH MPEVIVIDEI GTELEALAAR TIAERGVQLV GTAHGNQIEN LIKNPTLSDL VGGIQAVTLG DDEARRRGSQ KTVLERKAPP TFEIAVEMME RQRWVVHESV ADTVDTLLRG RQANPQTRSM DENGKVAITR QLSVVQGRGG NLASEEESLP AVKQVNGWRS SGQMVALPPL SIERERVTGR SEFDRLLDES FNYSSDSIDF SHTKSAGPNG EDLPLHIYPY GVSRHQLEQV ISVLTLPVVL TKDIDSGDAI LALRSHVKNH AKLKQIAKAR HLPIHVIKSS TIPQITRGLR RLLNIDDPEI GDDRELQLLL HSGSDDEIDA LEEARLAVEQ IVIPKGQPVE LLPRSPQVRR MQHELVEHYR LKSNSFGEEP NRRLRIYPA
|
| |