Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_0599 |
Symbol | |
ID | 9338385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 627259 |
End bp | 630387 |
Gene Length | 3129 bp |
Protein Length | 1042 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | |
Product | translation initiation factor IF-2 |
Protein accession | YP_003720210 |
Protein GI | 298490033 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAACG GAAAAGTTAG AATCTACGAA TTATCAAAGG AATTGAATTT GGATAACAAA GAGCTATTAG CAATTTGCGA CCAGCTCAAC ATTGCGGTCA AAAGCCATAG CAGCACCATA TCAGAATCTG ATGCTGAAGA AATTCGTACA GCTGCGCAGA AACTCGCAGC TACGAGTGTC ACGCCCAAAA AGGAATTAGG TACAAATAAC CATAAACCAA ATTCACTTCA AACTGGCGGA CGTAACCGAC CTGCTGCAAC CCATAAACAA CAAATTTTGG AAATTCGCAA ACCCAAAATA TTGAGAAACA CTACCTCACG CCCTGATAGC GATTCAATTA CTACCAATAA CCAAGTTCCT TCGTCCGATA GCTTACAAGA CGGAGCCTTA GTTAATTCTC CCTCTCCTCC AAGGCCTTTC GCTACACAAG TCTCACCCAT GAAGCCGACG GCACCAACTC GACCTGTACC CCGGAATCAA TCGGAGACCC CGCAGAAACC CAAGATTCCG GAAACGGATA AGACCACAAA TCCAAATCCT CCAGAATCTG AAAAAATAGC AGCGGAAAAA CCAGAAAAAA CAGTTTCTCC CAGGCCAAAA CCAGAAAAAA CTCCAAAACC ACAACTGGTA GATCCACCCT CAAGACCAGC AGAGGAAACA GAGCCTGTAA GTGAACAGCT ATCTCAGGCA GTAAAACCCA TCCTCAAACG CGAGCGTCTA AAGCGTGGAG ATGAAGAACG TGAACAAGGT AAGCCAAAAG TAGGCAAACC AAGCACAGAG CAAAGAGCGC GTTCAACTCC ATCATCTCCC ACCAGACCAG AACCGAGAGG AAACAGACCA TCTCCTCTAG GAACATCATC TATAGATGTA CAACGCCCCA GACCATCTCG CCATAGTGAA CCTCCAGCTG CTCCAGTTGC TACCCCACCA AGACAGATAG CAGGAGTAGG GGCAAAAAAA GAGGGATTAG ATGATACCCC AATCACACCT GATCTCCTTG ATTTAAAACG CCCATTGCCG CCTCGAATAG CCAAAGGTGG GAAAAAATGG CAAGAAGAGG AAATAATTGA CGAGATTAAA GAAAAGGCAA AGGTTGGACC AAAAGGTAAA CGAGCTAAAC CCATCCTAGA TGATGATTTT GAAGAAGACG ACTTGCTTGA TGAAGATGGG TTGGCAATTC CTGCCACCGT CCAAGTAAGC CTTTCCATTG CTCGTCCTTC TAAACCCAAG GTGACTAGAT CTCCACAGCA ACCAACCCTA GCTGCGTACG CACCAACTAC TAAAAATAAA AAATCGGGTT CATACCGTGA CCAAAACCGT CGTCAACAAG AAGTAGAAGT CAAGCGTGAG CGTCCAGAAA AACTGATCGT CACAGGACCC TTGACAGTAC AGGAGTTGGC TGAAGGCTTA GTGATGGCCG ATACAGAGAT CGTCAAGATC CTGTTTATGA AAGGCATGGC GGTAAGTATT ACCCAAAACC TGGATATTCC GACCATTACC TTAGTAGCTA AAGAATTAGA AGTAGAAGTT GAAACGGCAG AACCAGAAGC AGAAGCCCGC AAAGTCACAG AAATGATTGA CATCGCGGAT CTCGAACACC TGATTCGTCG TCCACCAGTC GTGACAATTA TGGGTCACGT AGACCACGGT AAAACTACTC TGCTCGACTC AATTCGCAAA ACTAAAGTAG CGGCTGGTGA AGCTGGTGGT ATTACCCAAC ACATTGGTGC ATACCATGTG GATCTAGAAA ATGAGGGCAA ACAACAACAG ATTGTTTTCC TAGATACCCC CGGTCACGAA GCCTTCACAG CGATGCGAGC TAGAGGCGCA AGGGTGACAG ACATTGCTAT TTTGGTAGTT GCCGCAGATG ACGGAGTGCG TCCGCAAACA GTGGAAGCTA TCAGCCATGC TCAAGCCGCA GGTGTGCCAA TTGTTGTCGC TATTAACAAG ATTGACAAAG AAGGCGCACA ACCAGAGCGC GTTAAACAAG AACTAACCAA TTATGGTTTA ACCGCAGAAG ATTGGGGTGG TGAAACCATC ATGGTTCCTG TGAGTGCCAT CAAGGGAGAA AACCTAGATA CACTCCTAGA AATGGTTCTT CTTGTAGCAG AAGTAGGAGA ACTATCTGCC AATCCAAATC GCGTCGCTAA GGGAACAGTT ATTGAAGCCC ATTTGGATAA AGCTAAGGGT GCAGTTGCTA CCCTGCTGAT TCAAAATGGG ACTCTCCATG TTGGAGATAT GTTAGTAGCT GGCTCGGCAT TTGGTAAAGT CCGGGCAATG GTAGATGATA GAGGTAAGCG TGTAGAAGCT GCAAGCCCAT CCTTTGCTGT TGAGGTATTA GGTTTAAGTG ATGTACCTGC AGCAGGTGAT GATTTCGAGG TGTTCGCGAA CGAAAAAGAA GCTCGCTCCC TCGCAAGTGA TCGCGCCGAC AAGCAACGCC AATCCCGCCT GTTACAAGGA AGAGTCACAC TGACAACCCT ATCAGCTCAA GCACAAGAAG GCGAATTGAA AGAACTTAAC TTGATCTTGA AAGGAGATGT CCAAGGTTCT GTAGAAGCCA TTATCAGCTC TCTCAAGCAA ATCCCTCAAA ACGAAGTACA AATTCGGATG TTGTTGGCTA CTGCTGGAGA AATCACAGAA ACAGATATAG ACTTAGCTGC CGCTAGTAAC GCCGTCATTA TTGGCTTCAA TACCACCTAC GCTAGTGGTG CCAGACAAGC TGCCGATGAA GCAGGTGTAG ACGTGCGTGA ATACAACATC ATCTACAAAC TCCTAGAAGA TATCCAAGGA GCTTTGGAAG GTCTATTAGA ACCAGAGTTG GTAGAAGAAC CTCTAGGACA AACAGAAGTA CGTGCTGTCT TCCCTGTTGG TCGTGGTGCA GTAGCTGGTT GTTATGTACA ATCAGGCAAA CTGCTTCGCA ACTGCAAAGT GCGTGTACGT CGTGGCAATA AAGTGGTCTA CGAAGGCGTT CTTGATTCCC TCAAACGGAT GAAAGAAGAT GTCCGCGAAG TCAATTCTGG TTATGAATGT GGTATCGGTA TTGATAAGTT CCATGACTGG GCTGAAGGTG ACATCATCGA ATCCTACCAA ATGGTAACTA AACGTCGTAC TCTTACATTA ACTAAGTAG
|
Protein sequence | MNNGKVRIYE LSKELNLDNK ELLAICDQLN IAVKSHSSTI SESDAEEIRT AAQKLAATSV TPKKELGTNN HKPNSLQTGG RNRPAATHKQ QILEIRKPKI LRNTTSRPDS DSITTNNQVP SSDSLQDGAL VNSPSPPRPF ATQVSPMKPT APTRPVPRNQ SETPQKPKIP ETDKTTNPNP PESEKIAAEK PEKTVSPRPK PEKTPKPQLV DPPSRPAEET EPVSEQLSQA VKPILKRERL KRGDEEREQG KPKVGKPSTE QRARSTPSSP TRPEPRGNRP SPLGTSSIDV QRPRPSRHSE PPAAPVATPP RQIAGVGAKK EGLDDTPITP DLLDLKRPLP PRIAKGGKKW QEEEIIDEIK EKAKVGPKGK RAKPILDDDF EEDDLLDEDG LAIPATVQVS LSIARPSKPK VTRSPQQPTL AAYAPTTKNK KSGSYRDQNR RQQEVEVKRE RPEKLIVTGP LTVQELAEGL VMADTEIVKI LFMKGMAVSI TQNLDIPTIT LVAKELEVEV ETAEPEAEAR KVTEMIDIAD LEHLIRRPPV VTIMGHVDHG KTTLLDSIRK TKVAAGEAGG ITQHIGAYHV DLENEGKQQQ IVFLDTPGHE AFTAMRARGA RVTDIAILVV AADDGVRPQT VEAISHAQAA GVPIVVAINK IDKEGAQPER VKQELTNYGL TAEDWGGETI MVPVSAIKGE NLDTLLEMVL LVAEVGELSA NPNRVAKGTV IEAHLDKAKG AVATLLIQNG TLHVGDMLVA GSAFGKVRAM VDDRGKRVEA ASPSFAVEVL GLSDVPAAGD DFEVFANEKE ARSLASDRAD KQRQSRLLQG RVTLTTLSAQ AQEGELKELN LILKGDVQGS VEAIISSLKQ IPQNEVQIRM LLATAGEITE TDIDLAAASN AVIIGFNTTY ASGARQAADE AGVDVREYNI IYKLLEDIQG ALEGLLEPEL VEEPLGQTEV RAVFPVGRGA VAGCYVQSGK LLRNCKVRVR RGNKVVYEGV LDSLKRMKED VREVNSGYEC GIGIDKFHDW AEGDIIESYQ MVTKRRTLTL TK
|
| |