Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4355 |
Symbol | |
ID | 9342160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4433353 |
End bp | 4436931 |
Gene Length | 3579 bp |
Protein Length | 1192 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | |
Product | TPR repeat-containing protein |
Protein accession | YP_003722816 |
Protein GI | 298492639 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.862224 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGCAAA TGCTCAGGTG GCTATTGCAG TGGCTTAACA GGTTTATGAA GTATCCTTTT AACAGTCGTC GGACTCATGA TGCCAACGAT AAAAAGGGTC ATCAGGTGGT AGAGTCTCTA CCTGAATTAA CCAATGCTGA TCTAGAAGTA TTGTTTAACG AGCTGCTTGA AGGTGTGCAT CAGGCGCGGG GAAAACAATG GGCGCTGAAG TATTTGCAGC GGATGGAACC ACGAATTACT GTTGAACGTT GGATAGATTG GCTGCTGATA TTTGGTGAAA GATTGTTATC TTCACCTGCA CCAAATAGTC AGATAGCAAC GCGAATGGTT AAATTGGGTG AACTCGATGT TGGTAAAGTT GGTGAACTTG CATCAGAAAT TGGTGTCCAG CTATTGAGTC GTGAGTTTTT AGCCCAAAGG TATGTCGAGA ATCCACAAGC AGTGGAGGTG GAAACAGCAG TAGCAGTAGT GGCAGATACT CCTGGGCAAC AATTACTGCG GGACTTTGGT GAAGAGTTAT GGGAGCATGA TCAGGAAGAA CCGGTGAACA ACATCACACC AGTATCCTTA AGCTTGGAGG AATCATGGAC AGGGAACTCA GAACAGTGGA GTGCTCAAAA TTTAGAACAA CATACATCTG AGGCTGTTAT TTTTGAGTAT GTCCAGGAAG ATAATCCCAC TGCGATTGAG GAAGTAGAAT CTGTTAGTGA TTCTTCTTTG GCAGAGAATT GGGATCAGTG GCTGGTAAGT TTAGAGCCCA AGGTGGCACA TACTCTGGAT GAGTTGGTGG TAAGGTTGGA GCAAAGCACC AATTTAGTCC AGCAACTTGC TTCTGAATTA GCTATTCGCG ATGAACGCCA AGCATTAATT CAGCGTCCTA GTTTTCAAAT AACTGTAACT AATCAAGCTC AAGCATGGTT TTATCAAGGT TTACAGCAAG CTAAATCTGG CGATTTGTTA GGGGCGCTGG CTTTTTATAA TCAAGCTACT AAACTAGAAC CAGAATCGGC TGAGTATTGG TTTAATCAAG CTTTAACCTT ATTCCATTTA AAACGTTTTG AAGAAGCGAT CGCAGCCTAT GACCAGGCGA TCGCACTCAA GCCAGATTTC TTTAAAGCCT GGTATAATCG GGGGGGAATT ATGGTCGAAT TTGGCGACTT TGACGGAGCG ATCACTTCTT TTGACAAGGC TATAGAATTG CAACCCAACT ATCAAGAAGC TTGGTCTAGT AGGGGTTTAG CGCTGCTGAA ATTAGGACTG ATTTGGGAAG CAATTTCCAG TTATGACCAA GCCTTGGAGT TACAACGCCA AGACCAGGAA ACTTGGTATT ATCGAGGAGT TGCCTTGGCT GTAGGAGAGC AATATGAAGA TGCGATCGCC TCATACAACC AAGCTATAGA AATTCAACCA GACTATCACG AAGTTTGGAT TGACCGGGGA GTAGTCTTAT TTAACCTCAA GCGCTGGTCA GAAGCCATTG AATCCTGGGA TCAAGCCCTC TCTATTCAAC CAGACTTTTA CTTAGCCTGG TATAACCGAG GCATCGCCTT GGAAAACCTA GCCAGACGAG AAGAAGCCAT TACTTCCTAT CAAAAAGCGA TCACCATTAA ACCCGACTTT CACCCTGCAT GGTACAACCA AGCCGTAGCC TTCTATTATT TAAATAGATT TGCAGAAGCC ATTTCTTGCT ATGACAGCGC CTTAGAAATC AAACTAGACT ACTGGGAAGC TTGGCTTGGT CGTGGTGGAG CCGTTGGTAA CTTAGTCAAT GACAAATTCT CCCTGAGTTT ATCCAGTACT ATAGCCGCAT CCAATCCCAA TCTAAATCAG CTTGGCTATG AGGGCAAATT AGCCACATAT CAAGAAGGCT TTAAATATCT TCGTCCAGAT ACTCACCCCG AAGGTTGGGG AAGATTGCAT CTAGCTGCTG GTAATACACA TTACGAACAC GGCAAAAAAC AATCCACACC CCGCTATTTT TGGCAAAAAG CCGTATCTGA ATACCAACAG GCACTCTTAA CCCTCACAGC CGAAGATTTC CCAGAATTAC ATCTTGATGG TTTACAATCT CTCACCAAAG TGCTTATCCG TTTGGGACAA ACAGTAACAG CCCAAGAATT ACACCAACAT GGACTAGGCT TATTACAACA ATTACTTAAT CAAACAACCC GTCCTGAACA CAGTAAGAAA CAACTAGCTT TAAAATTTGC GGGTTTAAGA CAAATGGGAG TTGACTTAGC AGTCAATATA GGTGATTTAG TAGAATCTTG GGAAATTGCC GAACATTGTA AAAATACCTG TTTAAAATTA CTGCTTTCTG ATTGCCATGA TGAAATTTAC TCTCCCAACT ATGAAGCCAT TCAACCTCTA CTAAATTCTA CAACAGCCGT GATTTACTGG CATCTTAGTC CAGCAGCCTT ACACACCTTC ATTATTAAAC ATGAAGCTCC TTCACCCATA CTGTTATTAA CACCAATACA AGATATAGAA GCTATACCCG AAGCCGTGCA ACGTCTAGTT GAATTTGAAA ACTGGCTAGA AGATTGGGAA AAACAATATC AAGAATATCG TCAAATACAA GATATAGAAA ATCAGTACAA ACATTCTTGG TGGGTAGATA TAGAACAGAA GTTGTTACAA CTGCAAAACA TCCTCAATAT TTCTACAATT ATTCAAGAAC TTGAAGGTAT CTCCAAACTG ATTGTAATTC CCCATGGTGA TTTACATAAA TTACCTATCC ACGCACTTTT TCCACTCAAT CAGAAAAATT CACTCAATTA CACCATCAAC TATTTACCCA GTATCCAAAT AGGCCTGGAC TTAAAAACAT ATTCATTATC AAATTGGCAA CAGCAAAAGT TCCTCAGTGT TGAAAATGTA GAACATACAA ATGATAGCCA GATAAAATGT GCTGATTTTG CATCGGCAAT TATCAGAAAA ATGTTTGATA ATGCCCAACA TATCCAAGGT TCACAAGTTA CGCAAGATAA TATCGAAAAT GCCTTAGCGG CAGATTACAA TATCTTTCAC TTTACTGGTC ATGCTATCAA CAATTTGAGT GAAGCTCAAA AATCAGCTTT AGTTTTAACA AGTGAAGAAA AACTGACTCT AGCAGAAATT AGTCAACAGA CTTTTAATAC TTACAATCTG TTTACTCTCC CAAATTGTGA AATGGTTAGT AATCACAGTC AGAATATCAA CAGTGAATAT GTGGGTTTAG CAGCTGGTTT ACTAATTCGT GGAGTTCCAG AAGTGTTGAG TACACTTTGG ATTGTTGAAT CATCTGCGAC AGCTTTAGTA ATTATCGAAT TTTATCGCAG ATTACTTTTC CATAAATCTC CAGTTACTGC TTTAGCTGAA GTCACAACTT GGCTGAGAGA TATAACAGTT GGTGAACTAA TCACATGGTA TGAAGATTTA CTCACTAATC TGCATTCAGA TGAAGTTAAA TTGAGGAATT ATGTAATGAT GGAAGTTGAT AAATATCGTC AGTTATCACC TCACAAACAG CCTTATCAGC ATCCTTATTA TTGGGCTGCA TTTATCATTA CAGGATGCGT TCAAGCAGAT GAGAATTGA
|
Protein sequence | MWQMLRWLLQ WLNRFMKYPF NSRRTHDAND KKGHQVVESL PELTNADLEV LFNELLEGVH QARGKQWALK YLQRMEPRIT VERWIDWLLI FGERLLSSPA PNSQIATRMV KLGELDVGKV GELASEIGVQ LLSREFLAQR YVENPQAVEV ETAVAVVADT PGQQLLRDFG EELWEHDQEE PVNNITPVSL SLEESWTGNS EQWSAQNLEQ HTSEAVIFEY VQEDNPTAIE EVESVSDSSL AENWDQWLVS LEPKVAHTLD ELVVRLEQST NLVQQLASEL AIRDERQALI QRPSFQITVT NQAQAWFYQG LQQAKSGDLL GALAFYNQAT KLEPESAEYW FNQALTLFHL KRFEEAIAAY DQAIALKPDF FKAWYNRGGI MVEFGDFDGA ITSFDKAIEL QPNYQEAWSS RGLALLKLGL IWEAISSYDQ ALELQRQDQE TWYYRGVALA VGEQYEDAIA SYNQAIEIQP DYHEVWIDRG VVLFNLKRWS EAIESWDQAL SIQPDFYLAW YNRGIALENL ARREEAITSY QKAITIKPDF HPAWYNQAVA FYYLNRFAEA ISCYDSALEI KLDYWEAWLG RGGAVGNLVN DKFSLSLSST IAASNPNLNQ LGYEGKLATY QEGFKYLRPD THPEGWGRLH LAAGNTHYEH GKKQSTPRYF WQKAVSEYQQ ALLTLTAEDF PELHLDGLQS LTKVLIRLGQ TVTAQELHQH GLGLLQQLLN QTTRPEHSKK QLALKFAGLR QMGVDLAVNI GDLVESWEIA EHCKNTCLKL LLSDCHDEIY SPNYEAIQPL LNSTTAVIYW HLSPAALHTF IIKHEAPSPI LLLTPIQDIE AIPEAVQRLV EFENWLEDWE KQYQEYRQIQ DIENQYKHSW WVDIEQKLLQ LQNILNISTI IQELEGISKL IVIPHGDLHK LPIHALFPLN QKNSLNYTIN YLPSIQIGLD LKTYSLSNWQ QQKFLSVENV EHTNDSQIKC ADFASAIIRK MFDNAQHIQG SQVTQDNIEN ALAADYNIFH FTGHAINNLS EAQKSALVLT SEEKLTLAEI SQQTFNTYNL FTLPNCEMVS NHSQNINSEY VGLAAGLLIR GVPEVLSTLW IVESSATALV IIEFYRRLLF HKSPVTALAE VTTWLRDITV GELITWYEDL LTNLHSDEVK LRNYVMMEVD KYRQLSPHKQ PYQHPYYWAA FIITGCVQAD EN
|
| |