Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_1698 |
Symbol | |
ID | 9339491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 1762135 |
End bp | 1765161 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003720972 |
Protein GI | 298490795 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.967311 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAT TATTACTAGA TAGCCAGCAC ACCTTAGTAC AATGGGTAAG CCAAGCAACA GGAATCAACA CTTTCGGGGT GAAAGTCCGG TTGTTGGGAA ATGACCTACA TATCCTTTGT GAAGATACCG ACTGTCCACA ACGTTGGCTA ACTCTGTCTG ACTTGCTACA AGCACTACAG CAAACAGATT TGGATAGCTT AACCAGTGAA GAACAATCCC CAATATACCA AGTATTGGTT TACGGGCGAA AAAAAGGAGA ACATCGTCCT CAATGGTGTC ATCGGGTCTA CTTGAATCAG TTAGATCGGC ATCTAGAGCA ACTACAGCAG GCACTGTTAA CAGAAGCAGC CAAATCTAAA CCCATTGGGG TAAGTGGAGC TCTGATTGTC TCTAATGAAA GTTTGGCACG TCAGGGCAAC CCGGATGCTA TTGCCCGCTA TCTTAGCGAA AACCTCAGTC AATTAGGTGT GGCAGTCCAA GTTAAGAATC GACCCTATGA TTCCAAAGCT AACTCTCAAA AAATAGGTAA TCGTCTGTGG ATTTTTTGTC AGTCCAGTTA TAGCCCTGAT GCGACCTTAT TAGCAGAACC AATAGCCCAA CAGTTGAGAC ATCTCAAGCT TTTTGGCTAT GAGGATGCAG TCATTGTTTC TCAAGTCCGG GGTGAGCATG ATCCTGATTG GCGGTTGCGC GTGGATTTGA CACCACCAGA AACAATGCTC AAAGAATGGG CGCGTTGGGG AGATGTGCAA GCGATCACTC TATTATTAAC TGAGGTCTTG TCTGATTTTC AAATTTCCGT CCAAGCTACT CTTAAAGAAT TTACTCTGCA CATTTTTTGC ACCCCAGCAG TTGACTTGTT AAAAACTGGT CCTGATCCTG ACAAGGAAAC TTGTTTACCA GTTATTCTGT CTCAGCTAGA AGCGATCGCA CCTCAAGGTA TTCTCGCCGC TGCTATCTAT GGACAAAAAA CAGGTGATTT ACAACCCGCT TGGGTAGATT GGGCATCTCT ACCTGCTTCA ATACATCCGG CTTTAGCAAC ATCGACACTA GCATTAGGTA GTTCTGGTGA CGAACCAGCC CTAATTTTCT TACTAGAACG TTTGCTCAAT CATGACATGA ATTGGCGGTT AAAAACAGGT GGTATTCGCG TTTTACTACT TCGCAAAGGT GATTTACTCC AGGTCATGTG TGATGCACCC AATTGTCCCA CACGCCAGCA AGTAGCCACC AAAGTCAGCG AATTTATTCG CCAACTTAAC ATTATTGGTA TTGTTGGTGT GAGAGTTTAC GGTCGTCGTG CCGGTGATAA TGAACCTGTT TGGCATCACG GACTCGATTT TGAACAACGC CACCGCTTAG TGCCAGAAGC AACACCAGAA TTTGCCGCTA CTGCTCAATA TGTTGATGAT TTGCTCACCA ACGAAACCGA CGAACCAATT GTGCGCCCTG ATTTAACAAC CCAAGAAGTT CAAAGTTTCG TGACTGAAGC TGCACAACAT TGGGTAGAAA GGGCTCACAA ACAAGTCAAA AATTTGTTAG TGGGAACTCA ACTGTTTGCA GAAACTAACC AATCAATTGA ACAAAACCCT AATGAACAAG GAGTAGCGGG AAGTTTAATT TGGGTAGCTT TGGGATTAAT GCTCACACTC CAAAGTGATT GGATGTTGGG TTACGTCATT ACTAGTATTC AGAATTCACC CAAAGGTGCT AGTATTTTAT CCCGATCATC CTCACAGACC CAGGTATCTT TGACATTAGG AGCAGAACCG AAAAGCACCG CATTTTTCAC CAGTAGTAAC AATATAAAGT CTTCTGAGTC CAGTAATTCT GCCTTTAATG CTTCTGGTTT TACCCAAAAT GATAGGGCTG GAAATTTACC AGATGCACCA TTGAAACAAA AAGCTAATTC CACTGCTATT CTTTTAGCAG CACGCTCTCA AATGCCAAGC TTTGGAGCAA GGCAACTAGA CGAACAATTA GCACTTTATA AAAAACGTTT AGCAACAACA GGCAAACCAC CAGATATTTT AATTATTGGT TCATCTCGTG CCTTGCGAGG AGTTGATCCT GTTGCCCTTT GTAAATCCTT GTCAACTCAA GGTTATGGCG AAATTGATGT CTTTAACTTT GGTATTAATG GTGCTACAGC CCAAGTTGTA GACTTGATTA TTCGCCAAGT TTTACAACCA TCAGAATTAC CAAGAATTAT TATTTGGGCA GATGGTTCTA GGGCTTTTAA CAGTGGTCGT GAAGATATCA CCTTTAACTC TATTGCCGCA TCAAAAGGCT ATCAAGAACT TTTAAAAAGA GCTACAGAAC CAGAAAATAA CAATAAATTA TCCCAAGCAA CAACAAACAA TAAAACAGAA GATAAAAAAC TCACCAACAA AACACCAGAT ATTAATAGCT ACGAAGCTGC TAATGAATGG TTAAATCAAA TTTTAGTCGG TAGATCTGCT ACCTACAGAA ACCGTGATCA CATTAAAACT CTATTGCAAA AACAATTACA CTATCTTCCA TTTAGTAAAG ACTTTCAGCC AGTTAAATCA AACAACAAAT TAAGAAACGA TCCACAAGAA GATAATACTC AATTATCAGT TGATTTTGAT GGCTTTTTAC CCCTAACTAT TCGTTTCAAT CCGACCACAT ATTATCAAAA ACATCCCAAA GTATCTGGAG GTTACGACAA CGATTATAAA TCTTTTCAAT TAATAGGAAA CCAAGACGTT GCTTTCAGAT CAGTAATTGA GTTTACAAAG ACTCAACAAA TACCAATCGT GTTTGTGAAT ATGCCCCTCA CCACAGATTA TCTAGATCCA GCGCGGACGA AATATGAACA GCAATTTCAA CAATATATGT TAAATTTTTC CACTACTAAT GCCAAGTTTA TCTACCGAGA CTTAAGCCAA ATCTGGCTCA AAGGACATGA ATACTTTTCT GATCCTAGTC ATCTTAACCG CTATGGAGCT TATGAAATCT CCAAAAAGCT GGCTATTGAT CCCATGATTC CCTGGACTCT TAAATAA
|
Protein sequence | MKTLLLDSQH TLVQWVSQAT GINTFGVKVR LLGNDLHILC EDTDCPQRWL TLSDLLQALQ QTDLDSLTSE EQSPIYQVLV YGRKKGEHRP QWCHRVYLNQ LDRHLEQLQQ ALLTEAAKSK PIGVSGALIV SNESLARQGN PDAIARYLSE NLSQLGVAVQ VKNRPYDSKA NSQKIGNRLW IFCQSSYSPD ATLLAEPIAQ QLRHLKLFGY EDAVIVSQVR GEHDPDWRLR VDLTPPETML KEWARWGDVQ AITLLLTEVL SDFQISVQAT LKEFTLHIFC TPAVDLLKTG PDPDKETCLP VILSQLEAIA PQGILAAAIY GQKTGDLQPA WVDWASLPAS IHPALATSTL ALGSSGDEPA LIFLLERLLN HDMNWRLKTG GIRVLLLRKG DLLQVMCDAP NCPTRQQVAT KVSEFIRQLN IIGIVGVRVY GRRAGDNEPV WHHGLDFEQR HRLVPEATPE FAATAQYVDD LLTNETDEPI VRPDLTTQEV QSFVTEAAQH WVERAHKQVK NLLVGTQLFA ETNQSIEQNP NEQGVAGSLI WVALGLMLTL QSDWMLGYVI TSIQNSPKGA SILSRSSSQT QVSLTLGAEP KSTAFFTSSN NIKSSESSNS AFNASGFTQN DRAGNLPDAP LKQKANSTAI LLAARSQMPS FGARQLDEQL ALYKKRLATT GKPPDILIIG SSRALRGVDP VALCKSLSTQ GYGEIDVFNF GINGATAQVV DLIIRQVLQP SELPRIIIWA DGSRAFNSGR EDITFNSIAA SKGYQELLKR ATEPENNNKL SQATTNNKTE DKKLTNKTPD INSYEAANEW LNQILVGRSA TYRNRDHIKT LLQKQLHYLP FSKDFQPVKS NNKLRNDPQE DNTQLSVDFD GFLPLTIRFN PTTYYQKHPK VSGGYDNDYK SFQLIGNQDV AFRSVIEFTK TQQIPIVFVN MPLTTDYLDP ARTKYEQQFQ QYMLNFSTTN AKFIYRDLSQ IWLKGHEYFS DPSHLNRYGA YEISKKLAID PMIPWTLK
|
| |