Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_2133 |
Symbol | |
ID | 9339928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 2212987 |
End bp | 2215698 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003721277 |
Protein GI | 298491100 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0029443 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGCAAC AAGAAAAAAA AGATGGTGTA ATCGTTAATC TGGTATTATC GCTGGCCTTA GCTACCAGTG GTGTGGTAGC CAACTTATTT GTGTTAGCTC CTACACAGGC AGAATTAAAA TCTGATTTTA CTACTTTCCC GCTGCCGCAA ACAGTGGAGG ATGGAATCAA AGTGCGAATT GATGGTTCTG CGAGTTTGGT CATGATTAAC CAAAGCCTAA AAGATACATT TGAGAACCAA TTTTCTGGTA CACAGATAGA AGTGGGGGTG AATAGTGCTG ATGCTGATGC TGCACTCAAG ACTTTGCTAG AAGGCAAAAT TGATATAGCT GCCATCGCAC GAGACCTAAC TCCAGCAGAA AAAGCTCGAG GTTTAGAACA AGTCCATTTG CACCGAGAAA AAATAGCCAT CATCGTTGGT GCAAATAATC CCTTTCCAGG AAGTTTGACT CCTGAAAAAT TTGCTAAAAT TTTTCGAGGC CAAATTAAGG ACTGGTCAGA ACTAGGAGTT CAATCTGGTA AGATCCGGCT AATTAATCGA CCCTCAACAA GCAATACTCA TAATGCCTTT CGTGATTATT CAGTTTTTCA AACTGCTGAG TTTCCTACAC GAACGAATCC GACTCAAATA GCTGAAGATA AAACTGCCCA AATTATCCAA CAATTAGGTA CAGATGTCAT TAGCTATGTT ATAGCTAATC AAGTATCAAA GCTACTAGAT GTGCGAGTTC TGAAAATCAA GGGAGTTACA CCAGGTAATT CTCAATATCC ATTTTCTCAG CCTTTGGTTT ACGTTTACAA GCAAAATCCC AACCCAGGAG TAGTTGGTTT TCTGAGTCTT ACCCTTGCAC CTGTAAGAAA AAAGGCGCTA GAACCTCCTA GAGAAGCTGA AGCCTCTGCG ATCGCAGCCA GTTCTTTACA AAGTGTTAAT CGAGAAACTT TAACAACTTC CTCATCAAAA CCTCAACCCC TACTAACTGT TGCACCATCT GAAAATTCCA CTATAAGTAC CACTCCACAA TCACAAACTC CAATCAATAC TCTTGGTTCT GGTAATGAAC AACAGTTTGT GAGTCCTCTG GAAAATGATC CGCTTGAGGA TAAGAACGTC ATACTCTTAA TAGTTTTATC GCTATTGCCA ATTTTTGGTT TAGGTGGGTT TCTAACTTGG TGGTTCAAGA GAAAACTGCG ATCAGTAGAT GAAAAAACAG ATAACTTGGA AACATTAATT TCCAGCACAT CTACCACAGA AACAATATCA ATCACACCAG ACGATTACAG CATTCTTCCC TATCTTGAAA ATGGTAGCTG CACTAATGGA ATATCGCATT TAAATCAAAC TACAACTACA ACTTCAATGC TATCCGACAA GGAATATCTT AACCTTACTC AAGAAGATAA CCTAACAGGT AATTTATTAA CAGCAATTGT CACAGGAACA AATACCAGAG TAGATCATAC AAATGATTTT GACATTCCTA CTGAAACAAT AGCTGTTGAT TGTGGTGAAG TAGTATGGGA TACAGAAGCG CCAGTGGCTG TTGTTAATAC ACCTTACCCA TCAGTACCCA GAATTTCAGG AATTACATTT GATGTAGAAC TGCTAACTTA CGAATTAACC ACTTCACTAT CAGAATTACT AGATAACCCA GCAGCGCCAT TTCATCAAGA TACCACTACT CCACTATCAG AAGTAATAGG TTTTCCACCA ATTTCATCCG ATGCTGACTC TAGTACTTCA CTCTCAGAAT TACTCGGTAT GGCAGCAACC TCTCTTGATA CTGATTCCAG TAATACTCCA CTAAAATTAC GTCCTGTATC TACAAAAGAG CCTATTACGT CCCTATCAGA ATTATTGGGC TTACCACCAG AAACATTAGA TTTAGATATA GCACTGAGCA AAGATGAAAC AACAAGTTCA CTACCTGAAC TATTAGATGA GTTAGGAGAT TTATTCAACA ACTTAGCAGA GGCTGAACTC AAAATTGATC TGACACCAGA AGAGTTTTCC TCAGACTTGT CTATTTCATC AATGTTCTCA GAAGAGACTA TTGACTATGC AATCTTGAAA ACAGATGCAA AGATAGAAGT TTCATCAGAG TTAAAAATAC GAACTAACAT CACAGAATTT GCTAGTTTCT TGGATATAGA CACAGATAGC AGCATTGTCT TCACACCCCG TACACCTAAG TGGGCTTATG TTTCTTGGTA TGTTTCAGAA ACTCACAAAG AAGTACTGCG AAAAAAAGGA GGTCGTCTTT TAGCAGTGAG GCTTTATGAT GCTACTGACA TTGATCTGAG TTATCAAACA CCCCAACTAG TTCAGCAGTA TGAATGTGAG GAAGCAACTT GCGATCGCTA TATAGATATT CCCACTAGCA ATCGTGATTA CATAACTGAA ATTGGCTATA CAACAGATAA TAATTGTTGG TTAGGTATAG CTCGTTCAGG TACTATTCGG ATCTTCAATC CTCCTAGTGA AGATTTCTGG TTTGTCACAG ATACAGAACT AGTTATTCAT GGATCTACCG AACCAGGAGC AAAAGTGACT ATTGATGATC ATGAAATTGA AATTCAACCT GATGGAACCT TCAATTTCCG TGTTCCCTTC TCCAATAGTT TACTTCAATA TCTGATGACA GCAACTGCTG CTAGGGGAGA ACAAACTATC ACCATCCTCA AGAAGTTTTC CCAGGAAAAT CCAGAAGATT AA
|
Protein sequence | MWQQEKKDGV IVNLVLSLAL ATSGVVANLF VLAPTQAELK SDFTTFPLPQ TVEDGIKVRI DGSASLVMIN QSLKDTFENQ FSGTQIEVGV NSADADAALK TLLEGKIDIA AIARDLTPAE KARGLEQVHL HREKIAIIVG ANNPFPGSLT PEKFAKIFRG QIKDWSELGV QSGKIRLINR PSTSNTHNAF RDYSVFQTAE FPTRTNPTQI AEDKTAQIIQ QLGTDVISYV IANQVSKLLD VRVLKIKGVT PGNSQYPFSQ PLVYVYKQNP NPGVVGFLSL TLAPVRKKAL EPPREAEASA IAASSLQSVN RETLTTSSSK PQPLLTVAPS ENSTISTTPQ SQTPINTLGS GNEQQFVSPL ENDPLEDKNV ILLIVLSLLP IFGLGGFLTW WFKRKLRSVD EKTDNLETLI SSTSTTETIS ITPDDYSILP YLENGSCTNG ISHLNQTTTT TSMLSDKEYL NLTQEDNLTG NLLTAIVTGT NTRVDHTNDF DIPTETIAVD CGEVVWDTEA PVAVVNTPYP SVPRISGITF DVELLTYELT TSLSELLDNP AAPFHQDTTT PLSEVIGFPP ISSDADSSTS LSELLGMAAT SLDTDSSNTP LKLRPVSTKE PITSLSELLG LPPETLDLDI ALSKDETTSS LPELLDELGD LFNNLAEAEL KIDLTPEEFS SDLSISSMFS EETIDYAILK TDAKIEVSSE LKIRTNITEF ASFLDIDTDS SIVFTPRTPK WAYVSWYVSE THKEVLRKKG GRLLAVRLYD ATDIDLSYQT PQLVQQYECE EATCDRYIDI PTSNRDYITE IGYTTDNNCW LGIARSGTIR IFNPPSEDFW FVTDTELVIH GSTEPGAKVT IDDHEIEIQP DGTFNFRVPF SNSLLQYLMT ATAARGEQTI TILKKFSQEN PED
|
| |