Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_1696 |
Symbol | |
ID | 9339489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 1758379 |
End bp | 1761432 |
Gene Length | 3054 bp |
Protein Length | 1017 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_003720970 |
Protein GI | 298490793 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTCCC TTTTATACTC TTCATCCCCA GCAGTGAATC TTTACCCCTC GGAATTATTC TTGCGTCATC GTCTACAGGT AGTAGAGGAA TTGTGGGAGT CAGTTCTTCG GCAAGAATGT GGTCAAAAGA TGGTAGATCT ATTGGGACAA TTGCGCGATT TGTGTTCCCC AGAAGGACAA GCTACTCATG ACCAAACCGC CTCTGCTGTG GAGTTAATTG AACAACTGAA TATCAACGAG GCTATTCGTG CTGCTCGTGC TTTTGCTCTT TATTTTCAGT TGATTAATAT CATAGAGCAG GAATACGAAC AAAAGCAGCA GTTAACTCGC TATTCTGATT CAGGCACAAT CAATCAGGAA CATCTCGCCA ATATTATTTA TTCCACTAAC CAAAGAGAAG ACGATTTACC TGTAACTAAG GAACTAGGAG CAGATTCCCT ATCACAAAGT TGGACAGACA CTACGCCAAT TAAACAAAAA GGCACATTTG CGGCATTATT TCCCCTGTTG TTTAAACTGA ATGTACCACC CCAGCAAATT CAACGGTTGA TTTCTCAACT AGATATTCGC TTGGTTTTCA CGGCGCACCC CACGGAAATT GTCCGTCATA CGATCCGAGA TAAACAGCGA CAGGTAGTAG ACCTCTTGCA ACATCTGGAT AGCTTGCAAA ATCGTTCTGG TGGCTATCCT TGGGAAGCTC AAGAAGTGAA AGAGCGTTTA TTGGAAGAAA TCCGCCTGTG GTGGCGTACA GATGAACTGC ACCAGTTCAA ACCAACGGTG CTGGATGAAG TAGATTATGC TCTGCACTAT TTCCAAGAAG TCTTATTTGA TGGTATTCCC CAACTGTATA AACGTCTCAA ATATTCCCTA GAACAAACAT TTCCTTGGTT AGAACCACCA AGTAAAAATT TCTGTTCCTT TGGTTCTTGG GTAGGTTCAG ATAGGGATGG AAATCCGTCA GTGACACCAG AAGTTACATG GAAAACAGCT TGTTATCAGC GGAAAATGGT GTTGGGAAGA TATATTCAGT CGGTGAAGCA GCTGATTGAA TTATTAAGTG TGTCCATGCA GTGGAGTGAT GTGTTGCCAG ATTTGCTGGA GTCACTGGAG TTAGATCAGT CTACGATGAG TGATGTATAT GATGCTCTGG CGTTGCGCTA TCGTCAAGAA CCATATCGCT TAAAGTTGGC CTATGTGCTG AGAAGATTGG AAAATACACG CGATCACAAT CTGGCTTTAT ATAGTCGAGA AAGACCAGCA AATGAAGATT CCCCCATGTA TCGTTCAGGG GCTGAATTTT TATCAGAACT GCGGTTGGTT CAACGCAATT TGACAGAAAC GGGTTTAAGC TGTCGAGAGT TAGAAAATCT CATATGTCAA GTGGAAATTT TTGACTTTAA CCTGACTCAG CTAGATATTA GGCAAGAATC ATCTCGTCAT TGTGATGCAC TGAATGAGAT TCTCGAATAC CTGCAAGTTT TACCCCTATC TTATAACCAA CTATCAGAAG CTCAAAGAGT GTCTTGGTTA ACTGGGGAAC TGCAAACAAG ACGGCCGTTA ATTCCTGGAG AGTTGCCATT TTCAGAAAAA ACCAATGATG TAATTGAAAC CTTCCGAGTT GTGCGATCAC TACAACAAGA ATTTGGCATC AACATCTGTC AAACTTACAT TATCAGTATG TGCCGGGAAG TCAGCGATGT TTTGGAAGTT CTGCTCTTAG CCAAAGAAGC CAGACTATTT GATCCAGCGA TCGCTGTAGG TTCAATTAGA GTCGTCCCAC TATTTGAGAC TGTAGAAGAC TTACAACGCT CTAGAAGCGT GATGAAAAAA CTTTTTGAAC TCCCCCTATA TCGCGCCTTC TTAGCTGGTG GCTATGAAGC ACTTAACTCC GAAAATACTC CCCCAGATAC CCAACCACCC AACTCTCCAT CTTCACCCAC CCTGAACCCC AACTTGCAAG AAGTGATGCT GGGGTATTCT GACAGTAATA AGGATTCTGG TTTCTTAAGC AGCAACTGGG AAATTCACAA AGCCCAAAAA TCACTCCAGA AAATTGCCGA ACAATATGGC TTACACCTGC GGATTTTCCA CGGACGCGGC GGTTCTGTAG GTCGGGGTGG TGGCCCTGCT TATGAAGCGA TTTTGGCTCA ACCTGGTAAC AGTATTAATG GACGCATCAA GATTACTGAA CAAGGAGAAG TTTTAGCTTC TAAATATTCC TTGCTGGACT TGGCTTTATA TCATGTAGAA ACCATCACAA CTGCGGTAGT TCAAGCTAGT TTGTTGCGGA CAGGGTTTGA TGATATTGAA CCATGGAATG AGATCATGGA AGAATTGTCA ATGCGATCGC GCCAACATTA TCGCGGTCTA ATTTACGAAC AACCCGATTT TATCGACTTC TTCCACCAAG TCACCCCCAT TGAAGAAATC AGCCAACTGC AAATTAGTTC GCGTCCAGCG CGACGACCAT CGGGTAAAAA AGATTTAAGC AGTTTGCGCG CTATTCCTTG GGTATTCAGC TGGACACAAA CCCGATTCTT GTTACCTTCT TGGTATGGCT TAGGTACAGC TTTACAAGAG TTCTTGAACG AACAGCCAGA AGAACACCTG AAATTGCTGC GCTATTTTTA TGTTAAATGG CCTTTCTTCA AAATGGCAAT TTCTAAAGCG GAAATGACCT TGGCAAAAGT AGACATTGAA ATGGCACATC ATTACGTCCA GGAACTATCC AACCCAGAAG ACAAAGCCCA GTTTGATAAA GTATTTGAGC AAATTGCTAG TGAATTTTAT CTAACTAGAG ATTTGGTCTT AAATATCACT GGACACCAAC GACTTTTAGA CGGTGATCCC ATCTTGCAAC GTTCCGTACA ATTACGTAAT GGGACAATTG TGCCATTAGG ATTTATACAA GTTTCTATCC TGAAGCGTTT GAGACAGTAC AAAAACACCA CGACCTCTGG AGTAATTAAC TCCCGTTACA GCAAAGGAGA GTTGCTTAGA GGAGCATTAT TAACCATTAA CGGTATTGCT GCAGGAATGA GAAATACAGG TTGA
|
Protein sequence | MGSLLYSSSP AVNLYPSELF LRHRLQVVEE LWESVLRQEC GQKMVDLLGQ LRDLCSPEGQ ATHDQTASAV ELIEQLNINE AIRAARAFAL YFQLINIIEQ EYEQKQQLTR YSDSGTINQE HLANIIYSTN QREDDLPVTK ELGADSLSQS WTDTTPIKQK GTFAALFPLL FKLNVPPQQI QRLISQLDIR LVFTAHPTEI VRHTIRDKQR QVVDLLQHLD SLQNRSGGYP WEAQEVKERL LEEIRLWWRT DELHQFKPTV LDEVDYALHY FQEVLFDGIP QLYKRLKYSL EQTFPWLEPP SKNFCSFGSW VGSDRDGNPS VTPEVTWKTA CYQRKMVLGR YIQSVKQLIE LLSVSMQWSD VLPDLLESLE LDQSTMSDVY DALALRYRQE PYRLKLAYVL RRLENTRDHN LALYSRERPA NEDSPMYRSG AEFLSELRLV QRNLTETGLS CRELENLICQ VEIFDFNLTQ LDIRQESSRH CDALNEILEY LQVLPLSYNQ LSEAQRVSWL TGELQTRRPL IPGELPFSEK TNDVIETFRV VRSLQQEFGI NICQTYIISM CREVSDVLEV LLLAKEARLF DPAIAVGSIR VVPLFETVED LQRSRSVMKK LFELPLYRAF LAGGYEALNS ENTPPDTQPP NSPSSPTLNP NLQEVMLGYS DSNKDSGFLS SNWEIHKAQK SLQKIAEQYG LHLRIFHGRG GSVGRGGGPA YEAILAQPGN SINGRIKITE QGEVLASKYS LLDLALYHVE TITTAVVQAS LLRTGFDDIE PWNEIMEELS MRSRQHYRGL IYEQPDFIDF FHQVTPIEEI SQLQISSRPA RRPSGKKDLS SLRAIPWVFS WTQTRFLLPS WYGLGTALQE FLNEQPEEHL KLLRYFYVKW PFFKMAISKA EMTLAKVDIE MAHHYVQELS NPEDKAQFDK VFEQIASEFY LTRDLVLNIT GHQRLLDGDP ILQRSVQLRN GTIVPLGFIQ VSILKRLRQY KNTTTSGVIN SRYSKGELLR GALLTINGIA AGMRNTG
|
| |