Gene Aazo_1696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1696 
Symbol 
ID9339489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1758379 
End bp1761432 
Gene Length3054 bp 
Protein Length1017 aa 
Translation table11 
GC content43% 
IMG OID 
Productphosphoenolpyruvate carboxylase 
Protein accessionYP_003720970 
Protein GI298490793 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTTCCC TTTTATACTC TTCATCCCCA GCAGTGAATC TTTACCCCTC GGAATTATTC 
TTGCGTCATC GTCTACAGGT AGTAGAGGAA TTGTGGGAGT CAGTTCTTCG GCAAGAATGT
GGTCAAAAGA TGGTAGATCT ATTGGGACAA TTGCGCGATT TGTGTTCCCC AGAAGGACAA
GCTACTCATG ACCAAACCGC CTCTGCTGTG GAGTTAATTG AACAACTGAA TATCAACGAG
GCTATTCGTG CTGCTCGTGC TTTTGCTCTT TATTTTCAGT TGATTAATAT CATAGAGCAG
GAATACGAAC AAAAGCAGCA GTTAACTCGC TATTCTGATT CAGGCACAAT CAATCAGGAA
CATCTCGCCA ATATTATTTA TTCCACTAAC CAAAGAGAAG ACGATTTACC TGTAACTAAG
GAACTAGGAG CAGATTCCCT ATCACAAAGT TGGACAGACA CTACGCCAAT TAAACAAAAA
GGCACATTTG CGGCATTATT TCCCCTGTTG TTTAAACTGA ATGTACCACC CCAGCAAATT
CAACGGTTGA TTTCTCAACT AGATATTCGC TTGGTTTTCA CGGCGCACCC CACGGAAATT
GTCCGTCATA CGATCCGAGA TAAACAGCGA CAGGTAGTAG ACCTCTTGCA ACATCTGGAT
AGCTTGCAAA ATCGTTCTGG TGGCTATCCT TGGGAAGCTC AAGAAGTGAA AGAGCGTTTA
TTGGAAGAAA TCCGCCTGTG GTGGCGTACA GATGAACTGC ACCAGTTCAA ACCAACGGTG
CTGGATGAAG TAGATTATGC TCTGCACTAT TTCCAAGAAG TCTTATTTGA TGGTATTCCC
CAACTGTATA AACGTCTCAA ATATTCCCTA GAACAAACAT TTCCTTGGTT AGAACCACCA
AGTAAAAATT TCTGTTCCTT TGGTTCTTGG GTAGGTTCAG ATAGGGATGG AAATCCGTCA
GTGACACCAG AAGTTACATG GAAAACAGCT TGTTATCAGC GGAAAATGGT GTTGGGAAGA
TATATTCAGT CGGTGAAGCA GCTGATTGAA TTATTAAGTG TGTCCATGCA GTGGAGTGAT
GTGTTGCCAG ATTTGCTGGA GTCACTGGAG TTAGATCAGT CTACGATGAG TGATGTATAT
GATGCTCTGG CGTTGCGCTA TCGTCAAGAA CCATATCGCT TAAAGTTGGC CTATGTGCTG
AGAAGATTGG AAAATACACG CGATCACAAT CTGGCTTTAT ATAGTCGAGA AAGACCAGCA
AATGAAGATT CCCCCATGTA TCGTTCAGGG GCTGAATTTT TATCAGAACT GCGGTTGGTT
CAACGCAATT TGACAGAAAC GGGTTTAAGC TGTCGAGAGT TAGAAAATCT CATATGTCAA
GTGGAAATTT TTGACTTTAA CCTGACTCAG CTAGATATTA GGCAAGAATC ATCTCGTCAT
TGTGATGCAC TGAATGAGAT TCTCGAATAC CTGCAAGTTT TACCCCTATC TTATAACCAA
CTATCAGAAG CTCAAAGAGT GTCTTGGTTA ACTGGGGAAC TGCAAACAAG ACGGCCGTTA
ATTCCTGGAG AGTTGCCATT TTCAGAAAAA ACCAATGATG TAATTGAAAC CTTCCGAGTT
GTGCGATCAC TACAACAAGA ATTTGGCATC AACATCTGTC AAACTTACAT TATCAGTATG
TGCCGGGAAG TCAGCGATGT TTTGGAAGTT CTGCTCTTAG CCAAAGAAGC CAGACTATTT
GATCCAGCGA TCGCTGTAGG TTCAATTAGA GTCGTCCCAC TATTTGAGAC TGTAGAAGAC
TTACAACGCT CTAGAAGCGT GATGAAAAAA CTTTTTGAAC TCCCCCTATA TCGCGCCTTC
TTAGCTGGTG GCTATGAAGC ACTTAACTCC GAAAATACTC CCCCAGATAC CCAACCACCC
AACTCTCCAT CTTCACCCAC CCTGAACCCC AACTTGCAAG AAGTGATGCT GGGGTATTCT
GACAGTAATA AGGATTCTGG TTTCTTAAGC AGCAACTGGG AAATTCACAA AGCCCAAAAA
TCACTCCAGA AAATTGCCGA ACAATATGGC TTACACCTGC GGATTTTCCA CGGACGCGGC
GGTTCTGTAG GTCGGGGTGG TGGCCCTGCT TATGAAGCGA TTTTGGCTCA ACCTGGTAAC
AGTATTAATG GACGCATCAA GATTACTGAA CAAGGAGAAG TTTTAGCTTC TAAATATTCC
TTGCTGGACT TGGCTTTATA TCATGTAGAA ACCATCACAA CTGCGGTAGT TCAAGCTAGT
TTGTTGCGGA CAGGGTTTGA TGATATTGAA CCATGGAATG AGATCATGGA AGAATTGTCA
ATGCGATCGC GCCAACATTA TCGCGGTCTA ATTTACGAAC AACCCGATTT TATCGACTTC
TTCCACCAAG TCACCCCCAT TGAAGAAATC AGCCAACTGC AAATTAGTTC GCGTCCAGCG
CGACGACCAT CGGGTAAAAA AGATTTAAGC AGTTTGCGCG CTATTCCTTG GGTATTCAGC
TGGACACAAA CCCGATTCTT GTTACCTTCT TGGTATGGCT TAGGTACAGC TTTACAAGAG
TTCTTGAACG AACAGCCAGA AGAACACCTG AAATTGCTGC GCTATTTTTA TGTTAAATGG
CCTTTCTTCA AAATGGCAAT TTCTAAAGCG GAAATGACCT TGGCAAAAGT AGACATTGAA
ATGGCACATC ATTACGTCCA GGAACTATCC AACCCAGAAG ACAAAGCCCA GTTTGATAAA
GTATTTGAGC AAATTGCTAG TGAATTTTAT CTAACTAGAG ATTTGGTCTT AAATATCACT
GGACACCAAC GACTTTTAGA CGGTGATCCC ATCTTGCAAC GTTCCGTACA ATTACGTAAT
GGGACAATTG TGCCATTAGG ATTTATACAA GTTTCTATCC TGAAGCGTTT GAGACAGTAC
AAAAACACCA CGACCTCTGG AGTAATTAAC TCCCGTTACA GCAAAGGAGA GTTGCTTAGA
GGAGCATTAT TAACCATTAA CGGTATTGCT GCAGGAATGA GAAATACAGG TTGA
 
Protein sequence
MGSLLYSSSP AVNLYPSELF LRHRLQVVEE LWESVLRQEC GQKMVDLLGQ LRDLCSPEGQ 
ATHDQTASAV ELIEQLNINE AIRAARAFAL YFQLINIIEQ EYEQKQQLTR YSDSGTINQE
HLANIIYSTN QREDDLPVTK ELGADSLSQS WTDTTPIKQK GTFAALFPLL FKLNVPPQQI
QRLISQLDIR LVFTAHPTEI VRHTIRDKQR QVVDLLQHLD SLQNRSGGYP WEAQEVKERL
LEEIRLWWRT DELHQFKPTV LDEVDYALHY FQEVLFDGIP QLYKRLKYSL EQTFPWLEPP
SKNFCSFGSW VGSDRDGNPS VTPEVTWKTA CYQRKMVLGR YIQSVKQLIE LLSVSMQWSD
VLPDLLESLE LDQSTMSDVY DALALRYRQE PYRLKLAYVL RRLENTRDHN LALYSRERPA
NEDSPMYRSG AEFLSELRLV QRNLTETGLS CRELENLICQ VEIFDFNLTQ LDIRQESSRH
CDALNEILEY LQVLPLSYNQ LSEAQRVSWL TGELQTRRPL IPGELPFSEK TNDVIETFRV
VRSLQQEFGI NICQTYIISM CREVSDVLEV LLLAKEARLF DPAIAVGSIR VVPLFETVED
LQRSRSVMKK LFELPLYRAF LAGGYEALNS ENTPPDTQPP NSPSSPTLNP NLQEVMLGYS
DSNKDSGFLS SNWEIHKAQK SLQKIAEQYG LHLRIFHGRG GSVGRGGGPA YEAILAQPGN
SINGRIKITE QGEVLASKYS LLDLALYHVE TITTAVVQAS LLRTGFDDIE PWNEIMEELS
MRSRQHYRGL IYEQPDFIDF FHQVTPIEEI SQLQISSRPA RRPSGKKDLS SLRAIPWVFS
WTQTRFLLPS WYGLGTALQE FLNEQPEEHL KLLRYFYVKW PFFKMAISKA EMTLAKVDIE
MAHHYVQELS NPEDKAQFDK VFEQIASEFY LTRDLVLNIT GHQRLLDGDP ILQRSVQLRN
GTIVPLGFIQ VSILKRLRQY KNTTTSGVIN SRYSKGELLR GALLTINGIA AGMRNTG