Gene Aazo_5089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_5089 
Symbol 
ID9342897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5217794 
End bp5220292 
Gene Length2499 bp 
Protein Length832 aa 
Translation table11 
GC content43% 
IMG OID 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_003723300 
Protein GI298493123 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTTGGCA TTGTTATTGT TTCGCACAGT AAACAATTAG TTCTAGGAGT ACAAGAACTA 
GCTGCACAGA TGGTTCAAGG ACAAGTTCCC CTTGCTGTGG CAGCAGGCAT TGATGATCCA
GAAAATCCAC TAGGTACAGA TGCGATTAAG GTTTATGAAG CGATTGCCTC TGTATTCTCT
GATCATGGTG TTCTCGTATT GATGGATTTA GGTAGTGCTT TAATAAGTGC GGAAATGGCA
CTGGAGTTTC TACCCCCTGA ACAACGAGAT CACGTATATC TGTGTGCAGC ACCGTTAGTA
GAAGGTGCTA TTGCTGCGAC TATCGCTGCT GCTACGGGTG TAAGTATCCA GCAAGTTATG
GCGGAAGCAC AAGGAGCATT AGTAGCAAAA GAAACACAAC TAGGTTTAAT TAGTAGTCCA
TTATCAACGG TCAGTCGCCA ACCTACAAAT AATTCCGAAA TTCCCACCAA CGAAATACGG
TTAAAAGTCC GTAACCGCTT GGGTTTACAC GCCCGTCTAT CAGCCCAGTT TGTTGCCACT
GCATCCAAAT TTCAAGCGCA AATCAGGGTA CAGAATCTAA CTAGAAATAC AGAACCTGTC
CGGGGTGACA GTATTAACCA AGTTGCTACC TTGGGAGTAC GTCAAGGACA TGAATTGCTG
ATTACTGCCA CTGGAGTTGA TGCCCAGGAG GCGCTAAAAG CTTTACAGAA ATTGATTATC
AACAATTTTG GGGAAGATGA TAGCATTGTC GGATTAACAC CACCACCTCA AGAATTTACC
CCACCGACCC AGGGTGAACT GTGGGGAATT GCGGCTTCTC CAGGAATTGT GATCGCCCCA
TTAGTTCATT ATCAATCTAC CGCTGTAGCT TGTACGGAAT ATCACATAGA AAATGTGAAT
GTAGAGTGGC AACGATTACA AACAGCTATT CAAATTGCTA AACAGGAAAT TGCAACTTTA
CTCTCCCACA CATCTATTCA AATTGGTGAT GCGGAAGCGG CAATTTTTGA TGCTCATCTC
CTATTTTTAG CAGATCCTGT GATGCTAGAA GCCGTACGTC AGCATATTGT GGAAGAACGT
CTCAACGGAG AAGTAGCTTG GCAAGCTGTA GTGGATGAAG TAGCAAATTC CTACCGCAAA
CTAGAGGATT CTTATTTACA AGAACGAGTT GACGATGTGG TTGATGTGGG ACAGAGGGTA
TTAAGAATAT TACTAGGTAA TGCTCCCACT GACTTGGAAC TTACAGAACC ATCTATTGTA
GTAGGGATAG ATTTAAGTCC TTCAGATACT GTTAAGCTTG ATCCAAGCAA AGTGATGGGT
ATTTGCATGA CCTCCGGAAG TGCAACTTCC CATAGTGCTA TTATCGCTCG AACCCTTGGT
ATTCCTGCTG TTTTGGGTAT AGATGCCCAG GTCTTAAACT TGCAAAGTGG TACATTGATG
GCGCTTGATG GTGAAAGTGG CAAGGTTTGG GTGGCACCAG CAACTGAGAC ATTAGATAGA
CTAGAAGCCA AGCGAGAGAC TTGGAAAATT GCCCAAGCAG AAGCACGAAA GCTGGCACAT
CAACCAGCAG TTACTCGTGA TGGTTGCTAC ATTAAAGTAT TGGCAAATAT TGGTAGTATT
ACGGATGCAG AATTAGCTGT AAATCATGGT GCAGAAGGTG TGGGATTACT TCGTACAGAG
TTTTTGTACT TGGACAGAAC AAGCGCACCT ACAGAAGAGG AACAACTGAA AGTTTATCAA
GCGATTACGC AAGTTTTAGA TAAGCAACCG TTAATTATTC GCACTTTGGA TATAGGTGGA
GATAAACAAC TTCCTTATTT GAGTTTGTCT GTTACAGAAG CTAACCCGTT TTTAGGTGTG
CGCGGGATTC GGTTCTGTTT AGAAAATCCT CAGTTGTTGA AAACTCAGTT ACGGGCGATT
TTACGCGCTA GTGTAGGGCA TAATATCAAA ATTATGTGGC CGATGATTGC TACTTTAACA
GAATTACGAG CAGCTAAGGC AATTTTCAAT CAGGTACAGG AGGAACTGCG ACAAGCTGGT
ATCCCTTTTG ATGAAAATAT GAATGTGGGG ATGATGATAG AAACACCCGC AGCAGTAGCT
ATAGCTGATC AGTTAGCCAG AGCAGTTGAC TTTTTCAGTA TTGGTACAAA TGATTTGAGT
CAGTATGTTA TGGCTGGCGA TCGCACAAAT CCTAGAGTAG CAACTTTAGC GGATGCTTTA
CAACCAGCGG TATTAAGAAT GATTCAGCGA ACTGTCCATG CTGCCCATGC TGCTAATATT
TGGGTAGGAC TATGTGGAGA AGTAGCAGCA GAAACCTTAG TTGCACCCAT TTTATTAGGT
TTGGGATTAG ATGAATTGAG TGTCAATCCT CAAGCAATCG CCCCGCTAAA ACGGGCAATA
TCACAGCTAA CAATCACAGA AGCACAAGCT ATAGCCAATA TAGCATTAGA ACAAGATTCT
GCAACTAGTG TCAGAGAGTT AGTTTATCCT ATTGGGTAA
 
Protein sequence
MVGIVIVSHS KQLVLGVQEL AAQMVQGQVP LAVAAGIDDP ENPLGTDAIK VYEAIASVFS 
DHGVLVLMDL GSALISAEMA LEFLPPEQRD HVYLCAAPLV EGAIAATIAA ATGVSIQQVM
AEAQGALVAK ETQLGLISSP LSTVSRQPTN NSEIPTNEIR LKVRNRLGLH ARLSAQFVAT
ASKFQAQIRV QNLTRNTEPV RGDSINQVAT LGVRQGHELL ITATGVDAQE ALKALQKLII
NNFGEDDSIV GLTPPPQEFT PPTQGELWGI AASPGIVIAP LVHYQSTAVA CTEYHIENVN
VEWQRLQTAI QIAKQEIATL LSHTSIQIGD AEAAIFDAHL LFLADPVMLE AVRQHIVEER
LNGEVAWQAV VDEVANSYRK LEDSYLQERV DDVVDVGQRV LRILLGNAPT DLELTEPSIV
VGIDLSPSDT VKLDPSKVMG ICMTSGSATS HSAIIARTLG IPAVLGIDAQ VLNLQSGTLM
ALDGESGKVW VAPATETLDR LEAKRETWKI AQAEARKLAH QPAVTRDGCY IKVLANIGSI
TDAELAVNHG AEGVGLLRTE FLYLDRTSAP TEEEQLKVYQ AITQVLDKQP LIIRTLDIGG
DKQLPYLSLS VTEANPFLGV RGIRFCLENP QLLKTQLRAI LRASVGHNIK IMWPMIATLT
ELRAAKAIFN QVQEELRQAG IPFDENMNVG MMIETPAAVA IADQLARAVD FFSIGTNDLS
QYVMAGDRTN PRVATLADAL QPAVLRMIQR TVHAAHAANI WVGLCGEVAA ETLVAPILLG
LGLDELSVNP QAIAPLKRAI SQLTITEAQA IANIALEQDS ATSVRELVYP IG