Gene Aazo_4707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4707 
Symbol 
ID9342514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4809854 
End bp4811302 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content36% 
IMG OID 
Productneutral invertase 
Protein accessionYP_003723033 
Protein GI298492856 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACTAA ATGAATTAAG CGCAACTGAA AATATAGAAA AAGAGGCATG GCAAGCACTA 
GAAAATTCCA TCCTCTACTA CAAAGGTCGT CCCATTGGGA CTTTAGCAGC TTATGATCCA
TCTGTAGAGG CTTTGAATTA TGACCAGTGT TTTATTAGAG ATTTTATTTC TTCTGCATTG
ATTTTTCTCA TTAAAGGGAG GACAGATATT GTTAGAAATT TCCTAGAGGA AACCTTGAAT
TTACAGCCTA AAGAAAAGGC TTTGGATGCT TATAAACCCG GTAGAGGGCT AATTCCAGCC
AGTTTTAAAG TGGTATCTAT CAATGGCGAA GAACATTTAG AAGCTGATTT TGGTGAACAT
GCGATCGCCA GAGTTACACC AGTTGATTCT TGTTTATGGT GGCTGATTTT ATTACGTGCT
TATGTAGTCG CTACAAATGA TTATTCACTA GCATATCAGC CTGAATTCCA AACAGGAATT
AGGTTAATTA TGGATATATG TTTGGCAAAT CGTTTTGATA TGTACCCGAC GCTATTAGTT
CCTGATGGCG CGTGTATGAT AGATAGACGT ATGGGGATTT ATGGACATCC TCTAGAAATT
CAAGTTCTAT TTTTCGCTGC ATTGCGTGTA GCCAGAGAAC TATTAATTTG TCAAGGAAAT
CAAGATATAG TTGAGGCAAT TGATAACAGA TTACCCCTAT TATGTGGTCA TATTCGTCAG
TATTATTGGA TAGATATTAA TCGCTTGAAT GCTATTTATC GCTTTAAAAG CGAAGAATAC
GGTAAAACTG CTGTCAATCT TTTCAATATT TATGCAGATT CTTTGCCTTA TTATGAATTG
GACAAATGGC TACCAAAAAT AGGTGGTTAT TTTGCTGGAA ATGTCGGACC ATCACAATTA
GATACTCGTT TCTTCACTCT CGGAAACTTG ATGGCAGTGA TTTGTGATTT GTCTAGTGAA
GAACAGTCCC AAGCAATTAT TAATCTCATC GAAAAACGAT GGGAAGATTT AGTAGCAGAT
ATGCCTATGA AAATCTGTTA TCCTGCCTTA CAAGGTGAAG AATATAGAGT TGTGACAGGA
TGTGATCCAA AAAACATACC TTGGTCATAT CATAATGCTG GTAGCTGGCC TGTTTTAATG
TGGATGTTAG CAGCAGCAGC AGTGAAAACC AAGAAACCAT ATTTGGCTGA AAAAGCTATT
AAAATTGCTA AAGTCAGACT AAGTGAAGAT CAATGGCCTG AATATTACGA TGGTAAGAAA
GGTAGATTAA TTGGTAAACA AGCTAGAAAA TATCAAACCT GGACAATTGC AGGGTATTTA
TTAGCGCAAG AACTAATAGA TAATCCTGAT TATTTACCAT TAATTAGTTT TGATAAATTA
CCGCTAGAAA CAATTTCTAG AGCCTGTGAA TTTGAAGTTA CTGGTTTAGA TCCTTATATG
AATCTGTAA
 
Protein sequence
MQLNELSATE NIEKEAWQAL ENSILYYKGR PIGTLAAYDP SVEALNYDQC FIRDFISSAL 
IFLIKGRTDI VRNFLEETLN LQPKEKALDA YKPGRGLIPA SFKVVSINGE EHLEADFGEH
AIARVTPVDS CLWWLILLRA YVVATNDYSL AYQPEFQTGI RLIMDICLAN RFDMYPTLLV
PDGACMIDRR MGIYGHPLEI QVLFFAALRV ARELLICQGN QDIVEAIDNR LPLLCGHIRQ
YYWIDINRLN AIYRFKSEEY GKTAVNLFNI YADSLPYYEL DKWLPKIGGY FAGNVGPSQL
DTRFFTLGNL MAVICDLSSE EQSQAIINLI EKRWEDLVAD MPMKICYPAL QGEEYRVVTG
CDPKNIPWSY HNAGSWPVLM WMLAAAAVKT KKPYLAEKAI KIAKVRLSED QWPEYYDGKK
GRLIGKQARK YQTWTIAGYL LAQELIDNPD YLPLISFDKL PLETISRACE FEVTGLDPYM
NL