Gene Aazo_0647 details

Gene Information

Locus tag: Aazo_0647
Symbol:
ID: 9338433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 677303
End bp: 678793
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 35%
IMG OID:
Product: Alg9 family protein mannosyltransferase
Protein accession: YP_003720240
Protein GI: 298490063
COG category:
COG ID:
TIGRFAM ID:
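
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Coordinates as listed in the record.
start_bp = 677303
end_bp = 678793

# Assumption: 1-based, inclusive coordinates, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1491 bp) and Protein Length (496 aa).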


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCGCC TGTTTAAACC AGACCACCTA TTATTAGGCA TCCTGATTAT TATTGCATTA 
ATACTGCGTG TTGGCATGGC TTTAAAGTTT CCCAATATAT TTTGGGCAGA TGAAATTTTT
CAATCCCTAG AACCTGCACA TAGACTAGTT TTCGGTAATG GTATCGTCAC ATGGGAATTT
AGAGATGGTA TTCGTTCTTG GGTATTACCA GGAATTTTAG CAGGTGTTAT GCACCTTACA
GCATCGATGG GAGAAGGTTC TACTGGATAT TTAATAGGAG TTAATATCTT TTTGTCTCTA
CTTTCCCTGA GTAATATCTT AGTTGCTTAT GTTTGGGGAA AAAAAATAGG AGGAACAATT
ACAGCCCTTA TTTGTGCCGC TATTTGTACT ATATGGTTTG AGTTGATCTA TTTTTCACCC
AAAGCTTTTA CCGAAGTTGT AGCTACTCAT GTCCTCTTAC CTGGAATTTA CTTAGGAGTG
CAAAAAGATT CTATCACTAG AAATCGCCTA TTTTTATCAG GATCTTTGTT AGGAATATCT
TTAGCATTAA GAATTCATCT CATACCAGCT ATAATTTTTG CAGTAGTTTA CATTTGTAAA
CGAGGTTGGC AGCAAAAATG GTTGCCAATG ATAGCAGGTA TTATAGCTCC CGTATTATTG
TTTGGTACTG TGGATGCTTT CACTTGGTCT TATCCTTTTC AATCTTTCTG GTTGAATATT
TGGGTAAATA TTGTTGAAGG TAGAAGTAAA CTATATGGTG TTTCTCCTTG GTATGAATAT
TTTATTTTTT TGTTCAAAAG TTGGTCGTGG CTATCCATAC CTATTATTAT TCTTACTATT
ATAGGTTTTC GTCGTATTCC TATTTTGGGA TGGTTAGCCT TAATTATTAT CTTGTCTCAC
AGTTTCTTGG CTCATAAGGA ATATCGTTTT ATTTATCCAG CATTACCAAT GTTGTTTATA
TTAGCAGGAA TAGGCACAGG TGAGTTAGTT TTAAGATCTT CTGGTAGATG GTCTTCACTG
CACATCAGGA TAATAGCAAT ATTACTCTCT ATTTATCTTT GGAGTTCAAC TTCTATTGCT
CTCTTGAGTA GATTTAATAT TTATGCTCCT TTAAGCTTTT CCACTTTTGG CACGAATTGG
GAAATGACAC ATCTCTATGC TACTGCTAAT AATCTCGTAG TCTTGCAAAG TTTAAGTACA
GAAGAAAATG TATGCGGTCT TGGACTTTGG GGTGTTAATT GGGCCTTATC AGGAGGTTAT
ACTTATTTTC ACCGTGATGT ACCTATATAT CAAGTTGATA CACAAATAGA CTTTGCAGTT
GCCAATTCAG GTTTTAATTA TGTTGTTGGT AATTCTCCTC TACCATCTAC ATATCCAAAT
TATTCTTTGC AGCAATGTAG GCTAGGAACT TGTGTTTATA AACGTCCGGG TTCCTGTAGC
AAAATTAAGG AACGTGAAAT CAATTATGTG CTTAAAATGT CAGGAAATTA G
 
Protein sequence
MNRLFKPDHL LLGILIIIAL ILRVGMALKF PNIFWADEIF QSLEPAHRLV FGNGIVTWEF 
RDGIRSWVLP GILAGVMHLT ASMGEGSTGY LIGVNIFLSL LSLSNILVAY VWGKKIGGTI
TALICAAICT IWFELIYFSP KAFTEVVATH VLLPGIYLGV QKDSITRNRL FLSGSLLGIS
LALRIHLIPA IIFAVVYICK RGWQQKWLPM IAGIIAPVLL FGTVDAFTWS YPFQSFWLNI
WVNIVEGRSK LYGVSPWYEY FIFLFKSWSW LSIPIIILTI IGFRRIPILG WLALIIILSH
SFLAHKEYRF IYPALPMLFI LAGIGTGELV LRSSGRWSSL HIRIIAILLS IYLWSSTSIA
LLSRFNIYAP LSFSTFGTNW EMTHLYATAN NLVVLQSLST EENVCGLGLW GVNWALSGGY
TYFHRDVPIY QVDTQIDFAV ANSGFNYVVG NSPLPSTYPN YSLQQCRLGT CVYKRPGSCS
KIKEREINYV LKMSGN
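
The relationship between the gene sequence and the protein sequence can be checked with a small script. The following Python sketch is a minimal illustration, not the method used by the database: it strips the spacing used in the listing, translates codons with the standard amino-acid assignments (which translation table 11 shares with the standard code; the two differ in permitted start codons), and computes a GC fraction. The function names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 bases and the last line of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last line of the gene sequence, copied from the listing.
first_bases = "ATGAACCGCC TGTTTAAACC AGACCACCTA"
last_line = "AAAATTAAGG AACGTGAAAT CAATTATGTG CTTAAAATGT CAGGAAATTA G"

print(translate(first_bases))
print(translate(last_line))
print(round(gc_fraction(last_line), 3))
```

Pasting the complete gene sequence into `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the complete sequence can be compared with the listed GC content. The GC fraction of a short fragment need not match the whole-gene figure.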