Gene Aazo_0517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0517 
Symbol 
ID9338303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp532458 
End bp534323 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content29% 
IMG OID 
Productfamily 2 glycosyl transferase 
Protein accessionYP_003720156 
Protein GI298489979 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00397833 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACAAG CTTTGCTAAA TTCACGACCC ACCCATCAGA ATGTACAAAC GACTGTTAAA 
TTGCAGAATA TTATTTTGCC GAATTTAGAT ATATGCACTG TCGAAGAACT GTATTTTCGA
TTAAACTCTG AATTTTCCAT GAATTATGAA CAAAATATCA TTGAGATTAA TAAATCTGAA
ATTATCAGTT TTGACACTTA TTTCAATTCT TTCTCAATTC AAAAATGGCA AGAACATACA
AATATAAACT CTATCAATAT TAACCTGAAT GTAAAAGGTA AATTTAAGAT TAATCTCCTT
AATATCAATT ATTCTTCACA GATCAAGGGA TTAGTACATC AAAAAATAAT AACTAACACC
GAACTTAGAG AAGTATGTGT ATTTAATGAT ATAGACACAC AACCATATAA AGGATTATTG
TATTTAGAAC TTGAACCCTT GGAAGATAAT TGTATTTTCG CTGGTGGATA TTTTTATGCA
AACGCGAACA TTAATAACTT TTGTAAATCA AATCAGAAAA TAGCTATTGT CATCTGTACA
TACAAGAGAG AAGCTTATGT AAATAGAAAT GTGTCTTTGT TAGAAACACA TTTATTCTCT
CAACCAGACA TAGGAAATAA ATTTGAAGTA TTTATTATTG ATAATGGTAG AACAATCAAA
GATTTTTATA ATAGTAAAAT CCACGTCATA CCTAATAAAA ATGCAGGTGG TACTGGTGGA
TATTGCAGAG GCATTATAGA AGTTATGAAG CGGAAGTCTG ATTTTTCGCA TATTGTCTTT
ATGGATGATG ATGTAGTTAT TAATCCTGAA GTATTCGAGC GTATTTATAA TTTTCAAACT
GTCGCTCATA ATCAGAATTT ATGTCTTGGT GGTAGTATGT TACGGTTAGA TACAAAATAT
ATTCAATATG AAAATGGAGC AGTTTGGAAT AAAGAAGTAA TTAGATTAAA ACCAGATTTA
GACTTGAGAA CTGTAAGAAA TATTTTATTA AATGAAATAG AAGAACACCT TAGTTACAAT
GGTTGGTGGT TATTTTGTTT TCCCATAAAA AGTATAGATG ATTCCAAATT ACCTTATCCA
TTTTTTATCA AAATGGATGA TATGGAGTTT CCGATTAGGT TAAATCATAA AATTATTACC
TTGAATGGTG TGTGTGTTTG GCACGAAGCA TTAGAAAATA AATACTCACC CATGATGAAC
TATTACTTAA AAAAGAACGA GTTAATTTTA AATGTCATTG TATCTGATGA CTTTAGTAAA
CTAGATGCAA TCAAACGAAT TATTAAATTT ACCCTTAGAG AAGCATTTTG CTATAAATAC
CAAAGTGCAA ATGTTATTCT TAAAGCTGCT GCTGATTTCT TGAAGGGTCC CAGACATTTA
ACAGAAATTG ACCCAGAAGA GAAAAACATA GAAATTAGAA GCATGGGAGA AAAATGTGTT
AAAGATACTG AATTACCTTT TATGTATATC AAGTATGAAG AAAGCGTAAA TAAAATAGAA
AGTACAATGC ATCGCTGGCT GAGATTTATT ACTCTAAATG GACATTTGTT ACCTTCTCCA
TTTTTCTATC AAGATATTAA GTTAACTGGA CAGGGATACA AAGTAATTCC TATGCAGGAA
TATAGACCTA CAAATGTATT TAGAGCCAGA AAAGCTTTGT ATTATAACTT AATCGATCAA
GAAGGATTTG TCGTTAGCTT TTCTAGAGAA GAATTCTTTA AAGTTTTGAT GAAAACATTA
GCTTTATCGG TAGAAATATA CTTTAAGTTC TCTAAATTGA AACAAGACTA TAGAGAAACA
TTACCTGAAC TGACTAATAG AGAGTTTTGG GAAACCTATT TAGAAACTAA TAAATACTCT
AAATAG
 
Protein sequence
MSQALLNSRP THQNVQTTVK LQNIILPNLD ICTVEELYFR LNSEFSMNYE QNIIEINKSE 
IISFDTYFNS FSIQKWQEHT NINSININLN VKGKFKINLL NINYSSQIKG LVHQKIITNT
ELREVCVFND IDTQPYKGLL YLELEPLEDN CIFAGGYFYA NANINNFCKS NQKIAIVICT
YKREAYVNRN VSLLETHLFS QPDIGNKFEV FIIDNGRTIK DFYNSKIHVI PNKNAGGTGG
YCRGIIEVMK RKSDFSHIVF MDDDVVINPE VFERIYNFQT VAHNQNLCLG GSMLRLDTKY
IQYENGAVWN KEVIRLKPDL DLRTVRNILL NEIEEHLSYN GWWLFCFPIK SIDDSKLPYP
FFIKMDDMEF PIRLNHKIIT LNGVCVWHEA LENKYSPMMN YYLKKNELIL NVIVSDDFSK
LDAIKRIIKF TLREAFCYKY QSANVILKAA ADFLKGPRHL TEIDPEEKNI EIRSMGEKCV
KDTELPFMYI KYEESVNKIE STMHRWLRFI TLNGHLLPSP FFYQDIKLTG QGYKVIPMQE
YRPTNVFRAR KALYYNLIDQ EGFVVSFSRE EFFKVLMKTL ALSVEIYFKF SKLKQDYRET
LPELTNREFW ETYLETNKYS K