Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_0517 |
Symbol | |
ID | 9338303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 532458 |
End bp | 534323 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | |
Product | family 2 glycosyl transferase |
Protein accession | YP_003720156 |
Protein GI | 298489979 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00397833 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACAAG CTTTGCTAAA TTCACGACCC ACCCATCAGA ATGTACAAAC GACTGTTAAA TTGCAGAATA TTATTTTGCC GAATTTAGAT ATATGCACTG TCGAAGAACT GTATTTTCGA TTAAACTCTG AATTTTCCAT GAATTATGAA CAAAATATCA TTGAGATTAA TAAATCTGAA ATTATCAGTT TTGACACTTA TTTCAATTCT TTCTCAATTC AAAAATGGCA AGAACATACA AATATAAACT CTATCAATAT TAACCTGAAT GTAAAAGGTA AATTTAAGAT TAATCTCCTT AATATCAATT ATTCTTCACA GATCAAGGGA TTAGTACATC AAAAAATAAT AACTAACACC GAACTTAGAG AAGTATGTGT ATTTAATGAT ATAGACACAC AACCATATAA AGGATTATTG TATTTAGAAC TTGAACCCTT GGAAGATAAT TGTATTTTCG CTGGTGGATA TTTTTATGCA AACGCGAACA TTAATAACTT TTGTAAATCA AATCAGAAAA TAGCTATTGT CATCTGTACA TACAAGAGAG AAGCTTATGT AAATAGAAAT GTGTCTTTGT TAGAAACACA TTTATTCTCT CAACCAGACA TAGGAAATAA ATTTGAAGTA TTTATTATTG ATAATGGTAG AACAATCAAA GATTTTTATA ATAGTAAAAT CCACGTCATA CCTAATAAAA ATGCAGGTGG TACTGGTGGA TATTGCAGAG GCATTATAGA AGTTATGAAG CGGAAGTCTG ATTTTTCGCA TATTGTCTTT ATGGATGATG ATGTAGTTAT TAATCCTGAA GTATTCGAGC GTATTTATAA TTTTCAAACT GTCGCTCATA ATCAGAATTT ATGTCTTGGT GGTAGTATGT TACGGTTAGA TACAAAATAT ATTCAATATG AAAATGGAGC AGTTTGGAAT AAAGAAGTAA TTAGATTAAA ACCAGATTTA GACTTGAGAA CTGTAAGAAA TATTTTATTA AATGAAATAG AAGAACACCT TAGTTACAAT GGTTGGTGGT TATTTTGTTT TCCCATAAAA AGTATAGATG ATTCCAAATT ACCTTATCCA TTTTTTATCA AAATGGATGA TATGGAGTTT CCGATTAGGT TAAATCATAA AATTATTACC TTGAATGGTG TGTGTGTTTG GCACGAAGCA TTAGAAAATA AATACTCACC CATGATGAAC TATTACTTAA AAAAGAACGA GTTAATTTTA AATGTCATTG TATCTGATGA CTTTAGTAAA CTAGATGCAA TCAAACGAAT TATTAAATTT ACCCTTAGAG AAGCATTTTG CTATAAATAC CAAAGTGCAA ATGTTATTCT TAAAGCTGCT GCTGATTTCT TGAAGGGTCC CAGACATTTA ACAGAAATTG ACCCAGAAGA GAAAAACATA GAAATTAGAA GCATGGGAGA AAAATGTGTT AAAGATACTG AATTACCTTT TATGTATATC AAGTATGAAG AAAGCGTAAA TAAAATAGAA AGTACAATGC ATCGCTGGCT GAGATTTATT ACTCTAAATG GACATTTGTT ACCTTCTCCA TTTTTCTATC AAGATATTAA GTTAACTGGA CAGGGATACA AAGTAATTCC TATGCAGGAA TATAGACCTA CAAATGTATT TAGAGCCAGA AAAGCTTTGT ATTATAACTT AATCGATCAA GAAGGATTTG TCGTTAGCTT TTCTAGAGAA GAATTCTTTA AAGTTTTGAT GAAAACATTA GCTTTATCGG TAGAAATATA CTTTAAGTTC TCTAAATTGA AACAAGACTA TAGAGAAACA TTACCTGAAC TGACTAATAG AGAGTTTTGG GAAACCTATT TAGAAACTAA TAAATACTCT AAATAG
|
Protein sequence | MSQALLNSRP THQNVQTTVK LQNIILPNLD ICTVEELYFR LNSEFSMNYE QNIIEINKSE IISFDTYFNS FSIQKWQEHT NINSININLN VKGKFKINLL NINYSSQIKG LVHQKIITNT ELREVCVFND IDTQPYKGLL YLELEPLEDN CIFAGGYFYA NANINNFCKS NQKIAIVICT YKREAYVNRN VSLLETHLFS QPDIGNKFEV FIIDNGRTIK DFYNSKIHVI PNKNAGGTGG YCRGIIEVMK RKSDFSHIVF MDDDVVINPE VFERIYNFQT VAHNQNLCLG GSMLRLDTKY IQYENGAVWN KEVIRLKPDL DLRTVRNILL NEIEEHLSYN GWWLFCFPIK SIDDSKLPYP FFIKMDDMEF PIRLNHKIIT LNGVCVWHEA LENKYSPMMN YYLKKNELIL NVIVSDDFSK LDAIKRIIKF TLREAFCYKY QSANVILKAA ADFLKGPRHL TEIDPEEKNI EIRSMGEKCV KDTELPFMYI KYEESVNKIE STMHRWLRFI TLNGHLLPSP FFYQDIKLTG QGYKVIPMQE YRPTNVFRAR KALYYNLIDQ EGFVVSFSRE EFFKVLMKTL ALSVEIYFKF SKLKQDYRET LPELTNREFW ETYLETNKYS K
|
| |