Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4553 |
Symbol | |
ID | 9342358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4643823 |
End bp | 4645094 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | group 1 glycosyl transferase |
Protein accession | YP_003722939 |
Protein GI | 298492762 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTCTA CCACTAACAA ACGCATTGCC TTGATTTCAG TCCACGGTGA TCCGGCGATT GAAATTGGGA AAGAGGAGGC TGGAGGACAA AATGTTTATG TTCGCCAAGT GGGTGAAGCA CTATCCCAGC TAGGATGGCA AGTTGATATG TTTAGCCGCA AAGTGAGTGT TGACCAAGAA GATATCGTTC AACATAATTC TCGTTGTCGA ACCATTCGTT TAACAGCCGG ACCAGTTGAA TTTGTACCAC GAGATAACGG TTTTAAATAC TTGCCAGAAT TTGTGGAGCA GTTATTGGAA TTTCAAAAAC AAAACAGCAT TAAATATGAG TTGGTTCATA CTAACTACTG GCTATCTAGT TGGGTTGGGT TGCAGCTGAA ACAAATCCAA GGAAGTAAAC AGGTTCACAC ATATCATTCT TTAGGAATAG TCAAATACAA CACAATAGAA AATATTCCTC TAGTTGCTAG TCAACGTCTA GCAGTGGAAA AAGAAGTATT GGAAACAGCG GAAAGAATTG TGGCGACAAG TCCGCAAGAA AAACAACACA TGAGAACTCT GGTTTCTCAT CAAGGAAACA TTGATATTAT TCCTTGTGGT ACAGATATTC GCCATTTTGG TTCAGTGGAT AGACAAGCAG CTAGAGAAGC ATTGGGAATT GATCCACAAG CCAAAGTTGT TTTGTATGTA GGGCGTTTTG ACCCACGCAA AGGGATAGAA ACCTTAGTGC GTGCTGTGCG TGAGTCTAAG TTTTATGGTG ATAAAGACTT AAAACTGATT ATTGGTGGTG GAAGTACACC AGGTAACAGT GATGGTAGAG AACGTGATCG CATCGAGGGA ATTGTTAACG AATTGGGAAT GAGTAAATTT ATTTCCCTTC CTGGTCGTCT CAGTCGAGAA GTCTTACCAA CTTATTACGG TGCGTCTGAT GTTTGTGTGG TTCCCAGTCA CTATGAACCC TTTGGACTCG TGGCTGTGGA AGCAATGGCC AGTGGAACAC CAGTTATAGC TAGTGATGTT GGTGGTCTTC AGTTTACCGT TGTTAATGAA AACACTGGCT TATTAGTACC ACCCCAAGAC GTAGCAGCCT TTAGTAACGC CATTGACCGC ATTCTTGGTA ATCCCCAATG GCGTGCACAA CTAGGTCAAT CGGGTAATAG ACGGGTAATG AGTAAGTTTA GCTGGGACGG TGTAGCTAGT CAGTTAGATG CCCTATACAC CCAACTACTG CAACCAGTTA AAGAAAAAGA ACCTGCTTTA GTTAGTAAGT GA
|
Protein sequence | MNSTTNKRIA LISVHGDPAI EIGKEEAGGQ NVYVRQVGEA LSQLGWQVDM FSRKVSVDQE DIVQHNSRCR TIRLTAGPVE FVPRDNGFKY LPEFVEQLLE FQKQNSIKYE LVHTNYWLSS WVGLQLKQIQ GSKQVHTYHS LGIVKYNTIE NIPLVASQRL AVEKEVLETA ERIVATSPQE KQHMRTLVSH QGNIDIIPCG TDIRHFGSVD RQAAREALGI DPQAKVVLYV GRFDPRKGIE TLVRAVRESK FYGDKDLKLI IGGGSTPGNS DGRERDRIEG IVNELGMSKF ISLPGRLSRE VLPTYYGASD VCVVPSHYEP FGLVAVEAMA SGTPVIASDV GGLQFTVVNE NTGLLVPPQD VAAFSNAIDR ILGNPQWRAQ LGQSGNRRVM SKFSWDGVAS QLDALYTQLL QPVKEKEPAL VSK
|
| |