Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4918 |
Symbol | |
ID | 9342725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 5033784 |
End bp | 5034884 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | group 1 glycosyl transferase |
Protein accession | YP_003723177 |
Protein GI | 298493000 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATATG CCCAGATTCA AAAAATTAGT GTCATTTTCT ATTCTATACT GCCATCTCCA TATCAAAGAG ATTTGTTCTA CGCCATATCT CAACATCCTA GTTTAAACTT AAAAATCTTA TACTTAGAAT CAGCTTATGT CGATTCTCCT TGGCCTGAAA AACCCTTGCA ACCTTATGAA CAAGTATTAC CAGGGTTTCA TTTAGCTTGG GGTTTATCCC GATTTCATTT TAACTGGCAT TTTCCATCAA CCACTGGAGT TGATGTGGTG GTGCTAAATG GCTATATGAA TGTAACTACT CAATTGTTAC TCAGATTGCA CGCTCAAAAA ATTCCTTGTA TTTTTTGGGG TGAAAAAATG GTGGGTGGTT CACAGGGAAT TAAAGAAAAA TTACAAAAAT ACTTAGCAGG TAGTTTAAAA TATTGTCGAG CGATCGCTGC CATTGGCACT AAAGCACAAC AAGACTACCA ACAACGTTTT CCTGATCAAC CTATTTTTAA TATTCCTTAC TATTGTGATT TAAGTACTTT TAGCCAAGAT TTACCCCAGC GTCCACGCAA TCCCGTTACT ATTCTATTTT GTGGTCAGAT AATTGCCCGC AAGGGAGTTG ATGTTCTCAC TCAAGCATTT GATAATCTGA TTCAAGCTGG TTTAGATGCT CGTTTATTGC TGGTGGGAAG AGAAGCAGAA CTTCCTCAAT TGTTAGAATT ATTACCCATA ATAACTCGGC AAAAAATTGA ATACGCAGGA TTTCAATCTC CTGAAAATTT GCCTCATTTC TTTCGAGAAG CTGATATATT TGTCTTACCT AGTCGATATG ATGGCTGGGG AGTAGTTGTT AATCAAGCTC TTGGGGCTGG ATTACCTATT ATTTGTTCTG ATACAGTTGG TGCTGCTTAT GATTTAGTTG AAACGGGTAA AAATGGCTTT CTATTTCCAT CAGGTGATGT TGTTAGTCTA ACTGAAATTT TAATATATTA TTTAAACAAT CCAGAAACAA TTGCAGCCGC AAGCGAACAA TCACTAAAAA AAGCAATTAA TTTTTCGCTA CAGGCAGGTG CAGAAAGTTG GATTGAAGTG TTTCAAAAAG TGGCTCGATA G
|
Protein sequence | MKYAQIQKIS VIFYSILPSP YQRDLFYAIS QHPSLNLKIL YLESAYVDSP WPEKPLQPYE QVLPGFHLAW GLSRFHFNWH FPSTTGVDVV VLNGYMNVTT QLLLRLHAQK IPCIFWGEKM VGGSQGIKEK LQKYLAGSLK YCRAIAAIGT KAQQDYQQRF PDQPIFNIPY YCDLSTFSQD LPQRPRNPVT ILFCGQIIAR KGVDVLTQAF DNLIQAGLDA RLLLVGREAE LPQLLELLPI ITRQKIEYAG FQSPENLPHF FREADIFVLP SRYDGWGVVV NQALGAGLPI ICSDTVGAAY DLVETGKNGF LFPSGDVVSL TEILIYYLNN PETIAAASEQ SLKKAINFSL QAGAESWIEV FQKVAR
|
| |