Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_5157 |
Symbol | |
ID | 9342965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 5283315 |
End bp | 5284514 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | group 1 glycosyl transferase |
Protein accession | YP_003723338 |
Protein GI | 298493161 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAT CAAAACAGCA TATTTGTCAT ATAGTTTTGA GTAATATTGG TGGAGCGCCA CGGACTGCGG ATTCCCTAAT TTCTTCCCAA TCTAAAGCTG GGTACAAAGT CTCTAGCGTA GTATTGACTA ATTTAGATCC TCAATGGATA GTAGCCTTTC AAGCTGCTGA AAAACTAATA ATCATGAAAG TTGCAGGTAG TTTATTTTAT ATCGGTGGAC CGATACACCA ACTATGGATA GCTATCCAAT TAAGGAAAGT AATTTCTGAC CTAAAACCAG ATATTGTTGT TTGCCATACG GCATTTATCA CTAAACTTTT TTATATCTCT CAACTTATTC CTGGCAGTTT TTCAGTTCCC TCTATCAGTT ATATTCATAC TGATGTTATT TCTGAACTAC CTGCTGAAAG CAAAAGCAGG TTATACTCAG TGATAAAACT ATTACAGAAT TTATTTATAC TTATTGATAA TTGGATTAGT GTACGTAGTC TTCAGCAAGC TAGTGGATTA GTATTTGTTT GTAAAAGTCT ATATGAAAGA TTCTTAGATC TGGGCTTAAG TCCTCGTCGC ATAGCAATAT GTTATAATCC AGCAATACCT GATCCAAGTC ATAAGCCGTT AAATGCTACA GCGGAATCGT GGTTCAAAAG CCCTGATTTA ATTACCTTTG TATCTGCTTC CAGATTTCAT CATCAAAAAG ATCATCAAAC ATTACTCAAA GCATTTGCTC AAGCTAGTCA ATATCACTCC AACATCCGGT TAATTTTACT AGGAGATGGT GGTTTAGAAA CACAAATCCA AAAATTAGCG ACTTCTTTAG GAATTAGTAA TCTTGTTTTA TTTGCAGGTA CTGTTACTAA TCCTAGAGCT TACTTCTCAT TATCTAGAGC AGTTATACTT GGTTCTCATT ATGAAGGATT TGGTATGGTG CTTGTAGAAG CTGTAGCTAG TGGGGTAACG TTTATTTCCT CTGATTGTCC TGTTGGTCCC CGTGAGATTT CTGAAGTGCT GCAATGTGGA ACTTTAGTAC CAACAAATGA TGTTGATGCT TTAGCACAAG CAATTATTAC TCATGTAGAA ACACCTAAAG AAATAATAGA CCGTTCTGAG CAAATAGAAA GACTTTTTAG TGAGTCTACC TGTGCCAATA GCTTAGAAAT TTTGCTCCAG GAGGTGTTTG GTGAAAAACT GTATAAATGA
|
Protein sequence | MIKSKQHICH IVLSNIGGAP RTADSLISSQ SKAGYKVSSV VLTNLDPQWI VAFQAAEKLI IMKVAGSLFY IGGPIHQLWI AIQLRKVISD LKPDIVVCHT AFITKLFYIS QLIPGSFSVP SISYIHTDVI SELPAESKSR LYSVIKLLQN LFILIDNWIS VRSLQQASGL VFVCKSLYER FLDLGLSPRR IAICYNPAIP DPSHKPLNAT AESWFKSPDL ITFVSASRFH HQKDHQTLLK AFAQASQYHS NIRLILLGDG GLETQIQKLA TSLGISNLVL FAGTVTNPRA YFSLSRAVIL GSHYEGFGMV LVEAVASGVT FISSDCPVGP REISEVLQCG TLVPTNDVDA LAQAIITHVE TPKEIIDRSE QIERLFSEST CANSLEILLQ EVFGEKLYK
|
| |