Gene Information |
Locus tag | Aazo_3925 |
Symbol | |
ID | 9341729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 3987887 |
End bp | 3989152 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | |
Product | group 1 glycosyl transferase |
Protein accession | YP_003722551 |
Protein GI | 298492374 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
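The coordinate and length fields above agree with each other, which a few lines of arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 3987887
end_bp = 3989152

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1266
print(protein_length)  # 421
```

Both results match the Gene Length (1266 bp) and Protein Length (421 aa) fields.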
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAC AACCTCTTCG CATCGCCTTA TTTACAGGAT TGTTTCCTCC ATTCTTAACA GGAGTTTCAG TCGCAGTACA TCAGCGAGTA CGTTGGTTAC TTGAACAAGG ACATCAAGTT TTTCTCATCC ACCCAGAAAT AAACAATCAG TATCCTAAAA TAGTTAGTAA TCGTCCCATG CCGGGACTAG AAGAACTACA ATCTTTCCCT GGATTTTCAT CTTACGCCTT TCCTACACAA CCACTAATCT TCTACAAATC GCTACCTCAA CCACTCAACT ATCGCCATTG GAGCGATACT AAATTACTAG AGAAATTTCA GCCTGATATT ATCATTGTTG AAGAAGCAGC ACAGATGAGG GGGTTATACT CAATTTTCTT GCAAGGCTAT GGTCGGCCGA TAGGAGTTGA ATACGCTAAA CGTACCAAAA CCCCAATTAT CTCCGTGTTT CATACTGATA TCGTGGCTTA TATCCGATAT TATTTAGGAG ATGTATTCTT CAGTTTACTA CGCCCAATCG TTCCTCTTTT AGTGAAGCAG TTTAGTAATG CGTATAGTCT CAATTTATTT CCATCTAGAG AACAACTATC TAAATACCAA AAGCTCAAAT GTAAACGGGT TGAATACGTT CCTTATCAAG GAATTAATTG TGAAAAATTT CACCCCCGGA ACATCTGTTA TGACCCAAGA CCTAATGATC AACGCCCAAC TATTCTCTTC GTTGGACGCA TCACAGCAGA GAAAAATGTC ACTCAACTTT TAGATGCATT TCCATTCATT GCTGCTAAAA TTCCCGATGT CCACTTAGTT ATTATTGGCA GCGGACCTTT AGATCAAGAA ATTCGTCGCC GCGCTCAAGC TTTCCCATTT GGAGTAACAA TTTGGGGTGA ATCTCACGGT ACCGAACTTT TGGGATGGTT CGCTAGAGCC GATGTTTTTG TTAACCCTTC AGTCACGGAA AACTTCTGCA CTACAAATAA CGAAGCTTTA GCTTCTGGAA CTCCTGTAGT TGCAGCTATC GCTCCTTCAA CTCCTGAACA AGTGATCATT GGTTATAATG GCTTTCTTGC TCAACCCAAC AACCCCAAAG ATTTTGCTGA GAAAATAATT AAAATTCTCG AAAATTCTGA CCTCAAAGCA CAATTATCTA AGCAATCTCG TCCTTCAATA TTAGAATTTG ATTGGTCAGT ATGTAGCGAA AAATTTGAAG ATAAGCTCTA CCAATTAGTT GGAATACCCA AAATAGTTGA GTTATCTAAT AATTAG
|
Protein sequence | MKKQPLRIAL FTGLFPPFLT GVSVAVHQRV RWLLEQGHQV FLIHPEINNQ YPKIVSNRPM PGLEELQSFP GFSSYAFPTQ PLIFYKSLPQ PLNYRHWSDT KLLEKFQPDI IIVEEAAQMR GLYSIFLQGY GRPIGVEYAK RTKTPIISVF HTDIVAYIRY YLGDVFFSLL RPIVPLLVKQ FSNAYSLNLF PSREQLSKYQ KLKCKRVEYV PYQGINCEKF HPRNICYDPR PNDQRPTILF VGRITAEKNV TQLLDAFPFI AAKIPDVHLV IIGSGPLDQE IRRRAQAFPF GVTIWGESHG TELLGWFARA DVFVNPSVTE NFCTTNNEAL ASGTPVVAAI APSTPEQVII GYNGFLAQPN NPKDFAEKII KILENSDLKA QLSKQSRPSI LEFDWSVCSE KFEDKLYQLV GIPKIVELSN N
|
| |
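The sequences above are printed in space-separated blocks of 10 letters, so the spaces have to be removed before any analysis. The following is a minimal sketch, not the database's own method, of cleaning a sequence, computing its GC percentage, and translating it. The helper names `clean_sequence`, `gc_percent`, and `translate` are hypothetical. The codon table is the standard assignment, which translation table 11 shares; the alternative start codons that table 11 permits are not handled here, which does not matter for this gene because it starts with ATG.

```python
# Codon-to-amino-acid assignments shared by the standard code and
# translation table 11, with codons ordered by the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the whitespace that splits the sequence into 10-letter blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above (30 of 1266 bases).
fragment = clean_sequence("ATGAAAAAAC AACCTCTTCG CATCGCCTTA")
print(len(fragment))         # 30
print(translate(fragment))   # MKKQPLRIAL
print(gc_percent(fragment))  # 40.0
```

The translated fragment matches the first block of the protein sequence (MKKQPLRIAL). The 40% figure applies only to these 30 bases; the 39% GC content in the record refers to the whole gene.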