Gene Information |
Locus tag | Aazo_5072 |
Symbol | |
ID | 9342881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 5196353 |
End bp | 5198170 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | |
Product | family 39 glycosyl transferase |
Protein accession | YP_003723290 |
Protein GI | 298493113 |
COG category | |
COG ID | |
TIGRFAM ID | |
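The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length counts the terminal stop codon. Both are assumptions that happen to reproduce the values in this record, not something the record states. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be included).
    return cds_length_bp // 3 - 1


length_bp = gene_length(5196353, 5198170)
print(length_bp)                  # 1818, matches "Gene Length"
print(protein_length(length_bp))  # 605, matches "Protein Length"
```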
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGGCTAA AATTGAGCAA TCGCTCCTCT GTTGACCAAT GGATTAATAA GATAGAACAG CGTCCAGCCC TTGCTGTGAC TATTTCAATA GTGTGGTTGC TGTTGATTAA TTACATCGCC TTTGTCTGGA ATTTGGGCAA TATTGGCTTA ATTGACGAAA CTGAGCCGCT GTTTGCAGAA GCTTCCCGGC AAATGCTAGT TACAGGTGAT TGGATTACAC CCTTTTTTAA TGGTGAAACT CGTTTTGACA AACCAGCGTT AATTTACTGG TGTCAAGCGC TCGCCTACTC TATTATGGGG GTGAATGAAT GGGCAGCACG CATACCCTCG GCATTAGCAG CAACGGGTGT GACAGCTTTG GCATTCTACG GTATACACTG GCATTTTGCC AAAAAAGATC AATTAGAGCA AGTTGCAAAT CCTAATCGTC GTTACTTAAC AGCAGCTATT GCATCAGCTT TAATGGCACT CAATCCCGAA ATGATTGTTT GGGGGAGAGT TGGTGTTTCC GATATGTTAC TCACCGGTTG TATAGCCTCA GCTTTGCTTT GCTTCTTTTT GGGATACGCT CAAAATTCTT CCCCTTCTCC CTTCCCCAAT AAATGGTATC TGGCTTGTTA TGTATTGATG ACCGGAGCAA TTTTAACCAA AGGACCAGTG GGAATAGTTT TACCAGGATT AATTATGATT GCCTTTGCCC TATACTTAGG CAAATTCTGC GAACTGTGGC GAGAAATGCG CCCGATTTTG GGCATGGGAA TAGTCTTCGC TTTATCTGCT CCCTGGTACA TCTTGGTGAC TTGGCGCAAC GGCTGGAATT TTATTAATAC CTTTTTTGTT TATCACAACA TAGAACGCTT TACAGAGGTT GTGAATGGTC ACTCAGCCCC TTGGTATTTT TATTTTTTGG TAGTATTGTT GGGTTTTGCA CCATATTCAG TTTTTATACC TATGTCCATA GCCAGGTTAA AATTTTGGCA GCGCTCGCAC TGGAAAAATC AGGAACGTTC TCAACAATTG GGTTTATTTG CCTGTTTCTG GTTTTTGGGT GTATTTTGCT TTTTCACCAT CTCCGTCACC AAACTCCCCA GTTACGTATT ACCTTTAATG CCAGCAGCAG CCATTCTTGT AGCCTTATCT TGGAGTAACC TGTACCCAAA CACACAAACT CCTCAAGCTT TCCACATCAG TAGTTGGGTG AATGTGGCTT TTCTCTCAAC ACTTGGAGTG GCATTATTCA ACATATCCCA CATTATCGGC AAAGACCCCG CTGCACCTGA ATTGTACGAA CAAATACAAA ATTCAGGAAT GGCTAATGTG GGTGGTATAA TTTGGCTGAC TGGTGCTGTA ATTATCGCTA TTTTGATCCT CTCTTACCGT TGGCGTGCCA TCATTACTAT TAATTTGGTG GGTTTCGTAG CATTTTTATC ATTGGTTTTA ATGCCTGCTT TATTCTTGAT GGATCAAGAG CGTCAGGAAC CTTTAAGACA ATTATCTGCG CTCGCAGTCA AAGAAAAACA ACCCAATGAA GAATTAGTCA TGGTCGGTTT CAAAAAACCG ACCGTCACTT TCTACACTCA AAACAAAGTT AATTACCTGG AATTTTCCCA ACAAGCTTTA GACCATATTT ACAATCAAGC AGCCAACAAA ACACATCCAG CATCACTGCT ACTTCTGACC GAGCAGAAAA AGTTAATTGA TATGAACTTA CCACCAGATA TTTATAAAAA TATCGCCACC AAAGGAGCTT ATAATCTCAT TCGTATTCCC TTGCAGAGAA TTAAACAAAA CAAAAAGGAA 
AAAACAGACA TTTCGTAA
Protein sequence | MRLKLSNRSS VDQWINKIEQ RPALAVTISI VWLLLINYIA FVWNLGNIGL IDETEPLFAE ASRQMLVTGD WITPFFNGET RFDKPALIYW CQALAYSIMG VNEWAARIPS ALAATGVTAL AFYGIHWHFA KKDQLEQVAN PNRRYLTAAI ASALMALNPE MIVWGRVGVS DMLLTGCIAS ALLCFFLGYA QNSSPSPFPN KWYLACYVLM TGAILTKGPV GIVLPGLIMI AFALYLGKFC ELWREMRPIL GMGIVFALSA PWYILVTWRN GWNFINTFFV YHNIERFTEV VNGHSAPWYF YFLVVLLGFA PYSVFIPMSI ARLKFWQRSH WKNQERSQQL GLFACFWFLG VFCFFTISVT KLPSYVLPLM PAAAILVALS WSNLYPNTQT PQAFHISSWV NVAFLSTLGV ALFNISHIIG KDPAAPELYE QIQNSGMANV GGIIWLTGAV IIAILILSYR WRAIITINLV GFVAFLSLVL MPALFLMDQE RQEPLRQLSA LAVKEKQPNE ELVMVGFKKP TVTFYTQNKV NYLEFSQQAL DHIYNQAANK THPASLLLLT EQKKLIDMNL PPDIYKNIAT KGAYNLIRIP LQRIKQNKKE KTDIS
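The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs from the standard table in which codons may act as starts). The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 and last 18 nucleotides of the gene sequence above are used as input.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 nucleotides of the gene sequence in this record.
print(translate("ATGAGGCTAA AATTGAGCAA TCGCTCCTCT"))  # MRLKLSNRSS
# Last 18 nucleotides, ending in the TAA stop codon.
print(translate("AAAACAGACA TTTCGTAA"))               # KTDIS*
```

Both outputs agree with the start and end of the protein sequence listed above. The trailing `*` is the stop codon, which is not part of the 605-residue protein.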