Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_3736 |
Symbol | |
ID | 9341541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 3793251 |
End bp | 3794555 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | |
Product | DNA-cytosine methyltransferase |
Protein accession | YP_003722402 |
Protein GI | 298492225 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.730052 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAGAC AACGGCCAAT TGCAGTTGAC TTATTTGCAG GTGCAGGAGG TATGACCCTT GGCTTTGAAC AAGCAGGTTT TGATGTACTT GTATCTGTAG AACTAGACCC AATTCATTGT GCAATTCATA AATTTAACTT TCCCTTTTGG AAAGTCTTAT GTAAAAGTGT AGAAGAAACA ACAGGGTCAG AAATTAGAAA TAGTTCTGAC ATTGGTAATC AAGAAATTGA TGTAGTGTTT GGCGGTCCAC CATGTCAAGG CTTTTCATTA ATTGGTAAAC GTTCTATCGA TGACCCTAGA AATACTCTAG GTTTCCATTT TATTAGGTTG GTTTTGGAAC TACAACCTAA TTTTTTTGTT TTGGAAAATG TTAAAGGAAT GACCGTAGGT AAACACAAAG AATTTACTAC GGAAATAATT GATAAGTTTG AAAATAATGG TTATAAAGTA AATCGAAATT ATCAATTATT AAATGCTGCT AATTATGGAG TACCACAAAA TCGAGAAAGA TTATTTTTAT TAGGTTGTCG TCAAGATTTA AAATTACCAA ATCATCCAGA TAAAATTACC CATCCTGCCA AATCTAATAA CTCTATAGCT TGCACCACAA TTGCACTATC AAAATTACCA TCAACACCTA CAGTTTTGCA AGCTATTCAA GACCTACCGG AAATAGAAAA TTATCCAGAA TTATATCAAC AGGATTGGGT AGTAACTGAT TTTGGAAAAC CTAGTAATTA CGGGAAAAAA ATGCGTCATC CTAGCCTATC CAAAAATAAT TATTCCTATC AGCGTAAGTT TAATCATAAC ATTCTAACAT CCAGTTTAAG AACAAAACAT AATCCCGAAT CCATCGAAAG ATTTGCATTA ACTCCCTATG GAAAAATCGA ACCAATCAGC CGTTTTTATA AACTAGCTCC TGATGGCTTA TGTAACACAC TCAGAGCAGG AACAGCAAGT AATAAAGGTG CATTTACTTC CCCTCGTCCC ATACATCCTT TTAAACCTAG ATGTATTACT GTTAGAGAAG CTGCACGGTT GCATTCTTAT CCTGATTGGT TTAGATTTCA CCCTACAAAA TGGCATGGTT TTAGACAAAT CGGTAACTCT GTTCCTCCAC TTTTAGCTCA AGCTGTAGCA TCAGAAATTA TTAAAGTGTT AGGTATAAAA TCTTCCCAAC TTAAGTTTGG TAAAGATTTA AAAGATTTAG GAGAAACCAG GTTATTAACA TTTGATATGT CAGAAGCTGC TAAATATTTT GATGTTAATC CTGATGTTAT AGAACCTAGA ATTAGAAAGA AATGA
|
Protein sequence | MFRQRPIAVD LFAGAGGMTL GFEQAGFDVL VSVELDPIHC AIHKFNFPFW KVLCKSVEET TGSEIRNSSD IGNQEIDVVF GGPPCQGFSL IGKRSIDDPR NTLGFHFIRL VLELQPNFFV LENVKGMTVG KHKEFTTEII DKFENNGYKV NRNYQLLNAA NYGVPQNRER LFLLGCRQDL KLPNHPDKIT HPAKSNNSIA CTTIALSKLP STPTVLQAIQ DLPEIENYPE LYQQDWVVTD FGKPSNYGKK MRHPSLSKNN YSYQRKFNHN ILTSSLRTKH NPESIERFAL TPYGKIEPIS RFYKLAPDGL CNTLRAGTAS NKGAFTSPRP IHPFKPRCIT VREAARLHSY PDWFRFHPTK WHGFRQIGNS VPPLLAQAVA SEIIKVLGIK SSQLKFGKDL KDLGETRLLT FDMSEAAKYF DVNPDVIEPR IRKK
|
| |