Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_C0232 |
Symbol | |
ID | 3678031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007412 |
Strand | + |
Start bp | 272499 |
End bp | 273746 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637715312 |
Product | C-5 cytosine-specific DNA methylase |
Protein accession | YP_320506 |
Protein GI | 75812889 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.547554 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000406028 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAACTCAA CAGCAACAAA CAAAAAAATA GCTATTAGCC TATTCTCCGG GGTGGGTGGA TTTGATTTGG GATTTGAAGC AGCAGGATTT GAAATCGCGA TCGCCATTGA TAATAATCCC ATCGTCCTAG CAACATACCA ACATAATTTC CCGCACGCCA CAGTCTTGTG CAAAGACATT CGGGAGGTGA CGGCGCAAGA AATACGTGCC TGTATTCAGG CGAAGTATGT AGATTGGGAC GGGGAAATTC ATACCGTTTT TGGTGGCCCA CCATGCCAAG GATTTAGCGT TGCTGGACTG CAAAATGTTG AGGATGAGAG AAACTCGTTG GTGGGGGAGT TTGTTCGCCT GGTGTTGGAA CTCAATCCTC TTGCGGCAAT CATGGAGAAT GTGCCAGGGA TTGAGAATCA GAAGTTTGGC TGCATTACTG CTAACCTCCA AGCAGTGCTA GAAGAACATT ATTTTCTCTC AAAGTGGAAC CTCACCGCTT CAGATTACGG AGTTCCGCAA GCTAGAAAAA GGGTGTTTTT TGTTGCATCT AAGTTTGGAG AAATTATACC GCCGGAGCAT CAACCTCAAC ATACAGTTAG AGATGCGATC GCGGACTTGT TGCCAGTCCC CCTACTCCCC AAGCAAAACA CTCAAGAATG GCATCCAGAT TGGGTGAAGG GAGAATATGC TAAGTATCTT GAAAAAATAT TCCCAAATTT TGGTATAGTA ACTAACATTG AGACGGGATT CGCAGCGACA ACACATACAC CAGAAGTAAT TCAGCAATTC ATCAACACTC CCCCAGGTGC AAGGGAAGCT AAATCCAAAT CAAAGAAGCT GCAATGGGAT GGATTCTGCG TGACGTTAAG AGCGGGGAGT GGCAACCGCA CTGCATTGCG TCCCCTGCAT CCAGAACAGC CACGAGTTAT CTCAGTTAGA GAAGCTGCTC GTTTGCACAG CTACCCCGAT TGGTTTAATT TTAGTGAGGC AATACTCCAC GCCCAAAGAG AAATCGGAAA TTCGGTACCC CCATTGCTTG CATATGCTGT GGGAATGCAA GTTAAAGAAC ATCTAGAATG CAATATCAAT TATCAGATTA AGTGCCAAAA TAGGCATTTT TGCCAATTTT TAACAATTTA TTGCAATTTT AAGAATATAT TTATTTTTGT GAATGAAATG CTGTCTTTAA GCGGGTTGTC CATTAAAGAA TTATTTGTCA ATTTTAAAAA ATTAATGTCA GGCAGTGTCA GAGGCTAA
|
Protein sequence | MNSTATNKKI AISLFSGVGG FDLGFEAAGF EIAIAIDNNP IVLATYQHNF PHATVLCKDI REVTAQEIRA CIQAKYVDWD GEIHTVFGGP PCQGFSVAGL QNVEDERNSL VGEFVRLVLE LNPLAAIMEN VPGIENQKFG CITANLQAVL EEHYFLSKWN LTASDYGVPQ ARKRVFFVAS KFGEIIPPEH QPQHTVRDAI ADLLPVPLLP KQNTQEWHPD WVKGEYAKYL EKIFPNFGIV TNIETGFAAT THTPEVIQQF INTPPGAREA KSKSKKLQWD GFCVTLRAGS GNRTALRPLH PEQPRVISVR EAARLHSYPD WFNFSEAILH AQREIGNSVP PLLAYAVGMQ VKEHLECNIN YQIKCQNRHF CQFLTIYCNF KNIFIFVNEM LSLSGLSIKE LFVNFKKLMS GSVRG
|
| |