Gene Information |
Locus tag | Ava_0101 |
Symbol | |
ID | 3683376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 134074 |
End bp | 135369 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637715428 |
Product | C-5 cytosine-specific DNA methylase |
Protein accession | YP_320622 |
Protein GI | 75906326 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | [TIGR00675] DNA-methyltransferase (dcm) |
|
|
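The coordinate and length fields above are linked by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption, though the listed values are consistent with it) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(134074, 135369)
print(length, protein_length_aa(length))  # 1296 431
```

Both results agree with the Gene Length and Protein Length fields of this record.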
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.999272 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.127001 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCCA GCACCCAGGA ATTAAAACGC AACTCTAAGC AACGACGACC CATCGCCGTA GATTTGTTTG CAGGTGCAGG GGGGATGACT CTTGGTTTTG AACAGGCTGG TTTTGATGTG TTGGCGGCGG TGGAAATTGA CCCCATTCAT TGTGCAGTAC ATGAGTATAA CTTTCCTTTT TGCTCAGTTT TGTGCAAAAG TGTAGAGGAG ACAACAGGAA AAGAGATACG CGATCGCTCC AAAATTAATA ACCAGGACAT TGATGTTATT ATTTGCGGTT CGCCCTGCCA AGGTTTTTCC CTCATGGGTA AGCGAATTTT TGATGACCCC CGTAACTCCT TGGTATTTCA CTTTCATCGG TTAGTACTGG AGTTACAACC GAAATTTTTT GTGATGGAAA ATGTGCGGGG GATAACCCTT GGTGAACATA AACAAATCCT CCAAGCCTTG ATTCATGAGT TTAAAAGCCA CGGTTATCAA GTGGAAGAGA ATTATCAAGT TCTCAATGCT GCTCATTATG GAGTACCGCA AGCGCGGGAA AGATTATTTC TCATTGGTGC CAGGGAAGAT GTAAAGTTAC CAAAATACCC CAAACCAATT ACCAAACCAG CGAAATCAAA TAACTCAAAA GCCAAGAATT TATCTCGTTT GCCACTTTGT CCCACTGTTT GGGAGGCCAT TGGCGATTTA CCGGAGGTAG AACAATACCC GGAATTGTTA ACAAGAGATT GGATAATTGC CGAGTATGGC AAACCCAGTA ATTACGCTGC CGTACTTCGC GGTATTAGTA CTTTAGCAGA TGATTATTCA TGCGATCGCC TATTTGATTC TCGTCTTCTT TCTTCCAGCC TCAGAACCAA ACATTCACAG ACAACTATAG AACGTTTTGC CGCTACAATC CCAGGTGAAA GAGAACCAAT CAGCCGATTC CATAAACTGC ATCCATCTGG TGTCTGCAAT ACATTAAGAG CAGGAACAGA TAAATATAAA GGTTCTTTCA CCTCTCCGAG ACCAATTCAT CCATTCACAC CCCGATGTAT TACAGTCCGA GAAGCCGCAC GCTTGCATTC TTATCCAGAC TGGTTTAGAT TTCATATCAC CAAATGGCAT GGTTTTCGCC AAGTCGGTAA CTCTGTACCG CCATTACTAG CAAAAGCAGT TGCCAGCGAG ATTATTCGCA GATTGAATAT ATCACCTGTT AAACCCAGTA TTCATTACCC ATTGGGGCAA GAAAAGCTAC TACAATTCAA TATCTCCCAA GCTGCACAGC ATTATTCTAG CTTGAAAGGG GTGTAA
|
Protein sequence | MSSSTQELKR NSKQRRPIAV DLFAGAGGMT LGFEQAGFDV LAAVEIDPIH CAVHEYNFPF CSVLCKSVEE TTGKEIRDRS KINNQDIDVI ICGSPCQGFS LMGKRIFDDP RNSLVFHFHR LVLELQPKFF VMENVRGITL GEHKQILQAL IHEFKSHGYQ VEENYQVLNA AHYGVPQARE RLFLIGARED VKLPKYPKPI TKPAKSNNSK AKNLSRLPLC PTVWEAIGDL PEVEQYPELL TRDWIIAEYG KPSNYAAVLR GISTLADDYS CDRLFDSRLL SSSLRTKHSQ TTIERFAATI PGEREPISRF HKLHPSGVCN TLRAGTDKYK GSFTSPRPIH PFTPRCITVR EAARLHSYPD WFRFHITKWH GFRQVGNSVP PLLAKAVASE IIRRLNISPV KPSIHYPLGQ EKLLQFNISQ AAQHYSSLKG V
|
| |
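The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The sketch below shows one way to do both with the Python standard library only. It assumes that table 11 assigns the same amino acids to codons as the standard code (it differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the demo input is only the first 30 nucleotides of the gene sequence listed above.

```python
from itertools import product

# Codon table in TCAG order; table 11 is assumed to share these assignments
# with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between the 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


prefix = "ATGAGTTCCA GCACCCAGGA ATTAAAACGC"  # first 30 nt of the gene sequence
print(translate(prefix))  # MSSSTQELKR
print(round(gc_fraction(prefix), 3))
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields of this record.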