Gene Ava_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0101 
Symbol 
ID3683376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp134074 
End bp135369 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content43% 
IMG OID637715428 
ProductC-5 cytosine-specific DNA methylase 
Protein accessionYP_320622 
Protein GI75906326 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.999272 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.127001 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCCA GCACCCAGGA ATTAAAACGC AACTCTAAGC AACGACGACC CATCGCCGTA 
GATTTGTTTG CAGGTGCAGG GGGGATGACT CTTGGTTTTG AACAGGCTGG TTTTGATGTG
TTGGCGGCGG TGGAAATTGA CCCCATTCAT TGTGCAGTAC ATGAGTATAA CTTTCCTTTT
TGCTCAGTTT TGTGCAAAAG TGTAGAGGAG ACAACAGGAA AAGAGATACG CGATCGCTCC
AAAATTAATA ACCAGGACAT TGATGTTATT ATTTGCGGTT CGCCCTGCCA AGGTTTTTCC
CTCATGGGTA AGCGAATTTT TGATGACCCC CGTAACTCCT TGGTATTTCA CTTTCATCGG
TTAGTACTGG AGTTACAACC GAAATTTTTT GTGATGGAAA ATGTGCGGGG GATAACCCTT
GGTGAACATA AACAAATCCT CCAAGCCTTG ATTCATGAGT TTAAAAGCCA CGGTTATCAA
GTGGAAGAGA ATTATCAAGT TCTCAATGCT GCTCATTATG GAGTACCGCA AGCGCGGGAA
AGATTATTTC TCATTGGTGC CAGGGAAGAT GTAAAGTTAC CAAAATACCC CAAACCAATT
ACCAAACCAG CGAAATCAAA TAACTCAAAA GCCAAGAATT TATCTCGTTT GCCACTTTGT
CCCACTGTTT GGGAGGCCAT TGGCGATTTA CCGGAGGTAG AACAATACCC GGAATTGTTA
ACAAGAGATT GGATAATTGC CGAGTATGGC AAACCCAGTA ATTACGCTGC CGTACTTCGC
GGTATTAGTA CTTTAGCAGA TGATTATTCA TGCGATCGCC TATTTGATTC TCGTCTTCTT
TCTTCCAGCC TCAGAACCAA ACATTCACAG ACAACTATAG AACGTTTTGC CGCTACAATC
CCAGGTGAAA GAGAACCAAT CAGCCGATTC CATAAACTGC ATCCATCTGG TGTCTGCAAT
ACATTAAGAG CAGGAACAGA TAAATATAAA GGTTCTTTCA CCTCTCCGAG ACCAATTCAT
CCATTCACAC CCCGATGTAT TACAGTCCGA GAAGCCGCAC GCTTGCATTC TTATCCAGAC
TGGTTTAGAT TTCATATCAC CAAATGGCAT GGTTTTCGCC AAGTCGGTAA CTCTGTACCG
CCATTACTAG CAAAAGCAGT TGCCAGCGAG ATTATTCGCA GATTGAATAT ATCACCTGTT
AAACCCAGTA TTCATTACCC ATTGGGGCAA GAAAAGCTAC TACAATTCAA TATCTCCCAA
GCTGCACAGC ATTATTCTAG CTTGAAAGGG GTGTAA
 
Protein sequence
MSSSTQELKR NSKQRRPIAV DLFAGAGGMT LGFEQAGFDV LAAVEIDPIH CAVHEYNFPF 
CSVLCKSVEE TTGKEIRDRS KINNQDIDVI ICGSPCQGFS LMGKRIFDDP RNSLVFHFHR
LVLELQPKFF VMENVRGITL GEHKQILQAL IHEFKSHGYQ VEENYQVLNA AHYGVPQARE
RLFLIGARED VKLPKYPKPI TKPAKSNNSK AKNLSRLPLC PTVWEAIGDL PEVEQYPELL
TRDWIIAEYG KPSNYAAVLR GISTLADDYS CDRLFDSRLL SSSLRTKHSQ TTIERFAATI
PGEREPISRF HKLHPSGVCN TLRAGTDKYK GSFTSPRPIH PFTPRCITVR EAARLHSYPD
WFRFHITKWH GFRQVGNSVP PLLAKAVASE IIRRLNISPV KPSIHYPLGQ EKLLQFNISQ
AAQHYSSLKG V