Gene Ava_C0232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_C0232 
Symbol 
ID3678031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007412 
Strand
Start bp272499 
End bp273746 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content43% 
IMG OID637715312 
ProductC-5 cytosine-specific DNA methylase 
Protein accessionYP_320506 
Protein GI75812889 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.547554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000406028 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAACTCAA CAGCAACAAA CAAAAAAATA GCTATTAGCC TATTCTCCGG GGTGGGTGGA 
TTTGATTTGG GATTTGAAGC AGCAGGATTT GAAATCGCGA TCGCCATTGA TAATAATCCC
ATCGTCCTAG CAACATACCA ACATAATTTC CCGCACGCCA CAGTCTTGTG CAAAGACATT
CGGGAGGTGA CGGCGCAAGA AATACGTGCC TGTATTCAGG CGAAGTATGT AGATTGGGAC
GGGGAAATTC ATACCGTTTT TGGTGGCCCA CCATGCCAAG GATTTAGCGT TGCTGGACTG
CAAAATGTTG AGGATGAGAG AAACTCGTTG GTGGGGGAGT TTGTTCGCCT GGTGTTGGAA
CTCAATCCTC TTGCGGCAAT CATGGAGAAT GTGCCAGGGA TTGAGAATCA GAAGTTTGGC
TGCATTACTG CTAACCTCCA AGCAGTGCTA GAAGAACATT ATTTTCTCTC AAAGTGGAAC
CTCACCGCTT CAGATTACGG AGTTCCGCAA GCTAGAAAAA GGGTGTTTTT TGTTGCATCT
AAGTTTGGAG AAATTATACC GCCGGAGCAT CAACCTCAAC ATACAGTTAG AGATGCGATC
GCGGACTTGT TGCCAGTCCC CCTACTCCCC AAGCAAAACA CTCAAGAATG GCATCCAGAT
TGGGTGAAGG GAGAATATGC TAAGTATCTT GAAAAAATAT TCCCAAATTT TGGTATAGTA
ACTAACATTG AGACGGGATT CGCAGCGACA ACACATACAC CAGAAGTAAT TCAGCAATTC
ATCAACACTC CCCCAGGTGC AAGGGAAGCT AAATCCAAAT CAAAGAAGCT GCAATGGGAT
GGATTCTGCG TGACGTTAAG AGCGGGGAGT GGCAACCGCA CTGCATTGCG TCCCCTGCAT
CCAGAACAGC CACGAGTTAT CTCAGTTAGA GAAGCTGCTC GTTTGCACAG CTACCCCGAT
TGGTTTAATT TTAGTGAGGC AATACTCCAC GCCCAAAGAG AAATCGGAAA TTCGGTACCC
CCATTGCTTG CATATGCTGT GGGAATGCAA GTTAAAGAAC ATCTAGAATG CAATATCAAT
TATCAGATTA AGTGCCAAAA TAGGCATTTT TGCCAATTTT TAACAATTTA TTGCAATTTT
AAGAATATAT TTATTTTTGT GAATGAAATG CTGTCTTTAA GCGGGTTGTC CATTAAAGAA
TTATTTGTCA ATTTTAAAAA ATTAATGTCA GGCAGTGTCA GAGGCTAA
 
Protein sequence
MNSTATNKKI AISLFSGVGG FDLGFEAAGF EIAIAIDNNP IVLATYQHNF PHATVLCKDI 
REVTAQEIRA CIQAKYVDWD GEIHTVFGGP PCQGFSVAGL QNVEDERNSL VGEFVRLVLE
LNPLAAIMEN VPGIENQKFG CITANLQAVL EEHYFLSKWN LTASDYGVPQ ARKRVFFVAS
KFGEIIPPEH QPQHTVRDAI ADLLPVPLLP KQNTQEWHPD WVKGEYAKYL EKIFPNFGIV
TNIETGFAAT THTPEVIQQF INTPPGAREA KSKSKKLQWD GFCVTLRAGS GNRTALRPLH
PEQPRVISVR EAARLHSYPD WFNFSEAILH AQREIGNSVP PLLAYAVGMQ VKEHLECNIN
YQIKCQNRHF CQFLTIYCNF KNIFIFVNEM LSLSGLSIKE LFVNFKKLMS GSVRG