Gene Ava_1159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1159 
Symbol 
ID3683354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1419380 
End bp1420930 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content42% 
IMG OID637716495 
ProductN-6 DNA methylase 
Protein accessionYP_321678 
Protein GI75907382 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000301602 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGAAC GTAATGGTAA TGGGGACAAA TCCCTAGAAA ATTGGATATG GGATGCTGCT 
TGTAGTATTC GCGGAGCGCA GGAAGCAGCG AAGTATAAGG ATTTTATTCT GCCGTTGATT
TTTACTAAAC GACTCTGTGA TGTATTCGAT GATGAACTAA ATCGGATTGC TGAGAAGGTA
GGTTCTCGTG CCAAGGCGTT TAAGTTAGTG GCAATGGATC ATAATTTAGT GCGGTTTTAT
TTGCCACTGC AACCACAGAA TCCTGATGAT CCGGTTTGGT CAGTGATTCG CAAGCTTTCA
GACAAGATTG GGGAGAAGTT AACAGACTAT TTGCGAGAAA TTGCTAAGGC GAATCCTTTG
TTGAATGGGA TTATTAATCG AGTTGATTTT AATGCCACAA CTCATGGACA GCGTGACCTT
GATGATGATC GCCTCTCGAA CCTGATTGAA AAAATCTCGG AGAAGCGTCT AGGGTTAAAG
GATGTAGAGC CAGATATCAT TGGGCGCAGT TATGAGTATT TGATTCGCAA GTTTGCTGAA
GGTTCAGGAC AGTCAGCAGG AGAATTTTAC ACCCCGAAGG AAGTAGGGCT AATCATGGCG
AAGATTATGC AACCAGAACC AGGGATGACG ATTTATGATC CCTGTTGTGG TTCGGCAGGT
TTGTTGATTA AGTGTCAGTT GGTATTGCAA GAATCACAAG GTGCAACGGA AAAGTTTGCA
CCGTTGCAAC TGTATGGACA GGAATACACT CCGAATACTT GGGCAATGGC AAACATGAAC
ATGATTATCC ATGATATGGA GGGAAAAATC GAAATTGGGG ATACCTTTCG CCATCCGAAA
TTCATGCAAG CAGGGAAATT AGCTCAGTTT GAGCGAGTGG TGGCTAATCC CATGTGGAAT
CAGAAATGGT TCACAGAGAA AGATTATGAC GGTGATGAGT TAGGACGTTT CCCCAAAGGA
GCAGGTTATC CAGGTTCATC AGCTGATTGG GGTTGGGTAC AACATATTTT GGCATCCTTA
GATAAAACGG GAAAGGCAGC GATCGTTTTA GATACAGGTG CAGCGTCACG GGGTTCAGGG
AATGCTAATA AGAATAAGGA GAAGGAAGTT AGGAAGTGGT TTGTAGAACA GGATTTGATT
GAAGGGGTGA TTTATCTACC ACAAAATCTG TTCTATAACA CTTCTGCCCC AGGTATTCTT
TTATTTTTGA ATAGAGCTAA ACCGAAAGAA CGACAAGGTA AGCTATTTTT CATCAATGCA
AGTTTGGTAT TTGCTAAAGG CGATCCGAAA AATTATATTC CTGATGAGGA AATTGAGCGC
ATTGCCAACA CGTTTTTAAC TTGGCGGGAG GAGGAGAAAT TCAGCCTCAT TGTCTATAAG
GATAAGATTG CCCATAATGA TTATAATATT TCGCCATCTC GTTATATTCA TATAACAGAA
GAGGAGGATT TCAGACCCAT TGCGGAGATT TTGGAGGAGT TAGAGGTTTT AGAGAAGGAA
GCTGCGGAAA CGAATAAATT ATTAATGAAA GTTTTAGGGA GATATCAATG A
 
Protein sequence
MGERNGNGDK SLENWIWDAA CSIRGAQEAA KYKDFILPLI FTKRLCDVFD DELNRIAEKV 
GSRAKAFKLV AMDHNLVRFY LPLQPQNPDD PVWSVIRKLS DKIGEKLTDY LREIAKANPL
LNGIINRVDF NATTHGQRDL DDDRLSNLIE KISEKRLGLK DVEPDIIGRS YEYLIRKFAE
GSGQSAGEFY TPKEVGLIMA KIMQPEPGMT IYDPCCGSAG LLIKCQLVLQ ESQGATEKFA
PLQLYGQEYT PNTWAMANMN MIIHDMEGKI EIGDTFRHPK FMQAGKLAQF ERVVANPMWN
QKWFTEKDYD GDELGRFPKG AGYPGSSADW GWVQHILASL DKTGKAAIVL DTGAASRGSG
NANKNKEKEV RKWFVEQDLI EGVIYLPQNL FYNTSAPGIL LFLNRAKPKE RQGKLFFINA
SLVFAKGDPK NYIPDEEIER IANTFLTWRE EEKFSLIVYK DKIAHNDYNI SPSRYIHITE
EEDFRPIAEI LEELEVLEKE AAETNKLLMK VLGRYQ