Gene Anae109_4223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_4223 
Symbol 
ID5377194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4949700 
End bp4952543 
Gene Length2844 bp 
Protein Length947 aa 
Translation table11 
GC content64% 
IMG OID640845751 
ProductDNA methylase N-4/N-6 domain-containing protein 
Protein accessionYP_001381385 
Protein GI153007060 
COG category[L] Replication, recombination and repair 
COG ID[COG1041] Predicted DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTCT GGGCTGGTAT GCACCTTCCC AGAATGCCCG CTCGCTCACG CCCAGCCAGG 
AACGCGGATT TCAACCTGTT CGTCGCCGAA CCGGTGTACT CGCAGCCGCG GCTGCCACAG
CTCTCGCGCC TCGAGGACCT GTCGAATCCC GCGGACGTGA TGCAGGCGCT CGGCGCGATC
GATTGGGCGT TCACGCACGA CGAGACCGGC TACCTCGGCC ACGATCTGCA TCCGTATCCC
GCGAAGTTCA TTCCACAGAT CCCGGCCAAC CTCATTGCTG CGCTATCGCT TCCCGGTGAG
CTCGTGTGGG ACCCGTTTGG CGGGAGCGGG ACGACAGCGC TAGAGGCGCT TCTGCTGGGA
CGTCAGGCAC TCTCTACCGA CGCAAACCCC TTGGCCGGCC ACATCGCACG CGCGAAGTGT
ACCGCGCTTG GACCTGAGCA GCGCGACGTG CTGAGGGCGC TCGGGCAGCG CGTTGCCGCC
CTCGCGCTCG ACCGCGGCCT CGAAGGCTTC CTTGAGCGTG CATGGGGGGC AGCGAAGGGC
TTTGTGCCGG ACATCCCGAA CTATGAACAG TGGTTCACCG CGCAGGCTAC ACGCGAACTC
GCGTACGTGC GGCAGCTCGC GAGCGCGATC GCGGATGACG ACGCTCGGCT GGTGTCTGAG
GTCGCTCTGT CCTCGATCGT TGTAGCGGTG TCGAATCAGG ATTCCGAAAC GCGCTACACG
CGGCGTGACA AAGGGCATCG CCCCGGAGAC GTCCTGCGGT TGTTCGCGAC CGCCCTTGAG
CGGATTCTCG TAGAGCACGA GCCCCTCGAG CGACTTCTCG GCTACCGCCG CGCTCGCGTC
GCGACTCTCG ACCTAAGGCA ACTTGACACG TCGCCGGACG CTCCTGAGCC GGAGTCGGTC
GACCTCATCG TCACGTCGCC GCCTTACGCG AACGCCACGG ACTATCACCT GTACCATCGG
TTCAGGCTTT TCTGGCTCGG GTTCGACCCG CGCGTACTGG GATCAGCAGA GATTGGCTCG
CATCTTCGCC ATCAGCGTGA GAAGCGCGGG TTCGATCTGT ACGCCGACGA AATGCTCGGA
TGTCTCGCCG GGATTGCGCG GCGTCTCCGG CCTGGCAGGT ACTGTGCCAT GGTAATCGGC
GGAGCGGTCT TCGATGGCAA GGAAGTTGAT TCGGCCGCCC GGCTCGGTGA GATCGGCACT
CAAGTCGGAC TGGAGTGGCT TGGAGCAGTC GAGCGGAAGA TTCACGCAAC TCGCCGCTCC
TTCGTTCCGG CCGCCCGCCG GTTGGGCGCC GAGCATATCG TCATTTTCAG GAAGCCGCCG
CGCAGCTTGA AGGTGACGTT CGAGTTGCCG AAGTACCGGT TGTGGCCGTA CGAGCATGAG
CTCAGGCTCC GGGAAGTTGA AAGGGTGGTC GGCGTCGCGC CCGTTTCGGC GGCCGAGGGC
AAACTGACGG CGACGTTGGA CTGCTACCGC GTTGACCGAG CTCGCCGCCT CGCCCTTACC
TCTGAGCTCA CGGTCGGGGC TGCGTGCACC GGTTGGCAGA CGTGGCAGGC TCGCCTCGAG
AACGGGGCAG CGGCGCGGCG AGACCCGAAG TACGTGACCC ATGGGATTCA CGACTACAAG
GGAAAGTTCT ACCCGCAGCT CGCGAAGACG CTCCTGAACC TCTCCGTCTC ACAGCCGGGT
TGCCGAATCC TCGATCCCTT CTGCGGCAGT GGGACGGTGC TCCTCGAGGC CCAGCTCAGC
GGCCACCGCG CAGTCGGATT CGACCTAAAT CCACTCGCAG TCCTCATCTC GCGCGCCAAA
ACTGCGGTCG CGACGGAGAA CGCACTGCTG CTCGACCGCG CACTGCGGTC GTTCTTGGAG
CAGCTCGGAG CGCCCGCGAC GGACACGGAC CTTGAGGTGT TTCCCGCCGC GACGCGCGAG
GAGGTCCTGA GTTGGTTCCC GCGTCCGGTC GCGCGTCGTC TAGGCAAAGT TAGGAGGCAG
GTCGAAGAGG TGCCGAACGA GACGGCGCAG CTTCTGCTGA AGGTGTTGCT CAGCAGTCTG
ATTCGAGAGG TGTCTCACCA GGAGCCGGCC GATCTGCGCG TGAGGCGCCG GAAGGAGCCG
CTCGCCGATG CGCCAGTCGA GATGCTCCTG CGCGCCAGGG TGGTCCGCTT TCGCGATCGG
TGTCGTCATT TCGGAGAACA GGCGGCAGCG GCCCCAAACC TGTTCCCGCC CGCACACGTC
GTGTCGCGGG ACTCAGGTAC CGAGGGCGCG GACTTGCGGG TCGGCGCCGA GGCGTATGAC
GCCGTGGTGA CGAGTCCCCC CTATGCGACG GCCCTTCCGT ATATCGACAC AGATCGGCTG
AGCTTGCTGT CACTGCTCGA TATTCCCAGC AACCTGCGCA GCTCGCTAGA GATGCAGCTA
ACGGGATCAC GCGAGATTCG GCAGCGAGAC CGAGCGGCAC TCGAGGCGGC CATCGACGGC
CTCGATGCGA CGATCGGGTC CCGGACGGCA TGTGCGATCG CAAAGAAGAT CCACCGGCAG
AATTCGGGCG CGGATGTCGG GTTCCGTCGA AAGAACATGG CGTCCTTGCT CATCCGGTAC
TTCACGTCGA TGTGGCGGAC TCTTTCGAAC CTGGACCGTG CCGTGCGCCC GGGGGGAGCC
ATAGCGATTG TGATCGGTGA CAACGTGACG CATACCGGCG CCGGTGAAGT CACAATCCAG
AGCTCGAAGG CTATTCAAGA GATGGGAGAT CGGCTCGGGT GGACTCTCGA TGCGTGCGTG
CCGATCACGG TGACGAAGGA GGCACGGTTG AACGCGCACC ACGCGATCAC AAGGAACGAC
ATTCTGTTGT TCCGGCGGAG GTGA
 
Protein sequence
MSVWAGMHLP RMPARSRPAR NADFNLFVAE PVYSQPRLPQ LSRLEDLSNP ADVMQALGAI 
DWAFTHDETG YLGHDLHPYP AKFIPQIPAN LIAALSLPGE LVWDPFGGSG TTALEALLLG
RQALSTDANP LAGHIARAKC TALGPEQRDV LRALGQRVAA LALDRGLEGF LERAWGAAKG
FVPDIPNYEQ WFTAQATREL AYVRQLASAI ADDDARLVSE VALSSIVVAV SNQDSETRYT
RRDKGHRPGD VLRLFATALE RILVEHEPLE RLLGYRRARV ATLDLRQLDT SPDAPEPESV
DLIVTSPPYA NATDYHLYHR FRLFWLGFDP RVLGSAEIGS HLRHQREKRG FDLYADEMLG
CLAGIARRLR PGRYCAMVIG GAVFDGKEVD SAARLGEIGT QVGLEWLGAV ERKIHATRRS
FVPAARRLGA EHIVIFRKPP RSLKVTFELP KYRLWPYEHE LRLREVERVV GVAPVSAAEG
KLTATLDCYR VDRARRLALT SELTVGAACT GWQTWQARLE NGAAARRDPK YVTHGIHDYK
GKFYPQLAKT LLNLSVSQPG CRILDPFCGS GTVLLEAQLS GHRAVGFDLN PLAVLISRAK
TAVATENALL LDRALRSFLE QLGAPATDTD LEVFPAATRE EVLSWFPRPV ARRLGKVRRQ
VEEVPNETAQ LLLKVLLSSL IREVSHQEPA DLRVRRRKEP LADAPVEMLL RARVVRFRDR
CRHFGEQAAA APNLFPPAHV VSRDSGTEGA DLRVGAEAYD AVVTSPPYAT ALPYIDTDRL
SLLSLLDIPS NLRSSLEMQL TGSREIRQRD RAALEAAIDG LDATIGSRTA CAIAKKIHRQ
NSGADVGFRR KNMASLLIRY FTSMWRTLSN LDRAVRPGGA IAIVIGDNVT HTGAGEVTIQ
SSKAIQEMGD RLGWTLDACV PITVTKEARL NAHHAITRND ILLFRRR