Gene Anae109_3988 details

Gene Information

Locus tag: Anae109_3988
Symbol:
ID: 5378134
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4654793
End bp: 4657741
Gene Length: 2949 bp
Protein Length: 982 aa
Translation table: 11
GC content: 62%
IMG OID: 640845515
Product: hypothetical protein
Protein accession: YP_001381150
Protein GI: 153006825
COG category: [L] Replication, recombination and repair
COG ID: [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon
TIGRFAM ID:
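
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function name is hypothetical and is not part of any database API.

```python
def gene_and_protein_length(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # the stop codon is not translated
    return gene_length, protein_length


# Values taken from the record above.
gene_len, protein_len = gene_and_protein_length(4654793, 4657741)
```

Under these assumptions the result agrees with the Gene Length (2949 bp) and Protein Length (982 aa) fields of this record.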


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.172712
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTAACG GGCACCAGCC GAGGCGGCTG ATCGAGGTTG ATCTCCCTAT TCGTGTCATC 
TCGCAGCACG CCCGTCACGA AAAGTCGATC CGCCACGGTC ACCTATCGAC GCTACACATC
TGGTGGGCAA GGCGTCCTCT CGCCGCCTGC AGAGCAGTCG CGTTGGCAGC GCTCCTTCCC
GATCCTGCGG ACGAGTATTG CCCCAAGGAA TTCAGGGAGG AGGCGGCCGC AGCGCTGTCG
TGCCTTCGCG ACGCCGTCGG AGGTCCGAGG GTTGATTGGA ATTCGGAAAC CGAGCTCCGA
AAAGCATTGC TACGCTTCGT TGGAGATATC GCCGCGCACG AAAACGCATC GTCGGGTCCC
GTGCTTGATG CGGCGCGACG CATGGTTCTG TCGGCACAGT TGAGTCTGCA CCCCGGTCGT
ACTGACAGAC CGCTTATCGT TGATCCGTTC GCCGGTGGCG GGGCGATCCC AGTTGAGGCG
CTACGGCTTG GGGCCGACGT GTTTGCGTCG GACCTCAATC CGATCGCCGT ACTTCTTAAC
CGGCTGAGTG CGGAGCTTCT GCCAAAGTTC GGGGCACAGT TGGCTGACGA ACTCGAGCGC
TGTGGCGAAT GGGTGGCGTC ACGAGCTGAG CAAGAGTTGC GTCGCTTCTA TCCGGCCGGT
TCTGATGGGA GTGCTCCCAT TGCCTACCTA TGGGCGCGCA CGATCAGGTG TGAGGGGCCG
GGGTGTGGTG TTGAGCTGCC GCTGATCCGC TCCACGGTTA TTGCGCGGAA GTCGGGTCGG
TCCATGTTCC TCAAGTTGCG CGTCGTTAAG AGCGCTAATC GAATAGATTT CGCAATCGAG
GAGGGCACAC CATCTGCGGC AGAGGCGCTG GGAACCATTA AGCGCGGGTC AGCCACCTGC
CCGTTATGTG GGTTCACTAC GGCGAACGCG AGGTTGCGAG CGCAGCTATC GGAACGCCGC
GGCGGTGCAG CAGACGCTCG GCTTCTGGCC GTAGTCTCGA CTAAACGGGG AGAGCAAGGA
AGGAAGTACA GGCTCCCGAC AACCAAAGAT GTCGAGGCAT TCGCCCAAGC GCAGAACGAG
CTACGAAAGC GCCAGTCCTC GTTCGAGGGG GCCATTCCGC TAGTACCTGA TGAGCTAGTC
CCGGCGGAGC GGCCGTCCCC TAATGCGAGA GGGCTGTCCG CGGTTACACG TATGGGAGTT
CGGACCTTCG GCGACCTGTT CACGCCACGG CAACTCCTCG CTCATACGAC GTTCGTGCGT
TTGTGCCGCG AAGCGGGTGC GGACATCGGC TCGCCTGAAA TGAGGAAGGC GGTCCGCCTT
TGTCTGGCGT TGTCTCTCTC GAAGGCGACA GATCTAGGGA ATTCGTGCAC TCGATGGAAG
CCGGATGCGG AATGCCCCGT TAACCTGTTT GCGCGGCAAG CCATCCCGAT CGTGTGGGAT
TTTGCGGAAA CGGTTTCGCT GTCCGATGCG AGTGGGTCCT GGCGAAGCAT GTTCGAGCGA
ACCGCATACG CACTTCGGCA GTGCTCGTTC GAGGCGCCTG GAAAAGCGAC AGTGCAATCG
GCTTCGGCGG CTGAACACCC GCTGCCGGAT GACGCCGCGG CTGCGCTTGT TACGGACCCT
CCTTACTACG ACGCTGTCCC GTACGCCGAT CTTTCTGACT TCTTCTATGT CTGGCTGCGG
CGTGTCCTCT TCGACGACGC CCCTGACCTC TTCAGTTCCC GGACAACGCC GAAGGACGAG
GAAGCGATTT GGAACCCGAC TCGCAAGTAC GGGCCGACCG GACGTCAGAA GGATCAGGCC
TTTTATGAGG AGCAGATGTA TCGGTGCCTC GCGGAGGCAA GGCGCGTTAC GGCCCCCGAT
GGCATCGGGG TCGTGGTGTT CGCGCACAAG AGCACGGAGG GATGGGAGGC AATTCTCGGC
TCGCTGATCC GCGCAGGTTG GGTCGCGACG GCCTCTTGGC CGATCGATAC GGAGATGGGA
AGCCGGGTCA ATGCGATGGG GACCGCATCG CTGGCGTCTT CTGTTCACAT CGTGTGCAGG
CCCAGGGGAG TTGATCAAGC ACACGTCGGT GAATGGAAGG TCGTCCTAGC TGAGCTCCCT
GAGCGTATCC ACCAATGGCT TCCGCGCCTA GCTCACGAGG GAGTTGTGGG TGCAGATGCG
ATTTTCGCGT GTCTCGGGCC AGCATTGGAG ATCTTCTCTC GCTACTCGCG TGTCGAGAAG
GTGAACGGCG AGGCGGTGCC GCTTCGCGAG TACCTGGAGC ACGTCTGGGC AGCAGTAGCT
CGCGAGGCGC TCGCGTCGAT CTTTCGAGAC GCGGACACTG CCGGCCTCGA GGCAGATGCG
CGCCTGACTG CGATGTGGCT GTGGACTCTT GCTGGACCCG AGCCGAGCGG TGACTCGGGC
GATGAGCAGG ATCAGGTGCC CGACGAGGAT GAGGATGACG ATCAGGGGGA CAGGGGCGGA
TCTGGGGGTG CGGTTCTTCC GTTCGACACC GCGAGGAAGA TCGCGCAGGG ACTCGGTGTC
CGATTCGATG AACTTCAGCA AGTAGTCGAA ATCAAGAAGG ACAAGGCTCG CCTTATTGCA
GTAGCCGAAC GCGCGAAATA CTTGTTCGGC AGGCACGAAG GCGTGCCTGC CGGCAAGAAG
GCGGCCGCCA AGAAACAGGC GGTGCTATTC ACGGACCTCG AGCGACCTGC CGGCGAGGAG
GCGTGGGGAG AGGGCGGCGC GCCAAAGGCT GGAACGACGA CTCTCGACCG CGTGCATCAA
GCGATGCTTC TTTTTGGGGG CGGGCGCAGT GATGCGCTGA AGCGCTTCCT TGTGGAAGAT
CGCATCGGCA TGCAGGCGCA ATTCTGGAAG CTAGCGCAGT CGCTGTCGGC TCTTTATCCG
AGCGGTTCCG ACGAAAAGCG GTGGGTCGAC GGCGTCCTTG CTCGAAAGAA GGGACTTGGC
TTCGGATGA
 
Protein sequence
MVNGHQPRRL IEVDLPIRVI SQHARHEKSI RHGHLSTLHI WWARRPLAAC RAVALAALLP 
DPADEYCPKE FREEAAAALS CLRDAVGGPR VDWNSETELR KALLRFVGDI AAHENASSGP
VLDAARRMVL SAQLSLHPGR TDRPLIVDPF AGGGAIPVEA LRLGADVFAS DLNPIAVLLN
RLSAELLPKF GAQLADELER CGEWVASRAE QELRRFYPAG SDGSAPIAYL WARTIRCEGP
GCGVELPLIR STVIARKSGR SMFLKLRVVK SANRIDFAIE EGTPSAAEAL GTIKRGSATC
PLCGFTTANA RLRAQLSERR GGAADARLLA VVSTKRGEQG RKYRLPTTKD VEAFAQAQNE
LRKRQSSFEG AIPLVPDELV PAERPSPNAR GLSAVTRMGV RTFGDLFTPR QLLAHTTFVR
LCREAGADIG SPEMRKAVRL CLALSLSKAT DLGNSCTRWK PDAECPVNLF ARQAIPIVWD
FAETVSLSDA SGSWRSMFER TAYALRQCSF EAPGKATVQS ASAAEHPLPD DAAAALVTDP
PYYDAVPYAD LSDFFYVWLR RVLFDDAPDL FSSRTTPKDE EAIWNPTRKY GPTGRQKDQA
FYEEQMYRCL AEARRVTAPD GIGVVVFAHK STEGWEAILG SLIRAGWVAT ASWPIDTEMG
SRVNAMGTAS LASSVHIVCR PRGVDQAHVG EWKVVLAELP ERIHQWLPRL AHEGVVGADA
IFACLGPALE IFSRYSRVEK VNGEAVPLRE YLEHVWAAVA REALASIFRD ADTAGLEADA
RLTAMWLWTL AGPEPSGDSG DEQDQVPDED EDDDQGDRGG SGGAVLPFDT ARKIAQGLGV
RFDELQQVVE IKKDKARLIA VAERAKYLFG RHEGVPAGKK AAAKKQAVLF TDLERPAGEE
AWGEGGAPKA GTTTLDRVHQ AMLLFGGGRS DALKRFLVED RIGMQAQFWK LAQSLSALYP
SGSDEKRWVD GVLARKKGLG FG
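
The gene and protein sequences above are related through translation table 11, the bacterial, archaeal and plant plastid code. The following minimal sketch shows how the first and last codons of the gene map onto the protein. It assumes the standard NCBI amino acid assignments for table 11, and that an alternative start codon such as GTG is read as methionine when it initiates translation. The helper names are hypothetical, and only short excerpts of the sequence are used.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments of NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by table 11; each initiates with methionine.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str, first_codon_is_start: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_codon_is_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, as printed in the record.
first_line = "GTGGTTAACG GGCACCAGCC GAGGCGGCTG ATCGAGGTTG ATCTCCCTAT TCGTGTCATC"
n_terminus = translate(first_line)

# Last 9 bases of the gene sequence; not a start, so no methionine override.
c_terminus = translate("TTCGGATGA", first_codon_is_start=False)
```

Here `n_terminus` corresponds to the first 20 residues of the protein sequence (MVNGHQPRRL IEVDLPIRVI), and `c_terminus` to its final two residues (FG), with the terminal TGA read as a stop codon.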