Gene Anae109_3987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3987 
Symbol 
ID5378133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4651893 
End bp4654796 
Gene Length2904 bp 
Protein Length967 aa 
Translation table11 
GC content68% 
IMG OID640845514 
Producthypothetical protein 
Protein accessionYP_001381149 
Protein GI153006824 
COG category[L] Replication, recombination and repair 
COG ID[COG1743] Adenine-specific DNA methylase containing a Zn-ribbon 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.375617 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAT CCGCGGGGAC ATTTCCGCGG CGCCTCATCG AGGTCGATCT TCCGATCGGC 
CAGATCTCGG AGTCCGCGCG CGATGGCCGC TCGGCACACC ACGGCCACAT CACGGCGATC
CACATCTGGT GGGCGCGCAA GCCTCTGCCC TCGTGTCGTG CCGCGGCCCT TGCTGCCGCC
TTGCTCGACC CCGCGGACTC CGCTTGCCCG CAGGAGTTCA GGGACAAGGC GCGTGTGGTA
CTCAGCCGTA TCTACCGGGG ACCAGACAGC GCGCAGCTCG AAGCAGAGGA ACTGCGACGC
GGGCTGCTCC GGCTCGTCTC GGACTTCGCG TCTTGGGACA GGGCGACCGA TCCAACGTAT
CGAGACGCGG CTCGCGAGCT TGTCGCGTCC GCGCACCAGG CGCTCTTCGG GAAAAACGGA
GGCGCGTTCC TCGCGGACCC GTTCTGTGGT GGCGGATCCA TTCCGCTCGA GGGGCTGCGG
CTCGGAATGT CGGCGTATGC GTCCGACTTG AATCCGGTCG CGACTCTCAT CAGCAAGGTC
ACGCTCGAGT ACGTCCAGCG GTTCGGCGAA AAGCTGTTCG AGGAGGTCGA GCGCTGGGGC
GCGCGAGTCG GCGAGGAGGC GCGGGGAGAG CTGAACGCGT TCTATCCCGT TGTGCAGGGG
CAGCAGCCGA TCGCGTCCAT CTCGTTCAGG CGCATCCGGT GCGAGGGCCC GAAGTGCGGC
GCGGACGTCC CTCTCACGAG CAAGTTCCAC CTGACGCGGC GCGGTGACCG GTCTGTCGGG
CTCCGGCTGG ACGGATGGGA GGGGCCCACG CCCCGCTTCA GCATCGCGGA AGGCCCCCTC
GGCTCGTTTC CGGACCCCAC CGTTCGGCGG GGGGCAGCAA CTTGCCTGAA GTGCGGGTAC
ACGACGCCGG TCGAGCGCAT TCGCGCTCAG CTCTCGGAGC GGGGCGGCGG TGCAGACGAC
GCGCTTCTCG TCGCGGTCGC GGTAGGAGAA GAGTCGGGCG AGCGAACGTT CAGGCGTCCG
GCGAAGGCCG ACCTGAATGC GATTGCCGCT GCGAAGAAGA AGGTGGCGCT CCTCCGGCGG
AGAGGCGACT TGGGCCTGCC AGAGCTCCCC GACGAGCCGC TGCCTCCGGT GGGAACACTC
GGATTCCGCG TGCAGCGGTA CGGCATGCTC CGGTGGCGCG ACATCTATAC GCCGCGGCAG
CTCGTCACGA TCACGACGCT GGTGCGCCTG GTTCAAGGCG TGATGGCCGA GGATCGGGCG
GCGCACGGGC TCGGTGTGGC GGTACGAGCC TGTCTCGCGC TCGCCGTCGA CCGTCTATGC
GATTACCAGA ACACGGGTTG CTCCTGGAAT CCAAGCGGCT CGGCGTTGCC GCACCTCTTC
ACCCGCCAGG CCCTCCCGAT CATCTGGGAC TTCGGCGAGG CGAATCCGCT CGCATCGTCC
TCCGGGTCGT GGGCGGGCGC CGTCGAGCAC GTGCTCCGTG GGTTGAGGAA CGCGCACGTG
AGCACCGGGG CGGCGGACGT GGGCATGGCG TCTGCGAGCC ATCACCCGTT GCCGTCCGAC
TGCGCGCATC TCCTCGTGAC GGATCCGCCT TACTACGACG CGATACCGTA CGCGGACCTC
TCCGACTTCT TCTACGTCTG GCTCCGGCGC GTGCTCGGTC CCGATCATCC CGAGCTCTTC
AAGACGCCTC TGGTGCCTCG GGACGACGAG TGCATCGTCA ACCCGGCCAC GGGGAAGGAC
CGGGCCTACT ACCGCCGCGT CATGACCGCC GCCCTCACCG AGGCGAGGCG TGTCACCCGG
CCGGATGGCA TCGGCGTCGT CATCTTCGCG CACAAGTCGA CGAGCGGCTG GGAGGACCTG
CTAGCTGCGA TGCTCGACGC GGGATGGGTC GTCACGGCGT CGTGGCCGAT CGACACGGAG
AACGCCGGCC GGCTGCGCGC GCGCAACTCG GCCGTGCTCG CCTCGTCGGT TCATCTCGTG
TGCCGGCCGC GCGAGTATCC AGATGGGCGG CTCATCACGG ATACGGTGGG TGATTGGCGG
GACGTGCTCG CGGCGCTTCC CAAGCGAATC GCGGAGTGGA TGCCGCGGCT CGCTCGCGAG
GGCGTCGTGG GCGCCGACGC GATCTTCGCG TGCCTCGGCC CGGCCCTCGA ACTCTTCTCT
CGGTATGCGC GCGTCGAGAA AGTTTCGGGC GAGGTCGTGC CGCTCGAGGA GTACCTCGAG
CACGTATGGG CGGCTGTCTC GCGCGAGGCG CTGGCGATGA TCTTCGACGA AGCCGAGTCG
GCTCGCCTCG AGGAGGACGC GAGGATCACG GCCATGTGGC TGTGGACGCT TGCATCAGAC
CGAGCTCGAG CCTCGGCCGC GGACGACGGC GGCGGGGATG CGCCGGCGAG CGACGACGCC
GAATTGGACG CAGACGGAGC GGATGGTTTC AGCCTCGAGT ACGACGCCGC GCGAAAGATC
GCCCAGGGGC TGGGCGCCCG TCTCGAGCAG CTCCCCCATG TCGTGGTGGT CAAAGCGAAC
CAGGCGCGGT TGCTGTCGGT CGCAGAGCGA ACCAAGCACC TGTTCGCAGG CGACGAAGGA
GTCGTGCCGG CGAAGCGGGC CTCGAAGAAG CAGATGGGGT TGTTCGCCGA GCTGCAGGAA
GCCGCGAAAG CGCAGGGCTG GGGCGAGCGC GGCGCGCCGA AGGCCGCCGC GACCACCTTG
GACCGAGTAC ACCAGGCCAT GCTCCTGTTC GCATCGGGTC GCGGCGAAGG GCTGAAACGC
TTCCTCGTCG CGGAGGGCGT CGGAAACCAG GGCCAGTTCT GGACCCTCGC GCAGTCCCTA
TCCGCCCTCT ACCCAACCGG ATCGGAGGAG AAGCGGTGGG TGGACGGAGT GCTCGGGCGC
AAGAAGGGGC TCGGCTTCGG ATGA
 
Protein sequence
MSASAGTFPR RLIEVDLPIG QISESARDGR SAHHGHITAI HIWWARKPLP SCRAAALAAA 
LLDPADSACP QEFRDKARVV LSRIYRGPDS AQLEAEELRR GLLRLVSDFA SWDRATDPTY
RDAARELVAS AHQALFGKNG GAFLADPFCG GGSIPLEGLR LGMSAYASDL NPVATLISKV
TLEYVQRFGE KLFEEVERWG ARVGEEARGE LNAFYPVVQG QQPIASISFR RIRCEGPKCG
ADVPLTSKFH LTRRGDRSVG LRLDGWEGPT PRFSIAEGPL GSFPDPTVRR GAATCLKCGY
TTPVERIRAQ LSERGGGADD ALLVAVAVGE ESGERTFRRP AKADLNAIAA AKKKVALLRR
RGDLGLPELP DEPLPPVGTL GFRVQRYGML RWRDIYTPRQ LVTITTLVRL VQGVMAEDRA
AHGLGVAVRA CLALAVDRLC DYQNTGCSWN PSGSALPHLF TRQALPIIWD FGEANPLASS
SGSWAGAVEH VLRGLRNAHV STGAADVGMA SASHHPLPSD CAHLLVTDPP YYDAIPYADL
SDFFYVWLRR VLGPDHPELF KTPLVPRDDE CIVNPATGKD RAYYRRVMTA ALTEARRVTR
PDGIGVVIFA HKSTSGWEDL LAAMLDAGWV VTASWPIDTE NAGRLRARNS AVLASSVHLV
CRPREYPDGR LITDTVGDWR DVLAALPKRI AEWMPRLARE GVVGADAIFA CLGPALELFS
RYARVEKVSG EVVPLEEYLE HVWAAVSREA LAMIFDEAES ARLEEDARIT AMWLWTLASD
RARASAADDG GGDAPASDDA ELDADGADGF SLEYDAARKI AQGLGARLEQ LPHVVVVKAN
QARLLSVAER TKHLFAGDEG VVPAKRASKK QMGLFAELQE AAKAQGWGER GAPKAAATTL
DRVHQAMLLF ASGRGEGLKR FLVAEGVGNQ GQFWTLAQSL SALYPTGSEE KRWVDGVLGR
KKGLGFG