Gene Anae109_4126 details

Gene Information

Locus tag: Anae109_4126
Symbol:
ID: 5375436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4827453
End bp: 4829549
Gene Length: 2097 bp
Protein Length: 698 aa
Translation table: 11
GC content: 76%
IMG OID: 640845653
Product: peptidase S41
Protein accession: YP_001381288
Protein GI: 153006963
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID:
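
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4827453
end_bp = 4829549
gene_length_bp = 2097
protein_length_aa = 698

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree with one another.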


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.257296
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.0023076
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTGGCCG GCGCCGCACA TCGGGCGACA CGCGCCCAGC TCGACGCACA GTCGGCCGCG 
CCGTCCACGC GCGAGCGGGC ATGCCCGGCT TCGGAGGGGT GTTGGTGGAG TCGCGCGAGG
GCCGCCGAGG CTGGTGGGGC CCGCTCGCAC CTCCGACCAC CGATGGCGCT GCCCGCACTG
CTCTCGCTCC TCTTCGCCGT CGCCGCGACG CCCGCGGCGG CCCACGCGTC GGCCGTCCCG
CCTCCGCCCG CCCTCGGCGG CGCCGCACCC GTCGAGCCCG CGGCGTCGCC GGCGGCCCCC
GCCGCCGACG TGTGCGACGG GACCGCCTCC CTGCCGGAGG ACCTGGCCTG CGGCGACGCC
CCGCTGTGCT CCGCGCGCCA GCGGGTGAAG CTCGCGTGCG ACCTGCGCGA CGCGATGGAG
AAGCGATACG TCTTCTTCCC GGTGAAGGGG CGCCTCCTCG CGCGCGACGG CAAGCCGAGC
TTCGACTCGC GCGCGCACCT CGACGCGTGC GTGGCCGAGG AGCGCGCCAT CCCGCGGGAG
GACGAGCCGT TCCGCTTCTA CGACCGGATG CGCCGCTGCA CCGCCGCCTT CGAGGACGGG
CACCTGCTCT TCGGCGCGCC GGTGCGGCTC CCGCAGGTCG CGCTCGGGTT CGGCCTGCGG
CTCACCGCGG GGGGCGAGGT CCTCATCGCG AACCGCGAGA AGAGGCTCCT CGGCCACCTC
GCGAAGATGC CGGGGCTCCG CGGGATCGAC GAGGTCCTCG CGGTGGGCAA CGAGGTGCTG
GAGATCGACG GGGCGCCCGC GGCGGAGCGG GTCGAGGAGC TCGCCCGCTA CGTGCAGGGG
AGCTCGCGTG CCGCCCGCCT GGAGCGGGCG GTCGACGCGA TGACGCGCCG CGACTTCGCC
TACCCGCGGC GGCGCACCAC GACGATCGCG GTGTCGGTGC GCGGCGTCCG CCGGGTGGTG
GAGCTGCCCT GGTGGATCTC TCCCGACGCC GCCCGGCACC CGATGACGGA GCGCTACGTC
CGCCGCACCG GGATCGCCAC GAACGAGCGG CTCGCCTGGC GGTACGACAA GGGCGCCGAC
ACCTGGGCGC GCGATCCGGG CGCCGCCGAG GGGTACCTGC GGACGGACAC CATCCTGCCG
GCGCGAGACG CCGCGGCGCT CGCCGAGTAC CTGGACGAGG GCGACCGCGT CGCGATGCGG
CTCGGCGAGG TGGTGCGCCG CCGCGATCGC GCCTTCTGCT ACGCGCAGGT GCTGTCGTTC
CACACCGAGC AGCTGCACTC GGACGGCGGG GAGCCGCGCG CGTTCACCTC GGTGCTCGAG
CGCTTCGTCA GCGAGTGCAA GGGGAAGGGC CTCGACCTCG TGGTGGACCT GCGCCAGAAC
CAGGGCGGCT ACATCGCCCA CTCGAGCTCG CTGTTCGCGA TGCTCTCCCG CCCCGGGGAG
GCCTACCCCG GCGGCGCCCT CCTGCTGCGC GCGACCACGC AGAACCAGCT CGTCTACGAG
CAGCGCATGC CCACCGCGAG CGCCGCGCCC GCCGGCGACG CGGAGGACGC CCTCGCGCCC
CGCAAGATCG TCGAGGCGAT CGGGCGCGCG CGGCGCGACC GCCGAGACTA CACGCCCGCC
TTCCTGGAAG GTCCGGTGCA CCCGAGCGAC GCGGTGGGCG GCTTCGCGGG GCGGGTGGTG
GTGCTCACCA GCCCGAGCTG CATGAGCGCC TGCGACCGGC TCGCTGGCCT GCTGCGCAGC
GCGCGGCGCG CCGTGCTCGT CGGCGGGCCG ACGGAGGGCG CGGGCGGCAG CCAGCAGGAG
GCGAAGAACC TGGGCGCGCG CTGGTCGGAT CCCGACGGGC TCCTGTCCGT GTCGATCCCG
AACGCGGCGA TGGGCGTGCA GCCCCTCGAG GCGGATGCGC CCCTCGCGGC GCAGGCCGAG
CGCTCCGCCG AGGAGTTCTT CACCGCGCTC GCCTTCGAGA ACCGGCCCGT GCAGCCGCAC
GTGCCGTACG CGACCACGGC GGCGGACGTG GCCGGACACA ACGGCGGCTG GCTCGAGCAG
GTGGAGGCCG TCCTGTTCGA GGGCGTCGCG GTCCGCGAGG GGATCGCGGG GTTCTAG
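
The gene sequence is printed in space-separated blocks of ten bases, so the spacing has to be stripped before any downstream use. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used, so the value it produces describes those 60 bases and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = (
    "ATGGTGGCCG GCGCCGCACA TCGGGCGACA CGCGCCCAGC TCGACGCACA GTCGGCCGCG"
)

seq = clean_sequence(first_line)
gc = gc_fraction(seq)

print(len(seq), "bases, GC fraction:", round(gc, 3))
```

Pasting the full sequence into `first_line` would give a figure that can be compared with the GC content field in the record.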
 
Protein sequence
MVAGAAHRAT RAQLDAQSAA PSTRERACPA SEGCWWSRAR AAEAGGARSH LRPPMALPAL 
LSLLFAVAAT PAAAHASAVP PPPALGGAAP VEPAASPAAP AADVCDGTAS LPEDLACGDA
PLCSARQRVK LACDLRDAME KRYVFFPVKG RLLARDGKPS FDSRAHLDAC VAEERAIPRE
DEPFRFYDRM RRCTAAFEDG HLLFGAPVRL PQVALGFGLR LTAGGEVLIA NREKRLLGHL
AKMPGLRGID EVLAVGNEVL EIDGAPAAER VEELARYVQG SSRAARLERA VDAMTRRDFA
YPRRRTTTIA VSVRGVRRVV ELPWWISPDA ARHPMTERYV RRTGIATNER LAWRYDKGAD
TWARDPGAAE GYLRTDTILP ARDAAALAEY LDEGDRVAMR LGEVVRRRDR AFCYAQVLSF
HTEQLHSDGG EPRAFTSVLE RFVSECKGKG LDLVVDLRQN QGGYIAHSSS LFAMLSRPGE
AYPGGALLLR ATTQNQLVYE QRMPTASAAP AGDAEDALAP RKIVEAIGRA RRDRRDYTPA
FLEGPVHPSD AVGGFAGRVV VLTSPSCMSA CDRLAGLLRS ARRAVLVGGP TEGAGGSQQE
AKNLGARWSD PDGLLSVSIP NAAMGVQPLE ADAPLAAQAE RSAEEFFTAL AFENRPVQPH
VPYATTAADV AGHNGGWLEQ VEAVLFEGVA VREGIAGF