Gene Anae109_3916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3916 
Symbol 
ID5378306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4565694 
End bp4568939 
Gene Length3246 bp 
Protein Length1081 aa 
Translation table11 
GC content72% 
IMG OID640845441 
Productcarboxyl-terminal protease 
Protein accessionYP_001381079 
Protein GI153006754 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0708032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.35207 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCGATC CCAGACCCCC TACGCCCAGG CGCCCCCGCA TGATCCGCCC GTTCCGGCTG 
CTCACCACCC TCGCGGCCAC GGCCGCCCTG GCGCTCGGCG TCACCGTCCG GTTCGTGCGG
GCGGAGCCCG ACGCCGCCCC CGCGGTCGCG TACCGCGGCG CCGTGCCGGC CCACGCGAAG
GCCGACGCCG CCGGCGAGTA CCGGCTCGAT CGCCTGCCGA TCCTCTCCCA GGTGATCCTG
AAGGTGAAGG ACAACTACGT GGACCCCGCG CGGTTCGACC CGAAGAAGAT GGTCGTGGCG
TCGCTGGAGT CCGTCGAGCG AGCCGTCGCG GAGGTGATGG TCCAGGGCGA CGAGAAGTCG
CCCAAGCTGA CGCTCACCGT CGGGTCGTCG CAGCGGGACC TGGACCTCAC GGGCGTGGAC
TCGATCTGGA AGATCCGCCT CGTGCTGGGC GAGGCGATGG GCTTCATCCA GGACCACCTC
GTCGCGCACA AGGAGCTGAA GGACATCGAG TACGCGGCGG TGGCCGGGCT GCTCTCCACC
CTCGATCCGC ACACGAACCT CCTCGAGCCG AAGTACTTCA AGGAGATGAA GCTCCAGACC
CGCGGCGAGT TCGGCGGGCT CGGCTTCGTC ATCGCCATGC GCGACGGGAA CCTCACCGTC
GTGAAGGTGC TCAAGAACAC GCCGGCGCAG CGCGCCGGCA TCAAGGCGAA GGACGTCATC
GCTCGCATCG AGGAGCAGTC CACCGTCAAC ATGGACCTGC AGGACGCGGT GGATCGGCTG
CGCGGCAAGC CGCAGACGAA GATCTCGATC ACCGTCCAGC GCAAGGGCGC CGAGGCGCGC
AAGCTGAGCC TGCTGCGGGA GGTCATCAAC GTCGAGACCG TCGCGCAGGC GAAGCTGCTC
GAGGGCAACG TCGGCTACGT GCGGCTCTCG CAGTTCTCCG CCAACACCAC CCGCGACCTC
CTCGGCGCGC TCCAGCAGCA GCGGGCCCAG GCGGGCGGCA AGCTCGAGGG GCTCGTCCTC
GACCTGCGCG GGAACCCGGG CGGCCTGCTC GAGCAGGCGA TCCAGGTCTC GGACCTCTTC
CTCTCGCAGG GCGTCATCGT GAAGACGGTG GGCGGCGGCG ACCGGCAGCG CATCCACGAG
GTGAAGGAGG CGAGCAGCGA CGCGAGCGAC CTCGCGACGC TCCCCCTGGT CGTCATCGTG
AACAACAGCT CCGCCTCGGC GAGCGAGATC GTGGCGGGGG CGCTCAAGAA CAACGACCGC
GCGCTCGTCA TCGGGCGCCA GACCTTCGGC AAGGGCTCGG TGCAGGTCCT CGACGACCTC
GACGACCCCA CCGGCTCGGG CGAGCAGTCG GCGCTGAAGC TCACCATCGC GCAGTACCTC
ACCCCGGGTG ACCTCTCCAT CCAGGAGGTC GGCATCACGC CCGACGTGCT GCTCCTCCCC
GGCCGCGCGC TGAAGGAGCA GGTGAACTTC TTCGCGCCGC CGCGCTCGAT GGGCGAGGCG
GATCTCGACG GCCACCTGAT GAACCCGGGC GGCGCGGGGC CCGCCGAGGT CGCGAAGGCC
GAGGCGAAGA AGCGGCGCGT CGAGAAGGCG CCGCTCGAGC TGCGCTACCT GCTCGACGAG
AAGGAGGACC AGGTCGCGAA GGCGCTGAAG CGCGACCTCG CAGGCGACGC CGCCGCGGCC
CACGAGGACG TGTTCGAGCT CACGCCCGAG CAGGTCGAGG ACGAGGAGGC GGAGGCGGAT
CCCGACCGGC TCGTGGAGGA CTACCAGATC CGGTTCGCCC GCGAGCTCCT GAAGCGCGCG
CCGCAGCCCG CGCGCGCCCG CCTGATGGAG TCGGCGAAGG CGCTCGTGGC GGAGCGGCAG
GCGCAGGAGG ACGAGCGGCT CGAGAAGCGG CTGCGCGAGC TCGGCGTCGA CTGGACCGCC
GAGGCGCCGG CGGGCCGCGG CACGCCGCGC CCGGTCGTCA CGCTCACCCC GGCGCCCGGG
AAGGATCACC GCGCCGGCGA GACGGTCTCC TGGACGGTGA CGGTGGAGAA CAAGGGCGAC
GCGCCCTTCC GCCGGCTCCG CGCCTGGACC ACCGCCGAGA AGAACGCGCT GCTCGATCGG
CGCGAGTTCG TGTTCGGCAA CGTGCCGCCC GGCCAGCGCC GGAGCTGGAC CGTGCCGCTC
AAGCTGCCGA AGGGGATGGA CAGCCGGCGC GACGAGATCA CGCTCCACTT CGAGGAGGAC
GGCGGACACG CGCCCGCCGA CCTCCTGACG AGCGTCGGCG TGGTGGAGGT CGCGAAGCCG
GTGTTCGCCT TCAGCGTGCA GATCGACGAC CGCCAGTTCG GCAACGGCGA CGGGCTCGCG
CAGCGCGGAG AGACCTTCGA CGTGCGCCTC GACGTGCGGA ACGCCGGGAC CGGCGCCGCC
GGCGACAAGA CCTGGGTGTC CTTGAAGAAC CTCGGCGACG AGAAGCTCTT CGTGAAGAAG
GGGCGCGAGA GCCTCGGCGC GCTCAAGCCG GGCGAGACGA AGTCGGCGCG CATGGAGGTC
GAGCTCCGCC GCGGCTCCAA GAGCGACACG CTCCCGATCC GCGTGATGAT CGTCGACGAG
AAGATGGAGG AGTACGTCTC GGAGAAGCTC GACTGGCCGA TCGCGAAGGA CGAGCAGGCC
CGGACGCCGG CCTCGGGCGC GATCCGCGTC GAGGTGGCCG AGGCGACGCT CCGCTCCGGC
GCGAGCGCGG CGGCGCCGGC GATCGCGACC GCGAGGCGCG GGGCGCTCAT GCCGGTGGAC
GCGAAGATCG GCGAGTTCTT CCGGGTCGAG TGGCAGAAGG GCCGGTTCGC CTTCGTCCCC
GACTCCGAGG TCCGCGCCGC CCGCGGCACG CGCTCCGGGA CGATCGCCGC CGCGTGGCAG
CGGGAGCCGC CGCGCATCGC CCTCGTGCCG GACCCGCAGC GGGGCGCGCC GGTCGTCGAG
GGGGACAGCT GGAAGCTCCA GGGCAGCGCG CTCGTGCCGC CGTCGGCGGA CCCCGACGCG
CGCCTGCGCG ACGTGTTCGT GTTCGTGAAC GAGCAGAAGG TGTTCTTCAA GGTGCTGCCC
GAGGACGCGA CCACCTCCAA GCTCGACTTC TCGGCGGACA TCCCGCTCCA GGCGGGCAAC
AACGTCGTCA CCGTGTTCGC GCGGGAGGAC GACGAGTTCC AGAGCCGCAG GAGCATCGTG
GTCTACCGGC GCCCGCCCGC CGAGGTGGCG GCGGACGGCA CGCGCAAGAC GAGGCAGGCG
CAGTAG
 
Protein sequence
MPDPRPPTPR RPRMIRPFRL LTTLAATAAL ALGVTVRFVR AEPDAAPAVA YRGAVPAHAK 
ADAAGEYRLD RLPILSQVIL KVKDNYVDPA RFDPKKMVVA SLESVERAVA EVMVQGDEKS
PKLTLTVGSS QRDLDLTGVD SIWKIRLVLG EAMGFIQDHL VAHKELKDIE YAAVAGLLST
LDPHTNLLEP KYFKEMKLQT RGEFGGLGFV IAMRDGNLTV VKVLKNTPAQ RAGIKAKDVI
ARIEEQSTVN MDLQDAVDRL RGKPQTKISI TVQRKGAEAR KLSLLREVIN VETVAQAKLL
EGNVGYVRLS QFSANTTRDL LGALQQQRAQ AGGKLEGLVL DLRGNPGGLL EQAIQVSDLF
LSQGVIVKTV GGGDRQRIHE VKEASSDASD LATLPLVVIV NNSSASASEI VAGALKNNDR
ALVIGRQTFG KGSVQVLDDL DDPTGSGEQS ALKLTIAQYL TPGDLSIQEV GITPDVLLLP
GRALKEQVNF FAPPRSMGEA DLDGHLMNPG GAGPAEVAKA EAKKRRVEKA PLELRYLLDE
KEDQVAKALK RDLAGDAAAA HEDVFELTPE QVEDEEAEAD PDRLVEDYQI RFARELLKRA
PQPARARLME SAKALVAERQ AQEDERLEKR LRELGVDWTA EAPAGRGTPR PVVTLTPAPG
KDHRAGETVS WTVTVENKGD APFRRLRAWT TAEKNALLDR REFVFGNVPP GQRRSWTVPL
KLPKGMDSRR DEITLHFEED GGHAPADLLT SVGVVEVAKP VFAFSVQIDD RQFGNGDGLA
QRGETFDVRL DVRNAGTGAA GDKTWVSLKN LGDEKLFVKK GRESLGALKP GETKSARMEV
ELRRGSKSDT LPIRVMIVDE KMEEYVSEKL DWPIAKDEQA RTPASGAIRV EVAEATLRSG
ASAAAPAIAT ARRGALMPVD AKIGEFFRVE WQKGRFAFVP DSEVRAARGT RSGTIAAAWQ
REPPRIALVP DPQRGAPVVE GDSWKLQGSA LVPPSADPDA RLRDVFVFVN EQKVFFKVLP
EDATTSKLDF SADIPLQAGN NVVTVFARED DEFQSRRSIV VYRRPPAEVA ADGTRKTRQA
Q