Gene A2cp1_3939 details

Gene Information

Locus tag: A2cp1_3939
Symbol:
ID: 7297279
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter dehalogenans 2CP-1
Kingdom: Bacteria
Replicon accession: NC_011891
Strand:
Start bp: 4380571
End bp: 4383792
Gene Length: 3222 bp
Protein Length: 1073 aa
Translation table: 11
GC content: 72%
IMG OID: 643596748
Product: carboxyl-terminal protease
Protein accession: YP_002494328
Protein GI: 220919024
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
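
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4380571, 4383792)
print(length)                  # 3222, matching "Gene Length"
print(protein_length(length))  # 1073, matching "Protein Length"
```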


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0122378
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCCGCC CCTTCCGGCT GTTCGCCACC CTGGCGGCCT CGCTCGCGCT CGCGCTCGGC 
GTCACGGCCC GCTTCGTCCG CGCCGAGCCC GAGGCGCCTC CGCCGGCCGT CGCGTTCAAG
GGCGCCGCGC CGGTGCCGGC GCGCGCCGGC GGCAACGGCG ACTACCAGCT CGAGCGGCTG
CCCATCCTGT CGAGGGTGAT CCTCCAGGTG AAGGACAACT ACGTGGACCC GGGCCGCGTG
GATCCGAAGG GCATGGTGGT CGCGGCGCTC GAGGCGGTGG AGAAGACCGT CGCCGAGGTG
ATGGTGCAGG GCGACGAGAA GTCGCCCCGG CTCACGCTCA CGGTGGGCAG CGCCTCCCGC
GAGCTCGACA TCTCCGGCGT GAAGAGCATC TGGGAGATCC GCACCGTCCT CGGCGAGGCG
ATGGGCTTCA TCCAGCAGCA CCTCGTCGCC CACAAGGACC TGCGCGAGAT CGAGTACGCC
GCGGTGAACG GCCTGCTCCA GACGCTCGAC CCGCACACCG TCCTGTTCGA CCCGAAGTCG
TTCAAGGAGA TGAAGCTGCA GACCCGCGGC GAGTTCGGCG GGCTCGGGTT CGTGGTGGCG
ATGCGCGACA GCAACCTGAC CGTGGTCCGG GTGCTCCGGA ACACGCCGGC GCAGCGCGCC
GGGGTGAAGC CGAAGGACGT GATCGCGCGC ATCGAGGAGC AGTCCACCGT CAACATGGAC
CTGCAGGACG CGGTGGACCG CCTCCGCGGT CGGCCGCAGA GCAAGGTGGC CATCACCATC
CAGCGCCCGA CCCAGGAGCC GCGGCGGATG GTGCTGACCC GCGAGGTGAT CAGCATCGAG
ACGGTGCCGC AGGCGCAGCT CCTCGACGGC AACGTCGGCT ACGTCAAGCT CACCCAGTTC
TCCACCAACA GCACCCGCGA CCTGGTGCAG GCGCTCCAGC AGCAGCGCGC GCAGGCCGGC
GGCAAGCTGC AGGGCCTGGT GCTCGATCTC CGCGGCAACC CGGGCGGCCT GCTCGACCAG
GCCATCTCGG TCTCCGACCT GTTCCTCTCG GAGGGCGTGA TCGTCAAGAC GGTGGGCGAG
GGCGACAAGC AGCAGATCCA CGAGGTGAAG GAGGCGAGCG CCGAGCCCTC CGACCTCACC
GGCCTGCCCA TCGTCGTCAT CGTCAACAAC AGCACCGCCT CCGCCAGCGA GATCGTCGCC
GGCGCGCTCA AGAACAACGG GCGCGCGCTG GTGATCGGCC GCCAGAGCTT CGGCAAGGGC
TCGGTGCAGG TGCTCTACGA CTTCAGCGAC CCGAGCCGCC CCGCCGACGA GGCCGCGCTC
AAGCTCACCA TCGCGCAGTA CCTCACCCCG GGTGACGTCT CCATCCAGGA GGTGGGCATC
ACGCCGGACG TGCTGCTGCT GCCGGGCCGC GCGCTGAAGG AGCAGGTCAA CTACTTCGCC
CCGCCGCGCT CGATGGGCGA GGCCGACCTC GACCGCCACC TCACCAACCC GGCCGACCGC
ACCGTCCCCG AGGCGGCCCG CGCCGAGGCG CGCAAGAAGC GCCAGGACAA GCCGCCGCTC
GAGCTGCGCT ACCTGCTCGA CGAGAAGGAG GACCTGGTCG CGAAGCAGCT GAAGAAGGAC
GCGGCCGCCG AGGCCTCCCC GCACGGCGAC GTGACCGAGC TGACGCCGGA GCAGCAGGAG
GACGAGGACG CCGACGCCGA TCCGGATCGC TTCGTCGAGG ACTACCAGAT CCGCTTCGCG
AAGGACCTGC TGCGCCGCGC GCCCTACCCG GATCGCGCCC GCCAGCTCGA GGCCGCCAAG
GCGCTCGTGT CCGAGCGGCA CCAGCAGGAG GAGACGCGGC TGCAGAAGCG CCTCGCCGAG
CTGGGCGTCG ACTGGGCCGA CGGCCAGGCC AGCCGCGGCA ACCCGCGCGC GGTGGTGACG
GTGTCGCCGC CGCCGGGCAA GGAGCTGCGC GCCGGCGAGA CGATGCCCTG GACCGTGACG
GTCGAGAACC GCGGCGACGC GCCGTTCCGG CGCCTCCGCG CCTGGACCAC CGCCGACAAG
AACGGGCTGC TCGACCGCCG CGAGTTCGTG TTCGGCGCGG TGCGCCCGGG CGAGAAGCGC
ACCTGGACGG CCCCGGTGAA GCTGCCGAAG GGCATGGACA CCCGCCGCGA CGAGGTGACG
CTCCACTTCG AGGACGAGGG CGGCAAGGCG CCGCCGGACG TCACCACGGC GGTGGCGGTC
ACCGAGGTGC AGAAGCCGGT GTTCGCGTTC AGCGCCCAGG TGGACGACGC CAAGGGCGGC
AACGGCGACG GGCTCCCGCA GCGGGGCGAG ACGTTCAACC TCCGGGTGGA CGTGCGCAAC
GCCGGCCCGG GCGTCTCGGG CGACAAGACC TACGTGCTGC TGAAGAACCT CGGGGACGAG
AAGCTCTTCA TCAAGAAGGG GCGCGAGGTG ATCGGCGCGC TCAAGCCGGG CGAGGTGAAG
ACCGCCACCA TGGAGGTCGA GCTCCGCCGC GGCTCGAAGA GCGACACCTC GCCGGTCCGC
GTCACCATCG TGGACGAGAA GATGGACGAG TACGTCTCCG AGAAGCTCGA CCTGCCGGTG
GCGACGGACG AGCCGGCCCG CACCGCCGCG CACGGCGCGG TGCGCGTGGA GGTCGCGGAC
GCCCAGCTCC GCACCGGCGC GAGCGCCACC GCCCCGGTGA TCGCGGCCGC GCGCAAGGGC
GCGGTGCTGC CGGTGGACGC CCGCTTCGGC GAGTTCTACC GGGTCGAGTG GCAGAAGGGG
CGCTACGCGT TCGCCGCCGA GGGCGAGGTG AAGCCGCTGA AGGCGGCCGG GACGGCGCGG
AGCGGCGCCG TGGTCGAGGT GTGGCAGCGC GAGCCGCCGC GCATCGCGTT CTCGCCCGAC
CCGCTGAAGG GCGCGCCGGT GGTGGACGGC AACACCTTCA AGCTCCAGGG CACCGCCAGC
GTGCCGCCCT CGGCGGATCC GACGGCGCGC CTGCGCGACG TGTTCGTGTT CGTGAACGAG
CAGAAGGTCT TCTTCAAGGT CCAGCCGGAC ACGGCCACCT CCTCGAAGAT GGACTTCACC
GCCGACCTGC CGCTCAAGCC CGGCAACAAC GTCGTCACCG TGTTCGCCCG CGAGGACGAC
GAGTTCCAGA GCCGCCGGAG CGTGGTGGTG TTCCGCCGCA CGCCGCCGGA GGTCGCGGCC
GAGGCCGGGC GCGGGCAGGC GCAGCGGAGC AGCGGGCAGT AG
 
Protein sequence
MIRPFRLFAT LAASLALALG VTARFVRAEP EAPPPAVAFK GAAPVPARAG GNGDYQLERL 
PILSRVILQV KDNYVDPGRV DPKGMVVAAL EAVEKTVAEV MVQGDEKSPR LTLTVGSASR
ELDISGVKSI WEIRTVLGEA MGFIQQHLVA HKDLREIEYA AVNGLLQTLD PHTVLFDPKS
FKEMKLQTRG EFGGLGFVVA MRDSNLTVVR VLRNTPAQRA GVKPKDVIAR IEEQSTVNMD
LQDAVDRLRG RPQSKVAITI QRPTQEPRRM VLTREVISIE TVPQAQLLDG NVGYVKLTQF
STNSTRDLVQ ALQQQRAQAG GKLQGLVLDL RGNPGGLLDQ AISVSDLFLS EGVIVKTVGE
GDKQQIHEVK EASAEPSDLT GLPIVVIVNN STASASEIVA GALKNNGRAL VIGRQSFGKG
SVQVLYDFSD PSRPADEAAL KLTIAQYLTP GDVSIQEVGI TPDVLLLPGR ALKEQVNYFA
PPRSMGEADL DRHLTNPADR TVPEAARAEA RKKRQDKPPL ELRYLLDEKE DLVAKQLKKD
AAAEASPHGD VTELTPEQQE DEDADADPDR FVEDYQIRFA KDLLRRAPYP DRARQLEAAK
ALVSERHQQE ETRLQKRLAE LGVDWADGQA SRGNPRAVVT VSPPPGKELR AGETMPWTVT
VENRGDAPFR RLRAWTTADK NGLLDRREFV FGAVRPGEKR TWTAPVKLPK GMDTRRDEVT
LHFEDEGGKA PPDVTTAVAV TEVQKPVFAF SAQVDDAKGG NGDGLPQRGE TFNLRVDVRN
AGPGVSGDKT YVLLKNLGDE KLFIKKGREV IGALKPGEVK TATMEVELRR GSKSDTSPVR
VTIVDEKMDE YVSEKLDLPV ATDEPARTAA HGAVRVEVAD AQLRTGASAT APVIAAARKG
AVLPVDARFG EFYRVEWQKG RYAFAAEGEV KPLKAAGTAR SGAVVEVWQR EPPRIAFSPD
PLKGAPVVDG NTFKLQGTAS VPPSADPTAR LRDVFVFVNE QKVFFKVQPD TATSSKMDFT
ADLPLKPGNN VVTVFAREDD EFQSRRSVVV FRRTPPEVAA EAGRGQAQRS SGQ
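
The gene and protein sequences above are printed in space-separated blocks of ten. A minimal sketch of how such text could be cleaned, checked for GC content, and translated follows. It uses no external libraries; the helper names are hypothetical, and the codon table is the standard one, on the understanding that translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = clean("ATGATCCGCC CCTTCCGGCT GTTCGCCACC")
print(translate(fragment))  # MIRPFRLFAT, the first ten residues of the protein
print(round(gc_fraction(fragment), 3))
```

Passing the full gene sequence through `clean` and `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the full sequence can be compared with the GC content reported in the gene information.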