Gene AnaeK_2069 details


Gene Information

Locus tag: AnaeK_2069
Symbol:
ID: 6786451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. K
Kingdom: Bacteria
Replicon accession: NC_011145
Strand:
Start bp: 2325720
End bp: 2327291
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 73%
IMG OID: 642763529
Product: protease Do
Protein accession: YP_002134426
Protein GI: 197122475
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
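
The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. The sketch below assumes 1-based, inclusive coordinates (an assumption about the database convention, though the listed values agree with it) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2325720, 2327291)
print(length, protein_length(length))  # 1572 523
```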


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.719287
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCTCA TCACCCGAAT CGTCACCGTC TCTCTCGCAG CCGCAGCGAT CTTCGCGTGC 
ACGCGGGACG GCAGCGCGGC GACCGCATCC GCCGCGCCCG CGGCGGCGCC GGCCCAGCTG
TTCCGCGACG CCGCGGCGGC CGCGCCCGGC CCCGAGGCCG CCATCCCGGT GCAGACCTCG
CTCGCGCCGC TCATCGACAA GCTCCGCCCG GCGGTGGTGA ACATCTCCAC CACCACCGTC
ACCAAGCACC CGCGCGTCCA GCGCGGCCCA CGCGGCCAGA ACCCGCACGG CGGCGGCACG
CCGGACGAGG GCTTCGAGGA CTTCTTCGAG CGCTACTTCG GCCGCCCCGC GCCGGAGATG
CCCGAGGAGT TCAAGGGCTC GTCGCTCGGC TCCGGGTTCC TGCTCAACAC CGAGGGCTAC
ATCCTCACCA ACAACCACGT GGTGAAGGAC GCCACCGACA TCCGCGTGCG CCTCTCGGAC
GACCGCGAGT TCGGCGCCAG GATCGTCGGC CGCGATCCGC TCACCGACGT GGCGCTCATC
CAGCTCGTGA ACCCTCCGAA GAACCTGCCG ACGGTGGTGC TCGGCGACTC CGACGCGCTC
CGCCAGGGCG ACTTCGTGCT CGCGCTGGGC AGCCCGTTCG GCCTGCGCGA CACGGCCACG
CTCGGCATCG TGTCGGCGAA GCACCGCCCC GGCATCAACC CCGGCGGCAC CTACGACGAC
TTCATCCAGA CCGACGCCGC CATCAACCCC GGCAACTCGG GCGGCCCGCT GTTCAACCTC
CGCGGCGAGG TGGTCGGCAT CAACACCGCC ATCGTGTCGC CGCAGATCGG CCAGGGCATC
GGCTTCGCGG TGCCCATCAA CATGGCGAAG GCGCTGCTGC CGCAGCTCAA GGAGAAGGGC
AAGGTCACGC GCGGCTTCCT GGGCGTGTCG GTGTCCGACC TCTCGCCGGA TCTCATCCAG
GGCTTCGGCC TGCAGTCCGG CACCAAGGGC GCGCTGGTCC AGAACGTGGT CCCGCGCTCG
CCGGCCGACA AGGCGGGGCT GCAGCCCGGC GACGTGGTCG TCGCGCTGAA CGACAAGACG
GTCGAGACCG CCGGCGCGCT CACCCGCGGC GTCGCGCTGG TCGCGCCGGG CCAGACCGCG
AACCTGACCG TGCTGCGCGG CGGCCAGAAG AAGCAGTTCG CGGTGAAGGT CGTGCAGCGG
CCCGAGGACG GGGAGGCCGT CGGCCGCAAC GAGCAGGGCG GCGGCGACGA AGGCGGCGGG
CAGGGCGCCC GCGATCAGTC GCCGAAGCTC GGCGTCTCGA TCGCGCCCAT CACCCCGGAC
GTCGCGCGCC AGTTCGGCGT CGAGCCGGGC GAGGGCGTGG TGGTGGTGGA CGTCACCGAA
GGTGGCCCGG CCGATCGCGC CGGCATCCGC CGCGGCGACG TCATCCTCGA GGCGAACCGC
CAGAAGGTGG CGCGGCCGGA GGACATGCGG TCGGCGGTGG CGAAGCTGAA GGAGGGCGAC
ATGGCGCTTC TGCGCGTTCG CCGCGGCGAC GCCGCCGTGT TCATCGCGGT GCCGGTGGGC
GGCGGCAAGT AG
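
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any analysis. As a minimal sketch (the helper names `clean_sequence` and `gc_percent` are hypothetical), the following shows how the blocks could be joined and how a GC percentage like the one in the record could be recomputed. It is demonstrated on the first line only.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGCGCCTCA TCACCCGAAT CGTCACCGTC TCTCTCGCAG CCGCAGCGAT CTTCGCGTGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

The value for a single line need not match the record's 73%, which refers to the full 1572 bp sequence. Pasting the whole block into `clean_sequence` would be the way to check that figure.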
 
Protein sequence
MRLITRIVTV SLAAAAIFAC TRDGSAATAS AAPAAAPAQL FRDAAAAAPG PEAAIPVQTS 
LAPLIDKLRP AVVNISTTTV TKHPRVQRGP RGQNPHGGGT PDEGFEDFFE RYFGRPAPEM
PEEFKGSSLG SGFLLNTEGY ILTNNHVVKD ATDIRVRLSD DREFGARIVG RDPLTDVALI
QLVNPPKNLP TVVLGDSDAL RQGDFVLALG SPFGLRDTAT LGIVSAKHRP GINPGGTYDD
FIQTDAAINP GNSGGPLFNL RGEVVGINTA IVSPQIGQGI GFAVPINMAK ALLPQLKEKG
KVTRGFLGVS VSDLSPDLIQ GFGLQSGTKG ALVQNVVPRS PADKAGLQPG DVVVALNDKT
VETAGALTRG VALVAPGQTA NLTVLRGGQK KQFAVKVVQR PEDGEAVGRN EQGGGDEGGG
QGARDQSPKL GVSIAPITPD VARQFGVEPG EGVVVVDVTE GGPADRAGIR RGDVILEANR
QKVARPEDMR SAVAKLKEGD MALLRVRRGD AAVFIAVPVG GGK
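
The protein sequence follows from the gene sequence by codon-wise translation. The record specifies translation table 11, which uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons. The sketch below is a simplified illustration, not the database's own pipeline: it builds the standard table, translates until the first stop codon, and does no special start-codon handling (the gene here starts with ATG, so none is needed). `translate` is a hypothetical helper.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCGCCTCA TCACCCGAAT CGTCACCGTC"))  # MRLITRIVTV
print(translate("GGCGGCAAGT AG"))                     # GGK
```

The two outputs match the first ten and last three residues of the protein sequence listed above.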