Gene Apre_0185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0185 
Symbol 
ID8396936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp210708 
End bp211718 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content41% 
IMG OID644994523 
Productmetalloendopeptidase, glycoprotease family 
Protein accessionYP_003151958 
Protein GI257065702 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGATT TTTATACTAT GGGAATTGAG ACAAGCTGTG ACGATAGTTC TGTTGCGATT 
CTCAAAAACG ATAGAGAAGT ATTGGTGAAT TTGATATCAT CACAGATTGA TATTCATGCA
CTTTTCGGAG GAGTTGTGCC AGAGATTGCA AGTAGAAAGC ATCTCGAAGC TATAAATCCC
CTAATAGAGA AGGCCTTAGC CGATACTAAT TTAAGTTATG ATGACATAGA TCTTATATCT
GTAACCAAGG GACCAGGACT TATGGGGTCG CTCTTGGTTG GGATTTCTGC AGCTAAGGGT
CTATCTCTTG CTACAGGTAC TCCTTTGATT GGTGCCAATC ACATGCAAGG GCATATTTGC
GCCAACTATT TGTCAAACAA GGACCTAGAA CCTCCCTTCA TAAGTCTAGT CGTATCCGGA
GGTCATACCT ACCTATGTAA GGTCAATTCC TACACTGACT ATGAAGTCAT AGGAAAAACC
TTAGATGATG CAGCAGGAGA ATCCTTCGAT AAGGTTGCAA GAAAAATTGG ACTAGGCTAT
CCAGGAGGAC CAAAGATCGA TAAGCTAGCC AAAGAAGGAA ATAAGGACGC TATAGACTTT
CCTAGGGTGA TGTTAGATAA GGGATCTTAT GATTTTTCCT TCTCAGGTCT TAAGACAGCA
GTCCTAAACT ACGCCCACAA GCTTGAACAA AGGGGAGAAG AAGTAAACAA GGCTGACCTT
GCAGCGAGCT TTCAAGAAGC TGTTGTCGAT GTCTTGGTAG ATAAGTCCAT GATGCTTCTT
AAAGAAACAG GCCTTAAGAC TCTTGCCGTA AGCGGGGGAG TTGCTGCAAA CTCTAGGCTT
AAGGAAAGAC TTAAGGAAGA ATGCGATAAG GAAGGAATCA AATTCTACCA TCCATCTGTA
ATTTTGTGCA CAGATAATGC GGCAATGATT GCCATGGCGG GTTTCTTAAA TTATAAAAAC
GGAGTCGTAG ACGATAATTT CATGAAAGTC TACCCGAATT TGGAATTATG A
 
Protein sequence
MSDFYTMGIE TSCDDSSVAI LKNDREVLVN LISSQIDIHA LFGGVVPEIA SRKHLEAINP 
LIEKALADTN LSYDDIDLIS VTKGPGLMGS LLVGISAAKG LSLATGTPLI GANHMQGHIC
ANYLSNKDLE PPFISLVVSG GHTYLCKVNS YTDYEVIGKT LDDAAGESFD KVARKIGLGY
PGGPKIDKLA KEGNKDAIDF PRVMLDKGSY DFSFSGLKTA VLNYAHKLEQ RGEEVNKADL
AASFQEAVVD VLVDKSMMLL KETGLKTLAV SGGVAANSRL KERLKEECDK EGIKFYHPSV
ILCTDNAAMI AMAGFLNYKN GVVDDNFMKV YPNLEL