Gene Acid345_2567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2567 
Symbol 
ID4070530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3030188 
End bp3031366 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content60% 
IMG OID637984584 
Productpeptidase M48, Ste24p 
Protein accessionYP_591642 
Protein GI94969594 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTGA AACGCCTGCT AGTGACGGCT GCTTTCGCGC TTTCAAGTTT CGGCATGGCG 
CAGTCCGCGC CCGCGCATCC TGACACTCAA TCCCAGCAAC CCGCAGCCAC ACAAGGTGGT
GAGACGCCAG ACGCGCAGCA GGCACCCGCC GATGACAGCA CAGCCGCGCC CGCCGCGGGC
ACCGACTCCA CCGCGACCAA GGATCCTCTC GCCAAGGAGG CGACCGAAAA TCTGCCGAAA
GCGAGTGGCG TGAAACACGA TGGCTCGCGC GATGACGTCT CGGCGATTGG CAATCGTCAC
ATCGGTCAGG GAACAAGTCC GGGTGACTGG TACTCGCTGG AGAAAGAAAT TCGGATGGGC
AAAGGCTATT CCGAACAGAT CGAAGCCGGC ATGAAGCTGG TGCAGGACCC GGTTGTGACG
GAATACGTCA ACCGCATCGG CCAGAACATC GTTCGCAATT CCGACGCGCG CGTACCGTTC
ACGATCAAGG TCGTGGATTC GGACGAGATC AATGCCTTCG CGCTGCCGGG CGGCTTCTTC
TACGTGAATT CAGGGCTGAT CCTGGCGGCT GATGAAGAAG CGGAACTCGC CGGCGTGATG
GCCCACGAGA TCGCTCACGT TGCTGCGCGT CACGCGACAC GGCAGATGAC CCGGGGTCAG
ATCGCGAACT TGGCGACGAT TCCTCTGATC TTTGTCGGTG GTGGTCTCGG CTATGCGCTG
CAGTCCGCGG TCAGCCTCGC ACTGCCAATG ACCTTCCTCA AGTTCTCGCG CGGCTTTGAA
GCCGAAGCCG ATTACCTCGG CCTCCAGTAC ATGTACGCGA CGGGCTACGA TCCGCAGGCG
TTCATTTCCT TCTTCGAAAA GGTGCAGGCG AAGGAAAAGA AGAAGCCGGG TACGTTGGCG
AAGGCGTTCT CAACCCATCC GCAAACGCCG GACCGCATTG AGAAATCTCA AGAAGAGATT
GCTCGCATCC TGCCCGCCCG CGACCAGTAC ATAGATGACA CTTCGGAATT TCAGACAGTG
AAATCCCGTC TTGCCGCCCT CGAGAATCGG CATAAGGTCG AGGACCAGAA GGAAAACCGT
CCGACGCTGC GGCGCACCAC GGCTGACAAC ACTGGCAGTA CGGACAGCAA GGGCACGAAC
GACGATGATC GCCCGACGTT GAAAAAGCGC GACGACTAA
 
Protein sequence
MKLKRLLVTA AFALSSFGMA QSAPAHPDTQ SQQPAATQGG ETPDAQQAPA DDSTAAPAAG 
TDSTATKDPL AKEATENLPK ASGVKHDGSR DDVSAIGNRH IGQGTSPGDW YSLEKEIRMG
KGYSEQIEAG MKLVQDPVVT EYVNRIGQNI VRNSDARVPF TIKVVDSDEI NAFALPGGFF
YVNSGLILAA DEEAELAGVM AHEIAHVAAR HATRQMTRGQ IANLATIPLI FVGGGLGYAL
QSAVSLALPM TFLKFSRGFE AEADYLGLQY MYATGYDPQA FISFFEKVQA KEKKKPGTLA
KAFSTHPQTP DRIEKSQEEI ARILPARDQY IDDTSEFQTV KSRLAALENR HKVEDQKENR
PTLRRTTADN TGSTDSKGTN DDDRPTLKKR DD