Gene Acid345_4761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4761 
Symbol 
ID4070699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5625676 
End bp5627103 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content60% 
IMG OID637986805 
Productmicrocin-processing peptidase 2 
Protein accessionYP_593834 
Protein GI94971786 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.58875 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0125504 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAGC AGTTCTTCTT TCAAAAATAC GGACTTACTG AACAGAACGT CGAAAAGTAC 
CTGGGCGCGG CGCTCTCGGC CGGCGGTGAC TATGCCGACC TCTATTTCGA ATACCTGACT
ACGACTTCTA TATCGCTGGA CGAGTCAATG GTGAAGTCGG CCTCGCAGGG GATTTCCGCC
GGGTGTGGGG TGCGCGTGGT CGCGGGTGAA CGGACGGGCT ATGCCTATAC CGACGACCTG
GCGCCGGAAA AGATCCTACA TGCGGCCAAG ACCGCGGCGC TGATCGCGAG TGGACCAGCG
AAGACGCCGA TCGTAAACGC CAACGAGCTC AGGAAGAAGG CCGACCTGTA TCCGATTACG
AGCGGAGTGC TGGGCTCGGA CGTCCTGGGC AAGCTCGAAC TGGTTAAGCG CTCGGACGAA
GCGGCGCGAT CCTACGATTC GCGGATCCAG GAAGTGCGCG TCAGCTACGC GGATGAGCTG
CGCAAGATCC TCGTTATTGG TTCCGACGGG ACGTTCGCCG AAGATGTGCA GCCGCTCTCG
CGGATGAGCG TGTTCTGCAT TGCAAAGAAC GATCTCGGTT CAGCTCGCGG CAGCGCCGGT
GGCGGCGGAC GCGTCGGAAG CGAATACTTC GAGGGTGAAG CCTCACCGGA ACACTTCGCG
AAGGAAGCCG CGCGACAAGC CATCATCCAA CTCGACGCTC GCGAGGCGCC GGCGGGAGAG
ATGGAAGTTG TGCTGGGACC GGGATGGCCG GGAATTCTCC TGCACGAAGC CATTGGTCAC
GGGCTGGAAG CGGACTTCAA CCGCAAGAAG ACATCGGCGT TTGCCGGATT GATGGGGCAA
CGCGTGGCGA GCGAGAAATG CACCGTGGTG GACAACGGAA CTATGCCGAG CCGTCGCGGA
TCGCTCAACG TGGACGACGA AGGCAATCCG ACAAACAACA CCGTGCTGAT CGAGAATGGC
ATTCTTAAGG GCTATCTCAC GGACAAACTC TCGGCGCGGC TGATGGGCAT GGCGAACACT
GGCAACGGAC GCCGCGAAAG TTATGAGCAC ATTCCCATGC CGCGCATGAC CAACACCTAC
ATGCTCGCGG GGCAGGACGA TCCCGAGGAC ATCATCAAGA GCGTGAAGTA CGGCGTATAT
GCCGTGAACT TCGGCGGCGG ACAGGTGGAC ATCACGAACG GCAAGTTCGT GTTTGCCGCG
AGTGAGGCAT ACCTGATTGA GAACGGCCAG GTCACGGCGC CGCTCAAGGG CGCGACGCTG
ATCGGCAATG GGCCCGACGT GCTGACGCGC GTCAGCATGG TTGGCAACGA CCTGAAGCTC
GATGAAGGCG TGGGTACTTG CGGCAAGGAT GGGCAGTCGG TGCCTGTCGG CGTGGGTATT
CCTACGTTGA AGGTGGATCG GCTGACGGTC GGAGGCACGG GACGATGA
 
Protein sequence
MDKQFFFQKY GLTEQNVEKY LGAALSAGGD YADLYFEYLT TTSISLDESM VKSASQGISA 
GCGVRVVAGE RTGYAYTDDL APEKILHAAK TAALIASGPA KTPIVNANEL RKKADLYPIT
SGVLGSDVLG KLELVKRSDE AARSYDSRIQ EVRVSYADEL RKILVIGSDG TFAEDVQPLS
RMSVFCIAKN DLGSARGSAG GGGRVGSEYF EGEASPEHFA KEAARQAIIQ LDAREAPAGE
MEVVLGPGWP GILLHEAIGH GLEADFNRKK TSAFAGLMGQ RVASEKCTVV DNGTMPSRRG
SLNVDDEGNP TNNTVLIENG ILKGYLTDKL SARLMGMANT GNGRRESYEH IPMPRMTNTY
MLAGQDDPED IIKSVKYGVY AVNFGGGQVD ITNGKFVFAA SEAYLIENGQ VTAPLKGATL
IGNGPDVLTR VSMVGNDLKL DEGVGTCGKD GQSVPVGVGI PTLKVDRLTV GGTGR