Gene Acid345_4760 details

Gene Information

Locus tag: Acid345_4760
Symbol:
ID: 4070698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5624297
End bp: 5625679
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 62%
IMG OID: 637986804
Product: microcin-processing peptidase 1
Protein accession: YP_593833
Protein GI: 94971785
COG category: [R] General function prediction only
COG ID: [COG0312] Predicted Zn-dependent proteases and their inactivated homologs
TIGRFAM ID:
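
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon excluded from the protein length. Both are common conventions but are not stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5624297
end_bp = 5625679

# Assumption: both coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1383 460
```

Under these assumptions the computed values agree with the listed gene length (1383 bp) and protein length (460 aa).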


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.447651
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0115347
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCACTA CGACCCCGGT CCAAAGCGTC TCGACGGTAG ACCTGCGCGA ACTCTCCGCG 
GACGTAGTAC GCAAGGCGAT GAAGGGCGGC GCGACCGCCG CCGAAGTCGT GCTGCGCGAC
GGGTCGGAGT TCTCGACAAC CGTGCGTCTC GGCGAAGTGG AGACGCTGAA GGAATCGGGC
TCGCGTTCGA TGGGCGTGCG CGTGTTCTTC GGTAAGCGCT CGGCGAGCAC CTGGTCGAGC
GACTTCTCGC CGCAGGGCAT CGACAACATG GTGCGCGGTG CGCTCGATTT GGCGAAGATC
ACTTCGGAAG ACCCGTTCTC GGGGATCCCG GAACCCGAGC AACTTGGGCA GATCACAACC
AACCTCGATC TGTATTACGA GGATGTTTAC TCGCTTTCGA CGGCCGATCG CATTGATTAC
GCGAAGCGCG CGGAGCGGGC CGCAATGGAA GCCGATCCGC GGATCCAGAA CTCCGATGGA
GCCAGCTTCG ACGCAGCGAA TGGCGTGAAG GTACTCGCGA ATTCGAATGG CTTCATCGGC
GATTACAGCC GCTCGTACTG CTCGGTGTCG GCGGTGCCGA TTGCGCAGCA GGAAGGCTCA
GCGATGCAGC GCGACTACTG GTTCTCGGTT GCGCGGACGC TGAGCATGCT CGAATCGCCG
GAAGACGTGG GCCGCGAAGC GGCGCGCCGT GCGCTGCGAA GGTTGGGCGC GAAGAAGGTG
AAGACGGCGC GGGTTCCGAT CGTGTTCGAT CCGCTGGTGG CTGGGTCACT GCTCGGGCAT
ATCTTCGAAG CGGTGAATGG CGACAGCGTG TATCGTGGCG CGTCGTACCT GGCGGGCAAG
CTCAACGAGA AAATTGCCGG CGCCAATGTG ACGATCGTCG ACGATGGAAC GATGGTCGGC
GGTTTCGGGT CGAGTCCCTT TGATGCCGAG GGTGTGCCGA CGCGACGCAC AGTGGTGATT
GAGAAGGGCA TTTTGAATTC TTACCTGCTC AATACCTACA CTGCGAAGAA GTTGAAGCTA
CAGACGACGG GTAATGCCGC GCGCGGATTG GCGGGCACCC CGGGAATCGG CGCCGGGAAC
TTCTTTATGG AGCCTGGCAC GCGCACGCCG CAGCAGATTT TTGCCGATGT GAAAGACGGT
TTCTACGTCA CGGAATTCCT TGGCTCCGGC GTGAACCTGG TGACCGGCGA TTTTTCGCGG
GGCGCCAGCG GCGTGTGGAT CCAGAACGGG GAACTCACGT TCCCGGTGGA AGAGGTCACG
GTTGCCGGGA ATCTGCGCGA CATGCTGCAC AGCGTGGTGG AGATCGGCAA CGATCTCGAA
TTCCGTGGGT CGGTGGCCTG TCCGACTATG CGGATTGATG GGATGACGGT GGGCGGCGAG
TAG
 
Protein sequence
MSTTTPVQSV STVDLRELSA DVVRKAMKGG ATAAEVVLRD GSEFSTTVRL GEVETLKESG 
SRSMGVRVFF GKRSASTWSS DFSPQGIDNM VRGALDLAKI TSEDPFSGIP EPEQLGQITT
NLDLYYEDVY SLSTADRIDY AKRAERAAME ADPRIQNSDG ASFDAANGVK VLANSNGFIG
DYSRSYCSVS AVPIAQQEGS AMQRDYWFSV ARTLSMLESP EDVGREAARR ALRRLGAKKV
KTARVPIVFD PLVAGSLLGH IFEAVNGDSV YRGASYLAGK LNEKIAGANV TIVDDGTMVG
GFGSSPFDAE GVPTRRTVVI EKGILNSYLL NTYTAKKLKL QTTGNAARGL AGTPGIGAGN
FFMEPGTRTP QQIFADVKDG FYVTEFLGSG VNLVTGDFSR GASGVWIQNG ELTFPVEEVT
VAGNLRDMLH SVVEIGNDLE FRGSVACPTM RIDGMTVGGE
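
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method used to produce the record. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
# Amino acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence above.
first_row = "ATGAGCACTA CGACCCCGGT CCAAAGCGTC TCGACGGTAG ACCTGCGCGA ACTCTCCGCG"
print(translate(first_row))  # MSTTTPVQSVSTVDLRELSA
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.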