Gene Acid345_0572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0572 
Symbol 
ID4073061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp699667 
End bp700836 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content59% 
IMG OID637982577 
ProductSte24 endopeptidase 
Protein accessionYP_589651 
Protein GI94967603 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0501] Zn-dependent protease with chaperone function 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.209932 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACTG TTTCGACCCT AAAGCCCGAC AGCCTCGAGG CACGCCGCTA TAACCGTCTC 
AAACGATGGC TGGAGGTGTC AGACCTGATC GTCGGTTTCG TGCTGCTGCT GGCCCTGGTC
CTCACCCACG GCAGCGCGCG GCTGCGCGAC CTCGCGTATC TCGCGTCGCG ACAGTATTAC
TCCATCGCCG TATTCATGTT CGTGCTTTTC CTGCTGCTCA TCAGCAAGGT GCTCTCGCTG
CCGATCGATT ATTACGGCTT CCGCCTGGAG CACGAGTTCA AGCTCTCGAA TCAGAAACCT
GGCGCGTGGC TTTGGGATGA GTTGAAGGGC TGGCTCGTCG GGCTGGTTAT CCTTACGATT
CTCGTCGAGG TACTTTACGC GACGATCCGT CTCTACCCGG ATTATTGGTG GCTGGTTGTG
TGGGCAGTAT TCATCGGGTT CACCGTCCTG CTGGCGCAGC TTGCGCCGGT GGTGTTATTC
CCGATCTTCT ACCGTTTTGA GCCGCTGAAA AACGATGCCC TCCGCGAGCG ACTGGTGAAG
CTCGGAGAGA AGGCGGGAAC CAAGGTCCGC GGCGTGTACG AGTGGAAGAT CTCGGAGAAA
TCGAAGAAGG CAAATGCGGC GCTGACGGGC CTGGGGAAAA CGCGGCGAAT CATTATCGCC
GATACTCTGC TCGAAAATTA CAGCGACGAC GAGATCGAGG CGGTGCTGGC GCATGAGCTG
GGACATCATG TGCACGGGCA CATCGCGAAG GGAATCCTGG TGCAGGTGGG GATTACGTTC
GTGGGCTTCT GGGCGTCGCA CATCATCCTG CGGTATGTCG TGGACCAGCG TCAGATGTTT
CAGTCAATGT CGGACTTTGC GAACTTGCCC CTATTGGCGC TGATTGCCGC GGTGCTGGGT
TTGGTGCTGA CACCGGTGCT GAACGCGTAC TCGCGCTACA ACGAGCGGCA GGCCGACTCG
TATGCGTGGA AGTCGATACC CTCGGTTGAG CCATTCGTGA CGTCGATGCA CAAACTAGCG
AGCCAGAATT TGGCAGAAGA GAACCCGGCG CGATGGATCG AAGTGCTGTT CCACTCGCAT
CCTACGATTG CGAAGCGAGT GGAAGCGGCG GAGAAGTGGC GGGAGCGGCA GGCCGTCCCG
CCAAGCGAGA CACCCGCGAC ATCGGTTTAA
 
Protein sequence
MNTVSTLKPD SLEARRYNRL KRWLEVSDLI VGFVLLLALV LTHGSARLRD LAYLASRQYY 
SIAVFMFVLF LLLISKVLSL PIDYYGFRLE HEFKLSNQKP GAWLWDELKG WLVGLVILTI
LVEVLYATIR LYPDYWWLVV WAVFIGFTVL LAQLAPVVLF PIFYRFEPLK NDALRERLVK
LGEKAGTKVR GVYEWKISEK SKKANAALTG LGKTRRIIIA DTLLENYSDD EIEAVLAHEL
GHHVHGHIAK GILVQVGITF VGFWASHIIL RYVVDQRQMF QSMSDFANLP LLALIAAVLG
LVLTPVLNAY SRYNERQADS YAWKSIPSVE PFVTSMHKLA SQNLAEENPA RWIEVLFHSH
PTIAKRVEAA EKWRERQAVP PSETPATSV