Gene Acid345_3298 details


Gene Information

Locus tag: Acid345_3298
Symbol:
ID: 4072710
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3905994
End bp: 3907529
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 61%
IMG OID: 637985319
Product: peptidase S1C, Do
Protein accession: YP_592373
Protein GI: 94970325
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
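
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length_bp(3905994, 3907529)
    print(length)                              # 1536
    print(expected_protein_length_aa(length))  # 511
```

Under these assumptions the computed values agree with the Gene Length (1536 bp) and Protein Length (511 aa) fields of this record.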


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTGA ACAATCTGTT GGGAACATTG AAGGCGAAGA TCGGCCGCCG TTTCTATGCC 
AGCGTGCTGG CAGGAGCGGT AGCGTTTTCT CTCGCGAGCT ATGAATTTGC TGGCCACGCA
CGCGCAGCAA CACCGAGTCC GGCAGCCGCA GCGCTCGACG ACAACAGTGT GGGCGCGCTG
CTCTCGCTCG ACAAAGCCAT GGAGAACCTC GCCGCGCGCG TAACGCCGGC GACAGTGAAC
GTAACCGTTA CGTCGAAACG TTCCGCACAT AACGCATCCA TGCAAGGCAC GGAGGATGGC
GACGACGATG GCAATCCGAT GCAGCAGTTC GGACCATTTC AGTTCGGACC GCAGCGTCGC
CAGCCGCAAT ACGAGCATGG CCTCGGTAGC GGCGTGATCA TCTCGCCGGA TGGATACATC
GTCACCAACA ACCACGTGAT CGATGGTGCG ACTGACATTC GCGTCACCCT CACGGACAAA
CGCATCCTGC CCGCGAAATT GATCGGCGCC GATCCGCTGA CTGACCTGGC TGTAATCAAA
GTCGAGGGCA GCAATATGCC AAGCGTGCCG CTTGGTGATT CGACTTCCCT GCATCCGGGC
CAGACGGTGC TTGCCTTCGG CAATCCGCTT GGCTTCCGCT TCACGGTGAC GCGCGGCATC
GTCAGCGCAT TGAATCGGCC GAATCCCTAC GCGCAGGACC GTCGTTCTCC GGGACAGTTC
ATTCAGACCG ACGCGGCGAT CAATCCCGGC AACTCCGGTG GGCCGCTGGT GAACGCCCAC
GGTGAAGTGA TCGGGATCAA CACGTTCCTC ATCTCGGAGA CCGGCGGATT CTCGGGAATG
GGATTCGCGA TTCCCACGCA GATCGTGAAG CCGACGGTGG ACAGCCTGAT CAAGTACGGC
AAAGTGAACC ATGGATACAT GGGCATCGGG ATCAGCGATG TATCGCCGGA CGAGGCGAAG
TTCTTCAACG TGACCGACGC AAACGGCGCC GTGGTAACGC AGGTGGAACC GAATTCGCCG
GGCGCGAAAG CCGGCTTGAA GGTTGGTGAC ATCATCACTG CTGTGAACGG CAAGCAAGTC
GCAGACGCCG GCGCACTGCA AGTGGAAGTG GGCCAGCAGC AGCCCGGGAC CAAACTCGAC
CTGACGGTGA AGCGCGACGG CAAAGCCTCG ACGCTGAACG TAACGCTGGC CTCGATGGAC
AAAGGTGATC GGGACAACGA AACGGCGAGC GCAGGTCATG GCAAGCCGCG GTGGGGAATC
GGATTAGCCG ACCTGTCGCC GGAAGCGCGT CAGCAGTTGC AGGCGGGTGA TTCGGTGCAG
GGTGCGCTGG TAGGACAAGT AACGCCCGGC AGCCCAGCCG ACAACGCGGG ATTGCAGCCC
GGCGATGTGA TCACCGAGGT GAATCGCAAG CCGGTGAAAT CTGCGAGTGA CGCAAAGGAC
GCGCTGAGCG GAATCGCGAA TGGCGGTGAC GCGCTGGTAC TGGTGTGGTC GCGCGGCGGC
AGCAGCTTCC GGGTGTTGCA CGCGAGCCAG GGATAG
 
Protein sequence
MKVNNLLGTL KAKIGRRFYA SVLAGAVAFS LASYEFAGHA RAATPSPAAA ALDDNSVGAL 
LSLDKAMENL AARVTPATVN VTVTSKRSAH NASMQGTEDG DDDGNPMQQF GPFQFGPQRR
QPQYEHGLGS GVIISPDGYI VTNNHVIDGA TDIRVTLTDK RILPAKLIGA DPLTDLAVIK
VEGSNMPSVP LGDSTSLHPG QTVLAFGNPL GFRFTVTRGI VSALNRPNPY AQDRRSPGQF
IQTDAAINPG NSGGPLVNAH GEVIGINTFL ISETGGFSGM GFAIPTQIVK PTVDSLIKYG
KVNHGYMGIG ISDVSPDEAK FFNVTDANGA VVTQVEPNSP GAKAGLKVGD IITAVNGKQV
ADAGALQVEV GQQQPGTKLD LTVKRDGKAS TLNVTLASMD KGDRDNETAS AGHGKPRWGI
GLADLSPEAR QQLQAGDSVQ GALVGQVTPG SPADNAGLQP GDVITEVNRK PVKSASDAKD
ALSGIANGGD ALVLVWSRGG SSFRVLHASQ G
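
The gene and protein listings above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout and confirm that the nucleotide sequence translates to the listed protein. It is an illustration under stated assumptions: the helper names are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG).

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First and last lines of the gene sequence in this record.
    first = "ATGAAAGTGA ACAATCTGTT GGGAACATTG AAGGCGAAGA TCGGCCGCCG TTTCTATGCC"
    last = "AGCAGCTTCC GGGTGTTGCA CGCGAGCCAG GGATAG"
    print(translate(clean_sequence(first)))  # MKVNNLLGTLKAKIGRRFYA
    print(translate(clean_sequence(last)))   # SSFRVLHASQG
```

The two outputs match the start and end of the protein sequence listed above. Passing the full gene sequence through the same functions would allow a complete comparison.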