Gene Acid345_1435 details


Gene Information

Locus tag: Acid345_1435
Symbol:
ID: 4068814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1733349
End bp: 1734986
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 61%
IMG OID: 637983444
Product: peptidase S1C, Do
Protein accession: YP_590511
Protein GI: 94968463
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
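
The start, end, gene length, and protein length fields above are mutually consistent, which a short sketch can confirm. It assumes inclusive 1-based coordinates and a single terminal stop codon; the function name `cds_lengths` is illustrative and not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive 1-based coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


# Coordinates taken from the record above.
print(cds_lengths(1733349, 1734986))  # (1638, 545)
```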


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.600827
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACCGC CCACCAGCGG TTCCTGGCAG CGCCTGAGAG CCAATCGGTT CGCTTCTGTT 
CTTGTGATCC TGGCCACTCT CTCGCTCGGC ATTCTGATCG GAACCGTGAT CTCTTCGACT
GTGAAGGGCA ACGAAAAACA GGTCAGCAGC TCGGACGCAA CTCCGCTGCA GATCCCTGAG
CCCAAGCAGC TTTCCAACCA GTTCGCGCAG ATTGCGAAAC AGCTCGAACC GGCGGTGGTC
AACATCAACA CCGAGTCCAC CATGAAGCAC CCTTCCATCA AGGGACGACG CGGCCAGCAA
ACGCCTCCCG ATGACGACGA GGATAATCAG GACGATCAGG ACCAAGGCCC GGGCGGCGGG
CAGGACAGCC CCTTCCAGGA CTTCTTCGAT CGCTTCTTCG GTGGCCAGGG TGGCGGCGGG
CAAATGCCCC AGCAGGACCT CCGTCAGCGC GCCCTTGGCT CCGGCATCAT CATTGACCCG
AAGGGCTACA TCATCACCAA CGATCACGTG GTTGATAAAG CCGACAAGAT CAAGGTCAAC
CTCATGGGCG ACCCTGAGAC CGTCAGCTAC GACGCTACGG TCATCGGCGT GGACAAGGAA
ACCGATCTCG CCGTCATCAA GATCAACGTG AAGCACGATC TTCCTTACGC GAAGCTCGGC
AACTCCGAGG GCGTACAGGT CGGTGACTGG GTTCTTGCCC TCGGCAGCCC CTTCGGTCTT
AACTCGACCA TGACTGCCGG AATCGTCTCC GCCAAGGGCC GCAACATCGT CCCGCAGCGC
CAGTTCCAGC AGTTCATCCA GACCGACGCC GCCATCAACC CCGGCAACTC CGGCGGTCCG
CTCGTGGACA TGGCCGGTGA GGTCATCGGC ATCAACACCG CGATCTTCAC CACCGGCGGC
GGCTACCAGG GTGTTGGCTT TGCGCTACCC TCCAACACGG TCATACAGGT TTATAACCAG
CTCATCGCGC CCGATCACAA GGTCTCGCGC GGCTCCATCG GCGTGGAATT CAACGCGGTA
GCGAATCCCG CGGTAGCGCG TGTTTACGGC GTCACCACGG GCGTTACGGT AGCCAACGTC
ACTCCCAATG GACCGGCACA AAAGGCCGGC ATCCAGACGG GCGACACCAT CGTTTCTGTG
GATGGCAAGC CCGTAAAGAA TGGCGATGAA CTCGTCGCTG ACATCTCTGC GCGCAAGCCG
GGCTCGACCG CGAAGGTCGG CTTCGTTCGC AACGGCAAGG AACAGTCTGC AAGCGTCACG
ATCGCGGATC GCTCCAAGCT CTACGCCGCG CGTCTTGGCG GCGGTGGCGA AGAGCAGGGT
GAAGGCGGCG AAGGCCAGCC CCAGCCCAGC AAGTTCGGTG CGACCGTGCA GAACATTACG
CCTGAGATGG CGCAGCAGTT GAAGCTGCCC AACACCAAGG GCGTTGTGGT CAGCAACGTG
AAGCAGGACA GCTTTGCGGA GTCTGTCGGC CTTGGCCGCG GCGACGTGAT CCTCGAGATC
AACAAACAGC CCGTCACCAA CGAAGACGAT TTCCGCCGCA TTCAGGGCAG CCTCAAGAGC
GGTGCTGACG TCGTCTTCCT CGTCCGTCCT CGCGGACGCG ATAACGGAAC CATTTTCATG
GCCGGAACCT TGCCGTAA
 
Protein sequence
MTPPTSGSWQ RLRANRFASV LVILATLSLG ILIGTVISST VKGNEKQVSS SDATPLQIPE 
PKQLSNQFAQ IAKQLEPAVV NINTESTMKH PSIKGRRGQQ TPPDDDEDNQ DDQDQGPGGG
QDSPFQDFFD RFFGGQGGGG QMPQQDLRQR ALGSGIIIDP KGYIITNDHV VDKADKIKVN
LMGDPETVSY DATVIGVDKE TDLAVIKINV KHDLPYAKLG NSEGVQVGDW VLALGSPFGL
NSTMTAGIVS AKGRNIVPQR QFQQFIQTDA AINPGNSGGP LVDMAGEVIG INTAIFTTGG
GYQGVGFALP SNTVIQVYNQ LIAPDHKVSR GSIGVEFNAV ANPAVARVYG VTTGVTVANV
TPNGPAQKAG IQTGDTIVSV DGKPVKNGDE LVADISARKP GSTAKVGFVR NGKEQSASVT
IADRSKLYAA RLGGGGEEQG EGGEGQPQPS KFGATVQNIT PEMAQQLKLP NTKGVVVSNV
KQDSFAESVG LGRGDVILEI NKQPVTNEDD FRRIQGSLKS GADVVFLVRP RGRDNGTIFM
AGTLP
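
Both sequences are printed in space-separated groups of ten, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are illustrative, and the example uses only the first line of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used as sample input only.
first_line = "ATGACACCGC CCACCAGCGG TTCCTGGCAG CGCCTGAGAG CCAATCGGTT CGCTTCTGTT"
dna = clean_sequence(first_line)
print(len(dna), round(gc_fraction(dna), 2))  # 60 0.62
```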