Gene Acid345_2635 details


Gene Information

Locus tag: Acid345_2635
Symbol:
ID: 4072044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3106467
End bp: 3108128
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 61%
IMG OID: 637984652
Product: serine protease, kumamolysin
Protein accession: YP_591710
Protein GI: 94969662
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4934] Predicted protease
TIGRFAM ID:
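
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3106467
end_bp = 3108128

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed gene length (1662 bp) and protein length (553 aa).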


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.250819
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTATGGGA ATCCGTGGCT CGCGTCAACC CGCGCTCAGC AAACTGCACG GTTCAGCATC 
GCAGTGCTAC TCTCATCGCA CCCGGTTAAC CCCATGACAG ACGCCATTCC CCCAACTCGT
CTCGATCTTG CCGATCTGAA TCGCGCTCCT CGCTCCGAGG AACAAGTCAT CGGCCGCACC
GCTCCTGATA CCCAACTCTC GGTGACTATC GTCTTGCGCC GTGCTACTGA CGCGGCGATG
CGCGCCGCCG ATCTCGCCGC CCTGCGCGAC TTCTCCATAC GACATAAGCT CGATCTCGAA
GACTCCGGAG ACCCGGACGA CTTCGTAACC CTGTGCGGAC GCGCGGCCGA TTTCGAATAT
GCGTTTCACT TTGAGTTGCT CGACGTGGAA CAGGATGGCG ACCGTTATCG CCGCTATACA
GAAACGCCGT CTCTACGGCC GGGAATCCGC GAAGTGGTCG TCGGCATTTT CGGACTTCGC
GACCGTCCCG CGCGTCCGCG TCCGCGGGTC GATCACGGCG GAACCACCGC GCCGTTCTGG
ACTGCAACCG ACCTCGAGCG TGCGTACTCG TTTCCCGAAG GGACCGACGG TGCCGGCCAG
ACAATCGCAC TCATCGAACT CGGTGGCGGC TACGACCCGC AAGACATAGC AGACTTGCTC
GCGAGCCTGG GCCGCCCGCT GCCACAGGTG ACTTTCCGGC CCGTTGCGAA CGCCCTCAAC
CAGCCTTGCG ACGCGGACAC GATTCAGCAG TGGCTCGATG TGATCGAAGG GCGCCTGCAA
TTGTCCGCTG TCGATCCGAA GGTACTCGAG GCCGCACAAG CCACCGCGGA AGTCACCATG
GACATCGAAG TCGCTGCCGC ACTCGCTCCC GGCGCCCACC TCGTCGTCTA CATGGCGCCG
CCTACCGAGC AGGGCCTCTA CAAAGCGCTT GACGCAGCGA TCCACGACAC GCCTCCGCTT
GTGGATGTCG TCTCCATCAG TTGGGGCGAA GCTGAGCTCT ATGTCTCCGA CGCCTACAGA
AAATCGCTCA CGCAACTGCT GGAAGACGCC GCCGCGCGCG GGATCACTGT CTGCGCGTCG
TCGGGAGATA ACGGCGCGTA TGACGATCCG CCCAATCAGA CTCTCTGCGT GAACTTTCCC
GCCAGTAGCC CACTCGTACT CGCCTGCGGC GGCACCACGA TCGCAAGTTA TAGTTCCGGA
ATCCAAAAAG AAGTCGTATG GAATTGCGGT GTGCATGGCA TCCACGCTGC CACCGGCGGT
GGCGTCAGCG AACACTTCCC GCTGCCAACT TGGCAGGACG CGAAACTCGT CCCAGCATCC
GCGAACGGAT ATCGCGGTCG CGGCGTGCCC GATGTCGCTG CTGTTGCCGA TCCTCATAAC
GGCTGCGAGA TTCTCGTGCG TGGAATTCGT TGCAGTTCGT TCGGCACCAG CGCCGTCGTG
CCCTTCTGGG CTGCGCTCAT CGCGCGTTGC AACCAAGCTT TGGGAAAACG CAGCGGCCAA
ATCCAGCCAA AACTGTACGA ACTCGCCAAG TCCGAGAGTT CACCGTTCCG AGCGATTTTA
GAGGGAGACA ATTTCTTCTA TCGTGCGGCG GCGGGATGGA ACCCTTGCAC AGGATTGGGA
GCTCCAGATG GAAGCCGACT ACTCACTGCT TTACGGAGTT GA
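
A GC content figure such as the one listed under Gene Information can be recomputed directly from the nucleotide sequence. The following is a minimal sketch, not the method used by the source database: the helper name `gc_percent` is hypothetical, and only the first 60 bases of the gene sequence are used to keep the example short. Passing the full sequence would give the gene-wide value.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above (60 bases, spaces are ignored).
first_line = "GTGTATGGGA ATCCGTGGCT CGCGTCAACC CGCGCTCAGC AAACTGCACG GTTCAGCATC"
print(gc_percent(first_line))
```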
 
Protein sequence
MYGNPWLAST RAQQTARFSI AVLLSSHPVN PMTDAIPPTR LDLADLNRAP RSEEQVIGRT 
APDTQLSVTI VLRRATDAAM RAADLAALRD FSIRHKLDLE DSGDPDDFVT LCGRAADFEY
AFHFELLDVE QDGDRYRRYT ETPSLRPGIR EVVVGIFGLR DRPARPRPRV DHGGTTAPFW
TATDLERAYS FPEGTDGAGQ TIALIELGGG YDPQDIADLL ASLGRPLPQV TFRPVANALN
QPCDADTIQQ WLDVIEGRLQ LSAVDPKVLE AAQATAEVTM DIEVAAALAP GAHLVVYMAP
PTEQGLYKAL DAAIHDTPPL VDVVSISWGE AELYVSDAYR KSLTQLLEDA AARGITVCAS
SGDNGAYDDP PNQTLCVNFP ASSPLVLACG GTTIASYSSG IQKEVVWNCG VHGIHAATGG
GVSEHFPLPT WQDAKLVPAS ANGYRGRGVP DVAAVADPHN GCEILVRGIR CSSFGTSAVV
PFWAALIARC NQALGKRSGQ IQPKLYELAK SESSPFRAIL EGDNFFYRAA AGWNPCTGLG
APDGSRLLTA LRS
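
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates this for the first 60 bases. It assumes the sequence is a complete CDS given 5' to 3' in the coding orientation, so the initiator codon (here GTG) is read as methionine. The names `CODON_TABLE` and `translate_cds` are hypothetical, and ambiguity codes are not handled.

```python
from itertools import product

# Amino acid assignments for translation table 11, in the conventional
# TCAG codon order (first base varies slowest, third base fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    # Assumption: the first codon is the initiator, so it is read as Met
    # even when it is an alternative start codon such as GTG.
    if protein:
        protein[0] = "M"
    return "".join(protein)


first_line = "GTGTATGGGA ATCCGTGGCT CGCGTCAACC CGCGCTCAGC AAACTGCACG GTTCAGCATC"
print(translate_cds(first_line))
```

The output can be compared against the first 20 residues of the protein sequence listed above.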