Gene Ccur_10940 details


Gene Information

Locus tag: Ccur_10940
Symbol:
ID: 8375301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cryptobacterium curtum DSM 15641
Kingdom: Bacteria
Replicon accession: NC_013170
Strand:
Start bp: 1242492
End bp: 1244111
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 52%
IMG OID: 644994016
Product: trypsin-like serine protease with C-terminal PDZ domain protein
Protein accession: YP_003151467
Protein GI: 256827508
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
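
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1242492
end_bp = 1244111

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.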


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000345024
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 153
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAGA ATGGAATTCC TCCCGTAGGA CCGACGGGCT CTTCATATAC GGGATATACC 
GGACAGCCGG GTGCGCCGAT TCCACCGCAG TCGACTCAAC CGATGACTTC AGCGACACAA
ACGGCACAAA CAATGCAGGC AGCACAATCG GCTACAACAG CGCAGCCAAC CCAGGCGACA
CAGGCAACTT CGTATGCAAC CGTGCCGCCT CAATTTACCT CCGTTCCTCC GCAACCTCCA
ACCGTTCCAC CAGCGGGCAC AACAACTGCT ATACCAGCAG CTCATGTGCC CCAGACCGGT
GTTTCAGGTA AGCGTGTCTT TGGTATTGCG TTTATCGGTG CACTTGCTGC GTGTGCGGTT
GCTGCCGCGT GTTTTCTTGG CTATCAGTCG ATTACGGGGC ATTCGGCTTC CCAGGTGGTA
CTGGGGTCTT CCAGTGACAG CACCATTACC GCAAGCGACG ACGAGACTGA TCGTGCTGAA
AAGGTAGCTG ACAAGACGCT TCCATCGGTT ACAGCTATTA ATGTGTACAC CGACCAATCG
GGGTGGGGCA GCATGTTGGG TCGTTCAACC GATCAGTCGA GTTTGGTTAA GACAAGCCTG
GGCAGCGGCG TGATTATTTC TTCCGATGGC TACATTCTGA CAAATAACCA TGTGGTTGAA
TCAGCTGATG CATTAAAGGT AACGGCCAAT GGGCAAGAAT ATGATGCCAA GGTTGTGGGT
ACTGACCCAA CTACTGACCT GGCAGTCATT AAGATTGATG CAACGGGTTT GACCCCCATC
GAGATCGGCA AATCGTCCGA TTTGAAGGCT GGTCAGTGGG TCATGACCGT TGGCAGCCCC
TTTGGTCTTG AACAGTCAGT GGCGACGGGT ATTATTTCGG CAACAAGTCG TACCGTGGCC
GTGTCGTCAA GTAGCGATGA AGGGTCGGGT AATGGCTATA ACAATAGCTA CTCTTCGGCT
GCTGCACCTA CGATTTACAC GAACATGATT CAGACCGATG CAGCTATCAA CCCGGGCAAT
TCCGGTGGTG CGCTTGTTGA TGCAAATGGT AAGCTCATTG GCATCAATGC GGTTATCGAA
TCGTATTCAG GTAACTATTC AGGTGTTGGA TTTGCCATCC CTGTTGACTA CGCCATGGAT
ATCGCGCAGC AGATTATTTC TGGTAAAACG CCAACGCATG CCATGATTGG GGTAACGCCG
ATTTCAATTA CGAGCCAGCT CAATCAGCGC TACCACTTGG GTACTGATTC AGGTGCGTAT
GTATCGAGCA TTGTTGAAGG TAGTGGCGCT GCGCAGGCTG GCCTTGAAGA GGGCGATATC
ATCACGAAGG TTGATGATAC GGCGGTAACA GACGCAACAG GATTGATCGC GGCTGTGCGT
TCGAAGAATG TCGGCGACAC AGTAACGGTT ACCTATCTGC GCGATGGTCA GCAGATGACC
GCTCAAGTGA CCCTTGGATC AGACGACAAT GCCAATTCGC TGTTGCGCAA GAATAGTAGC
AGCCGGGGCG GCTTATTTGG TATGAGCTAT GACAACCAGG GAAGCGGCAA CAACCAAGGC
TCAACGGCAA CTGGTTCAAC AGGATCGAGT CAGTCTTCTC CTCAAAGGGC AGCTGCTTAG
 
Protein sequence
MTENGIPPVG PTGSSYTGYT GQPGAPIPPQ STQPMTSATQ TAQTMQAAQS ATTAQPTQAT 
QATSYATVPP QFTSVPPQPP TVPPAGTTTA IPAAHVPQTG VSGKRVFGIA FIGALAACAV
AAACFLGYQS ITGHSASQVV LGSSSDSTIT ASDDETDRAE KVADKTLPSV TAINVYTDQS
GWGSMLGRST DQSSLVKTSL GSGVIISSDG YILTNNHVVE SADALKVTAN GQEYDAKVVG
TDPTTDLAVI KIDATGLTPI EIGKSSDLKA GQWVMTVGSP FGLEQSVATG IISATSRTVA
VSSSSDEGSG NGYNNSYSSA AAPTIYTNMI QTDAAINPGN SGGALVDANG KLIGINAVIE
SYSGNYSGVG FAIPVDYAMD IAQQIISGKT PTHAMIGVTP ISITSQLNQR YHLGTDSGAY
VSSIVEGSGA AQAGLEEGDI ITKVDDTAVT DATGLIAAVR SKNVGDTVTV TYLRDGQQMT
AQVTLGSDDN ANSLLRKNSS SRGGLFGMSY DNQGSGNNQG STATGSTGSS QSSPQRAAA
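
As a rough consistency check between the two sequences, the first and last 30 nucleotides of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the method used to produce the record. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons, and the helper `translate` is a hypothetical name introduced here for illustration.

```python
# Codon table built from the usual T, C, A, G ordering of bases.
# Assumption: standard amino acid assignments, as used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last 30 nucleotides of the gene sequence above.
first_block = "ATGACCGAGA ATGGAATTCC TCCCGTAGGA"
last_block = "CAGTCTTCTC CTCAAAGGGC AGCTGCTTAG"

print(translate(first_block))
print(translate(last_block))
```

The two printed fragments can be compared with the first ten and last nine residues of the protein sequence. The last block is in frame because it starts 1590 bases into the 1620 bp gene, and 1590 is a multiple of three.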