Gene Shel_23520 details

Gene Information

Locus tag: Shel_23520
Symbol:
ID: 8396241
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Slackia heliotrinireducens DSM 20476
Kingdom: Bacteria
Replicon accession: NC_013165
Strand:
Start bp: 2602450
End bp: 2603778
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 62%
IMG OID: 644987099
Product: collagenase-like protease
Protein accession: YP_003144710
Protein GI: 257065038
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
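
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2602450, 2603778)
print(length)                  # 1329
print(protein_length(length))  # 442
```

Both values agree with the Gene Length and Protein Length fields of the record.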


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000439045
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGAAGA TGGAGCTCCT GGCGCCCGCG GGCGGTTGGG AACAGTTGGA ATACGCGGTT 
CATTTCGGGG CCGACGCCGT GTATCTGGCG TCGCAGCGCT ACGGCATGCG CCGTCGGGCG
GACAACTTTA AAGAAGAGGA CCTGCCTCGG GCCATCGCTT TCGCACACGA CCACGGCGTT
GCGGTCCATG TGACCGTGAA CACGCTCATG ACCGACGAGA ACATCGACGA TCTGCCTCGA
TACTTCAAGC TGCTGGGAGA CGCCGGCGCC GATGCGGCCA TCATCGCGGA TATGGGAGCT
TTGGCCATCT GCCGCGAGGT GGCGCCGCAT GTTGACATCC ATCTGTCCAC GCAGGCGTCC
TGCATGAACG CCGCCTCGGC CCAGGTGTAC CAGAGCTTGG GCGTCAAGCG CGTCGTGCTG
GCTCGCGAAA TGAACCTGGA CGAAATCGCA CGCATGAAGA GCCGTCTGCC CGAGGGGCTT
GAAATCGAGG CCTTCGCCCA CGGTGCCATG TGCATGGCCT ATTCGGGTCG CTGCCTGATC
AGCGATTACC TTACCGGCCG CGGCGCCAAC AAAGGCAGTT GCGCACAACC CTGCCGTTGG
GAATACGCCC TGACCGAGCC GACGCGCCCG GGCGAGTACT TCCCGGTGGA AGAGGATGCC
GAGCAGGGGA GCTTCATTAT GAGCTCCCGC GATATGAGCA TGCTGGGGCA TTTGGACGAC
CTGGCGGCGG CGGGCATCGA CAGCATCAAG ATCGAGGGCC GCGCGAAAGG CACCTACTAC
GTGGCTTCGG TGGTGAACGC CTACCGCAAT GTGCTGGACG GCGGCGACCC CGAGGTGTGG
CAGCGTGAGC TGGAAACCAC CAGCCATCGC CCCTATTCCA CGGGGTTCTA TTACGGATTC
CCAGGTCAGA ATCCGATTTC TGCACAATAC AGCCGCAAAT ACCAGATGGT TGCCACGGTT
AAGTCCTGCG TGCCGGCCGA CGGAGGGTTC CAAGTGCGCG TCGTGTGCCG CAACCGGTTC
GACGATGGCG ACACGGTGGA GGTCCTGAGT CCTCGGACGC CGGTTCGGGA ATGCACGGTG
CGCAACCTGA TATGGCATGC CGCGCCGGAA ACCGACCTGA CGGACATCCT GCGGGACAAC
CTGGGTACCG TCGTCGGCCC CGACCCGGAA GTGGTGCACG GGCGTCTGTT GCGCGTGGGT
ATAGCCAACC GGACCATGGA GGAATACTCT TTCGATGTAC CGTTTGGATT GCAAGAACGT
GACATTGTGC GCATCTCGCG TGATACGTCG GCGATTATTT GCGAAAATGG ACCATCTCCG
TTTGCGTAA
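
The GC content listed under Gene Information can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is supplied as text in the space-separated 10-base blocks used in this record. `gc_percent` is a hypothetical helper name, not a function from any database tool.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustrative inputs, not the full gene sequence.
print(gc_percent("ATGC"))       # 50.0
print(gc_percent("GGCC AATT"))  # 50.0
```

Passing the full gene sequence from this record to `gc_percent` gives a value to compare against the reported GC content field.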
 
Protein sequence
MAKMELLAPA GGWEQLEYAV HFGADAVYLA SQRYGMRRRA DNFKEEDLPR AIAFAHDHGV 
AVHVTVNTLM TDENIDDLPR YFKLLGDAGA DAAIIADMGA LAICREVAPH VDIHLSTQAS
CMNAASAQVY QSLGVKRVVL AREMNLDEIA RMKSRLPEGL EIEAFAHGAM CMAYSGRCLI
SDYLTGRGAN KGSCAQPCRW EYALTEPTRP GEYFPVEEDA EQGSFIMSSR DMSMLGHLDD
LAAAGIDSIK IEGRAKGTYY VASVVNAYRN VLDGGDPEVW QRELETTSHR PYSTGFYYGF
PGQNPISAQY SRKYQMVATV KSCVPADGGF QVRVVCRNRF DDGDTVEVLS PRTPVRECTV
RNLIWHAAPE TDLTDILRDN LGTVVGPDPE VVHGRLLRVG IANRTMEEYS FDVPFGLQER
DIVRISRDTS AIICENGPSP FA
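
The protein sequence can be reproduced from the gene sequence by translating it codon by codon with translation table 11. The sketch below assumes the amino acid assignments of NCBI table 11, which for sense and stop codons are the same as the standard code. It does not handle alternative start codons, which is sufficient here because this gene starts with ATG. The `translate` function and the constant names are illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with
# the amino acid string for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS up to the first stop codon; unknown codons become 'X'."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGCGAAGA TGGAGCTCCT GGCGCCCGCG"))  # MAKMELLAPA
```

The output matches the first ten residues of the protein sequence in this record.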