Gene Information |
Locus tag | Shew185_0630 |
Symbol | |
ID | 5369672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS185 |
Kingdom | Bacteria |
Replicon accession | NC_009665 |
Strand | - |
Start bp | 763121 |
End bp | 766333 |
Gene Length | 3213 bp |
Protein Length | 1070 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640828828 |
Product | collagenase |
Protein accession | YP_001364854 |
Protein GI | 152999173 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTTTG ATGCAAGCTT AGGTTCCACG CATCAAATAA CGCCCAAGGA AGATCTGCTT TGCAGACTAG ATAACTACGG ATACACAATG AAAAACACAC TGCTCTTCGC AGCTATTAGC TTAGCTTTTG CGACTCCCGT TCTCGCCCAT GACACTCATA CGAGTCACCC ATTAGGCAGC AACACACCCA GTAATCCACA AGCGCAACCG GCACCACGAC CACCCTCGGC GCCCATTAGC CAAATAGAAT CCCATAGACC GCTACTGCGG GAGACGCCAT TGATGGCATC CCCGAATGAG CCAACTATGA CAGAACAGAC AATGACAGAA CCGCCTAAGA TGCATTCACG CAAGAAGCGA GAAGTATCTG AGCAACCGCC ACTTAAGTTA CAGAGTCGCA GCGCCGCACT GCAGAGCAAT GACTGCAGTG ACTTTGTCGG TAAATCGGGC CAAGCCCTAG TGGATCAATT GAGCCAATCG ACACCCGAGT GCGTCGGTAA GCTCTATAGC CTCAAAGGCA GTAGCGCCAC CGCGCTATTT AGCGAGGCCA ATGTGATCAG TGTCGCCAAC GCCATTGCCA CTAAAGCCAA GGACTACACT GGGGTCGATG TTCAGCATCT TGAATCGCAT ATTTATTTTG TCCGCGCCGC CCTCTATGTG CAGTTTTACA GCCCAAATGA TGTACCCGCT TACAGCAGCG CCGCTAAGGC CAGCTTAAAA TCGGCCCTCA ACGCTTTATT TGCGAACGCT GCGATTTGGA CTGTTTCCGA TGACAATGCT GGCGTGTTGA AAGAAGCCTT AATCCTGATT GACTCTGCCG AGCTCGGCGC CGATTTCAAT CATGTCACCA TAAAAGTGCT AACGGACTAC GACGCTAACT GGCAAGCCAG TTTCGCCATG AACGCCGCAG CCAACTCAGT GTTTACCACC TTATTCCGCG CCCAGTGGAA TGATGACATG CAGGCGCTGT TCGCACGTGA CCAAGGTATT TTAGATGCGC TAAATAACTT TCAACTCGAA CATCGTGACT TGTTAGGCAC TAATGCTGAG TATCTGTTAG TCAACTCAGT CAAAGAGTTA TCTCGGCTGT ATTACATTGA TGCCATGCGC CCGCGAGTGA CTCAGCTAGT TAAAAACATC CTCAGCAGCA CCAGCAAAAC CGAACCGAGC AAAGTGCTTT GGTACGCGGC GGCAGAAATG GCCGACTATT ACGATCGCAG CCATTGTAAC GATTACAATA TCTGTGGCTT TAAGGCGCAG CTAGAGGCCG ATACCCTACC GTTCAACTGG AAATGCTCCG ACAGCCTTAA GATCCGCGCC CAAGATCTCT ATCAAGATCA AGCCAAGTGG GCCTGTGATG TGCTGACCAG CCAAGAGAGC TATTTCCACA GCAAACTTGA AACGGGCATG CAACCCGTAG GGCAAGATAA CAACGATGAT TTAGAGCTAG TGATATTTGG CAGCTCGTCT GAATACAAAT CCTTAGCGAA CAGTATCTTT GGCATCAATA CCGACAACGG CGGCATGTAC CTTGAAGGCT CGCCCGCCGG ACTCAAAAAT CAGGCGCGTT TTATCGCCTA TGAAGCCGAA TGGCGCACGC CGGATTTCCA TGTCTGGAAC CTGCAACACG AATATGTGCA CTACCTCGAC GGGCGCTATA ACCTGTTTGG TGACTTTAGC CGCGGCACTT CGGCCAATAC TATTTGGTGG ATTGAAGGCC TAGCGGAATA CATTTCTTAC CGTGACGCCA ATACCGCGGC CATTGCCATG GGCGAAACCG GTGAATTTAT GTTGTCGACC 
ATATTCAAAA ACAACTATGA ATCAGGCCAA GACCGTATCT ACCGTTGGGG TTATCTGGCG GTGCGCTTTA TGTTTGAGCA TCACAGGGAC GATGTAAGAC AGATTCTCGC CTACCTACGT AACGACCAAT ATGCTGAATA TCAAACCTTT ATGGATGGTA TTGGCACACG TTACGACAAC GAGTGGCAAG GTTGGCTCGC CAGCGGCTTG AGCACGGCTG ACGATGGCAT AGTCGATAAG GGCCCAAGTG ATGTCGATGC TGAGCCCAGT GGCCGTGAAG GCAATTGGAC TGGTCCTGCG GGCACTATCA GTAAGGATTA CTCGCCTTGC CAAGTGACGG ACGAAGCCTA CCGTTACACT GAATCGGCCA GCCTGAAAAT CGATGTGCCG ATGGAATGTA TTGATTCTAA ACTCGGTCGC GCCAGTTTCA GTTTCACTAA TACCGATCGC TCGGCCCAAG ATCTTTGGAT CAAGATTGGC GGAGGTTGGG GTGATGCCGA CATTTATTTC AATTCTAAAG GTTGGGCAAG CCCAGAACAA AACCAAGGCG CGGGGATCGG CAACGGTAAC TATCAAGTCA TTAAAGTGCA GTTAAACCCT GACGAATATT GGCACTACAT TACCTTGTCG GGGGATTTTG GCGGCGTCGA TATGCAAGTC AGCACCACTG AGTTGTTCGC TGATGTTGAT CCCGATCTGG GAGATGGCGG CGTGGATCCT GAGCCACCAG CAAATTGCGG TGCAGTAACC TTAGATTACG GCCAGCTCAC ACTGGGTAAA AACGAGTGTA TCAGTGGTGG ACGCAACAGC TTCTACTTCT GGGTTGAGGA AGATAATACT CAGTTCACCG TCACCACTAA GGGCGGCTCT GGGGATGCCA ATCTTTACTT CAATGCCAGT CAGTGGGCCG ACGCCGATAA TGCCGATGCC AAGAGCACTC AAGTGGGTAA TCAAGAGTCG CTAAGTTTTA GTGCCAATCG CGGCTGGCGT TATATCACAG TGGACACTGC CACTGAGTTT AGCGGCGTCA CTCTCACCCT CAATACAGGC GCGAGCAACA CACCTACGCC AGCTCTAATC GCTAATGCCT GCACGACGCA ATCACCCTTG AGTCATGGTG AGCTGAGTTC AGGCAAAGCC ATCTGCACCA CCGATGGTCG CAGCGATTAC TATCTTTGGG TGCCAGAAGG TACGAGTCAG TTAAGCATCA ACTCGGCCCA CGGCAGTGGT GATGTGAGCC TGTTTTCGGG GACAACTTGG GCCAATGCTC AACACTTCGA TGCAGCGTCT GTCACGCCCG CTAACACTCA AGAAAGTATC ACAGTCGATG CGCCAAATGT GGGGTGGTAT TACATCACAG TGCAGAGTGA ACCTCAGAGT TCGGGTGTCG CACTGCAAGT CGATTTACGC TAG
|
Protein sequence | MRFDASLGST HQITPKEDLL CRLDNYGYTM KNTLLFAAIS LAFATPVLAH DTHTSHPLGS NTPSNPQAQP APRPPSAPIS QIESHRPLLR ETPLMASPNE PTMTEQTMTE PPKMHSRKKR EVSEQPPLKL QSRSAALQSN DCSDFVGKSG QALVDQLSQS TPECVGKLYS LKGSSATALF SEANVISVAN AIATKAKDYT GVDVQHLESH IYFVRAALYV QFYSPNDVPA YSSAAKASLK SALNALFANA AIWTVSDDNA GVLKEALILI DSAELGADFN HVTIKVLTDY DANWQASFAM NAAANSVFTT LFRAQWNDDM QALFARDQGI LDALNNFQLE HRDLLGTNAE YLLVNSVKEL SRLYYIDAMR PRVTQLVKNI LSSTSKTEPS KVLWYAAAEM ADYYDRSHCN DYNICGFKAQ LEADTLPFNW KCSDSLKIRA QDLYQDQAKW ACDVLTSQES YFHSKLETGM QPVGQDNNDD LELVIFGSSS EYKSLANSIF GINTDNGGMY LEGSPAGLKN QARFIAYEAE WRTPDFHVWN LQHEYVHYLD GRYNLFGDFS RGTSANTIWW IEGLAEYISY RDANTAAIAM GETGEFMLST IFKNNYESGQ DRIYRWGYLA VRFMFEHHRD DVRQILAYLR NDQYAEYQTF MDGIGTRYDN EWQGWLASGL STADDGIVDK GPSDVDAEPS GREGNWTGPA GTISKDYSPC QVTDEAYRYT ESASLKIDVP MECIDSKLGR ASFSFTNTDR SAQDLWIKIG GGWGDADIYF NSKGWASPEQ NQGAGIGNGN YQVIKVQLNP DEYWHYITLS GDFGGVDMQV STTELFADVD PDLGDGGVDP EPPANCGAVT LDYGQLTLGK NECISGGRNS FYFWVEEDNT QFTVTTKGGS GDANLYFNAS QWADADNADA KSTQVGNQES LSFSANRGWR YITVDTATEF SGVTLTLNTG ASNTPTPALI ANACTTQSPL SHGELSSGKA ICTTDGRSDY YLWVPEGTSQ LSINSAHGSG DVSLFSGTTW ANAQHFDAAS VTPANTQESI TVDAPNVGWY YITVQSEPQS SGVALQVDLR