Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal195_0657 |
Symbol | |
ID | 5752374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | - |
Start bp | 797677 |
End bp | 800889 |
Gene Length | 3213 bp |
Protein Length | 1070 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641286919 |
Product | collagenase |
Protein accession | YP_001553095 |
Protein GI | 160873779 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTTTG ATGCAAGCTT TGGTTCCACG CATCAAATAA CGCCCAAGGA AGATCTGCTT TGCAGACTAG ATAACTACGG ATACACAATG AAAAACACAC TGCTCTTCGC AGCTATTAGC TTAGCTTTTG CGACTCCCGC TCTCGCCCAT GACACTCATA CGAGTCACCC ATTAGGCAGC AACACACCCA GCAATCCACA AGCGCAACCG GCACCACGAC CACCCTCGGC GCCCATTAGC CAAATAGAAT CCCATAGACC GCTACTGCGG GAGACGCCAT TGATGGCATC CCCGAATGAG CCAACTATGA CAGAACAGAC AATGACAGAA CCGCCTAAGA TGCATTCACG CAAGAAGCGA GAAGTATCTG AGCAACCGCC ACTTAAGTTA CAGAGTCGCA GCGCCGCACT GCAGAGCAAT GACTGCAGTG ACTTTGTCGG TAAATCGGGC CAAGCCCTAG TGGATCAATT GAGCCAATCG ACACCCGAAT GCGTCGGTAA GCTCTATAGC CTCAAAGGCA GTGCTGCGAC CGCGCTATTT AGCGAGGCCA ATGTGATCAG TGTCGCCAAC GCCATTGCCA CTAAAGCCAA GGACTACACT GGGGTCGATG TTCAGCATCT TGAATCGCAT ATTTACTTTG TCCGCGCAGC CCTCTATGTG CAGTTTTACA GCCCAAATGA TGTACCCGCT TACAGCAGCG CCGCTAAGGC CAGCTTAAAA TCGGCCCTCA ACGCTTTATT TGCCAACGCT GCGATTTGGA CTGTCTCCGA TGACAATGCT GGCGTGTTGA AAGAAGCCTT AATCCTGATT GACTCTGCCG AGCTCGGCGC CGATTTCAAT CATGCCACCA TTAAAGTGTT AATGGACTAC GACGCTAACT GGCAAGCCAG TTTCGCCATG AACGCCGCTG CAAACTCAGT GTTTACTACC TTATTCCGCG CCCAGTGGAA TGATGACATG CAGGCGCTGT TCGCTCGTGA CCAAGGCATT TTAGATGCGC TGAATAACTT CCAACTCGAA CACCGCGACT TGTTAGGTAC CAATGCTGAG TATCTGTTAG TCAACTCAGT CAAAGAGTTA TCAAGACTGT ATTACATTGA TTCCATGCGC CCGCGAGTGA CTCAGCTCGT TAAAAACATC CTCAGCAGCA CCAGCAAAAC CGAACCGAGC AAAGTGCTTT GGTACGCGGC GGCAGAAATG GCCGACTATT ACGATCGCAG CCATTGTAAC GATTACAATA TCTGTGGTTT TAAGGCGCAG TTAGAGGCCG ATACCCTACC TTTCAACTGG AAATGCTCCG ACAGTCTCAA GATCCGCGCA CAAGATCTCT ATCAAGATCA AGCCAAGTGG GCCTGTGATG TGTTGACCAG CCAAGAGAGC TATTTCCACA GCAAACTTGA AACGGGTATG CAACCTGTAG GGCAAGATAA CAACGATGAT TTAGAGCTAG TGATATTTGG CAGTTCGTCT GAGTATAAAT CCTTAGCGAA CAGTATCTTT GGTATCAATA CCGACAACGG CGGCATGTAC CTTGAGGGCT CGCCCGCCGG ACTCAAAAAT CAGGCGCGTT TTATCGCCTA TGAAGCCGAA TGGCGCACGC CGGATTTCCA TGTCTGGAAC CTGCAACACG AATATGTGCA CTACCTCGAC GGGCGCTATA ACCTGTTTGG TGACTTTAGC CGCGGCACAT CAGCCAATAC CATTTGGTGG ATTGAAGGCC TCGCAGAATA CATTTCTTAC CGTGACGCCA ATACCGCGGC CATTGCCATG GGCGAAACCG GTGAGTTTAT GTTGTCGACC ATATTCAAAA ACAACTATGA ATCGGGCCAA GACCGTATCT ACCGTTGGGG TTATCTGGCG GTGCGCTTTA TGTTTGAGCA TCACAGGGAC GATGTAAGAC AGATTCTCGC CTACCTACGT AACGACCAAT ACGCGGAATA TCAAACCTTT ATGGATGGTA TTGGCACACG TTACGACAAC GAGTGGCAAG GTTGGCTCGC CAGCGGCTTG AGCACGGCTG ACGATGGCAT AGTCGATAAA GGCCCAAGTG ATGTCGATGC TGAGCCCAGT GGCCGTGAAG GTAATTGGAC TGGCCCTGCG GGCACTATCA GTAAGGATTA CTCGCCTTGC CAAGTGACGA ACGAAGCCTA CCGTTACACT GAATCGGCCA GCCTGAAAAT CGATGTGCCG ATGGAATGTA TTGATTCCAA ACTCGGCCGC GCCAGTTTCA GTTTTACTAA TACCGAGCGC TCGGCCCAAG ATCTGTGGAT CAAGATTGGC GGCGGTTGGG GTGATGCCGA CATTTATTTC AATTCTAAAA GTTGGGCAAG CCCAGAGCAA AACCAAGGCG CGGGGATCGG CAATGGTAAC TATCAAGTCA TTAAAGTGCA GTTAAACCCT GACGAATATT GGCACTACAT TACCTTGTCG GGGGACTTTG GCGGCGTTGA TATGCAAGTC AGCACCACTG AGCTGTTCGC TGATGTTGAT CCCGATCTGG GAGATGGCGG CGTGGATCCT GAGCCACCAG CAAATTGCGG TGCAGTAACC TTAGATTACG GCCAGCTCAC ACTGGGCAAA AACGAGTGTA TCAGTGGTGG ACGCAACAGC TTCTACTTCT GGGTTGAGGA AGATAATACC CAGTTCACCG TCACCACTAA GGGCGGCTCT GGGGATGCCA ATCTCTACTT CAATGCCAGT CAGTGGGCCG ACGCCGATAA TGCCGATGCC AAGAGCACTC AAGCAGGTAA TCAAGAGTCG CTAAGTTTTA GTGCCAATCG CGGCTGGCGT TATATCACAG TGGATACTGC CACTGAGTTT AGCGGCGTCA CACTCACCCT CAATACAGGC GCGAGCAATA CGCCTACGCC AGCTCTAATC GCTAATGCCT GCACGACGCA ATCGCCCTTG AGTCATGGTG AGCTGAGTTC AGGCAAAGCC ATCTGCACCG CCGATGGCCG CAGCGATTAC TATCTTTGGG TGCCAGAAGG TACGAGTCAG TTAAGCATCA ACTCGGCCCA TGGCAGTGGT GATGTGAGCC TGTTTTCGGG GACAACTTGG GCCAATGCTC AACACTTCGA TGCAGCGTCT GTCACGCCCG CTAACACTCA AGAAAGTATC ACAGTCGATG CGCCAAATGT GGGTTGGTAT TACATCACAG TGCAGAGTGA ACCTCAGAGT TCGGGTGTCG CACTGCAGGT TGATTTACGC TAG
|
Protein sequence | MRFDASFGST HQITPKEDLL CRLDNYGYTM KNTLLFAAIS LAFATPALAH DTHTSHPLGS NTPSNPQAQP APRPPSAPIS QIESHRPLLR ETPLMASPNE PTMTEQTMTE PPKMHSRKKR EVSEQPPLKL QSRSAALQSN DCSDFVGKSG QALVDQLSQS TPECVGKLYS LKGSAATALF SEANVISVAN AIATKAKDYT GVDVQHLESH IYFVRAALYV QFYSPNDVPA YSSAAKASLK SALNALFANA AIWTVSDDNA GVLKEALILI DSAELGADFN HATIKVLMDY DANWQASFAM NAAANSVFTT LFRAQWNDDM QALFARDQGI LDALNNFQLE HRDLLGTNAE YLLVNSVKEL SRLYYIDSMR PRVTQLVKNI LSSTSKTEPS KVLWYAAAEM ADYYDRSHCN DYNICGFKAQ LEADTLPFNW KCSDSLKIRA QDLYQDQAKW ACDVLTSQES YFHSKLETGM QPVGQDNNDD LELVIFGSSS EYKSLANSIF GINTDNGGMY LEGSPAGLKN QARFIAYEAE WRTPDFHVWN LQHEYVHYLD GRYNLFGDFS RGTSANTIWW IEGLAEYISY RDANTAAIAM GETGEFMLST IFKNNYESGQ DRIYRWGYLA VRFMFEHHRD DVRQILAYLR NDQYAEYQTF MDGIGTRYDN EWQGWLASGL STADDGIVDK GPSDVDAEPS GREGNWTGPA GTISKDYSPC QVTNEAYRYT ESASLKIDVP MECIDSKLGR ASFSFTNTER SAQDLWIKIG GGWGDADIYF NSKSWASPEQ NQGAGIGNGN YQVIKVQLNP DEYWHYITLS GDFGGVDMQV STTELFADVD PDLGDGGVDP EPPANCGAVT LDYGQLTLGK NECISGGRNS FYFWVEEDNT QFTVTTKGGS GDANLYFNAS QWADADNADA KSTQAGNQES LSFSANRGWR YITVDTATEF SGVTLTLNTG ASNTPTPALI ANACTTQSPL SHGELSSGKA ICTADGRSDY YLWVPEGTSQ LSINSAHGSG DVSLFSGTTW ANAQHFDAAS VTPANTQESI TVDAPNVGWY YITVQSEPQS SGVALQVDLR
|
| |