Gene Sbal195_0657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_0657 
Symbol 
ID5752374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp797677 
End bp800889 
Gene Length3213 bp 
Protein Length1070 aa 
Translation table11 
GC content51% 
IMG OID641286919 
Productcollagenase 
Protein accessionYP_001553095 
Protein GI160873779 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTTTG ATGCAAGCTT TGGTTCCACG CATCAAATAA CGCCCAAGGA AGATCTGCTT 
TGCAGACTAG ATAACTACGG ATACACAATG AAAAACACAC TGCTCTTCGC AGCTATTAGC
TTAGCTTTTG CGACTCCCGC TCTCGCCCAT GACACTCATA CGAGTCACCC ATTAGGCAGC
AACACACCCA GCAATCCACA AGCGCAACCG GCACCACGAC CACCCTCGGC GCCCATTAGC
CAAATAGAAT CCCATAGACC GCTACTGCGG GAGACGCCAT TGATGGCATC CCCGAATGAG
CCAACTATGA CAGAACAGAC AATGACAGAA CCGCCTAAGA TGCATTCACG CAAGAAGCGA
GAAGTATCTG AGCAACCGCC ACTTAAGTTA CAGAGTCGCA GCGCCGCACT GCAGAGCAAT
GACTGCAGTG ACTTTGTCGG TAAATCGGGC CAAGCCCTAG TGGATCAATT GAGCCAATCG
ACACCCGAAT GCGTCGGTAA GCTCTATAGC CTCAAAGGCA GTGCTGCGAC CGCGCTATTT
AGCGAGGCCA ATGTGATCAG TGTCGCCAAC GCCATTGCCA CTAAAGCCAA GGACTACACT
GGGGTCGATG TTCAGCATCT TGAATCGCAT ATTTACTTTG TCCGCGCAGC CCTCTATGTG
CAGTTTTACA GCCCAAATGA TGTACCCGCT TACAGCAGCG CCGCTAAGGC CAGCTTAAAA
TCGGCCCTCA ACGCTTTATT TGCCAACGCT GCGATTTGGA CTGTCTCCGA TGACAATGCT
GGCGTGTTGA AAGAAGCCTT AATCCTGATT GACTCTGCCG AGCTCGGCGC CGATTTCAAT
CATGCCACCA TTAAAGTGTT AATGGACTAC GACGCTAACT GGCAAGCCAG TTTCGCCATG
AACGCCGCTG CAAACTCAGT GTTTACTACC TTATTCCGCG CCCAGTGGAA TGATGACATG
CAGGCGCTGT TCGCTCGTGA CCAAGGCATT TTAGATGCGC TGAATAACTT CCAACTCGAA
CACCGCGACT TGTTAGGTAC CAATGCTGAG TATCTGTTAG TCAACTCAGT CAAAGAGTTA
TCAAGACTGT ATTACATTGA TTCCATGCGC CCGCGAGTGA CTCAGCTCGT TAAAAACATC
CTCAGCAGCA CCAGCAAAAC CGAACCGAGC AAAGTGCTTT GGTACGCGGC GGCAGAAATG
GCCGACTATT ACGATCGCAG CCATTGTAAC GATTACAATA TCTGTGGTTT TAAGGCGCAG
TTAGAGGCCG ATACCCTACC TTTCAACTGG AAATGCTCCG ACAGTCTCAA GATCCGCGCA
CAAGATCTCT ATCAAGATCA AGCCAAGTGG GCCTGTGATG TGTTGACCAG CCAAGAGAGC
TATTTCCACA GCAAACTTGA AACGGGTATG CAACCTGTAG GGCAAGATAA CAACGATGAT
TTAGAGCTAG TGATATTTGG CAGTTCGTCT GAGTATAAAT CCTTAGCGAA CAGTATCTTT
GGTATCAATA CCGACAACGG CGGCATGTAC CTTGAGGGCT CGCCCGCCGG ACTCAAAAAT
CAGGCGCGTT TTATCGCCTA TGAAGCCGAA TGGCGCACGC CGGATTTCCA TGTCTGGAAC
CTGCAACACG AATATGTGCA CTACCTCGAC GGGCGCTATA ACCTGTTTGG TGACTTTAGC
CGCGGCACAT CAGCCAATAC CATTTGGTGG ATTGAAGGCC TCGCAGAATA CATTTCTTAC
CGTGACGCCA ATACCGCGGC CATTGCCATG GGCGAAACCG GTGAGTTTAT GTTGTCGACC
ATATTCAAAA ACAACTATGA ATCGGGCCAA GACCGTATCT ACCGTTGGGG TTATCTGGCG
GTGCGCTTTA TGTTTGAGCA TCACAGGGAC GATGTAAGAC AGATTCTCGC CTACCTACGT
AACGACCAAT ACGCGGAATA TCAAACCTTT ATGGATGGTA TTGGCACACG TTACGACAAC
GAGTGGCAAG GTTGGCTCGC CAGCGGCTTG AGCACGGCTG ACGATGGCAT AGTCGATAAA
GGCCCAAGTG ATGTCGATGC TGAGCCCAGT GGCCGTGAAG GTAATTGGAC TGGCCCTGCG
GGCACTATCA GTAAGGATTA CTCGCCTTGC CAAGTGACGA ACGAAGCCTA CCGTTACACT
GAATCGGCCA GCCTGAAAAT CGATGTGCCG ATGGAATGTA TTGATTCCAA ACTCGGCCGC
GCCAGTTTCA GTTTTACTAA TACCGAGCGC TCGGCCCAAG ATCTGTGGAT CAAGATTGGC
GGCGGTTGGG GTGATGCCGA CATTTATTTC AATTCTAAAA GTTGGGCAAG CCCAGAGCAA
AACCAAGGCG CGGGGATCGG CAATGGTAAC TATCAAGTCA TTAAAGTGCA GTTAAACCCT
GACGAATATT GGCACTACAT TACCTTGTCG GGGGACTTTG GCGGCGTTGA TATGCAAGTC
AGCACCACTG AGCTGTTCGC TGATGTTGAT CCCGATCTGG GAGATGGCGG CGTGGATCCT
GAGCCACCAG CAAATTGCGG TGCAGTAACC TTAGATTACG GCCAGCTCAC ACTGGGCAAA
AACGAGTGTA TCAGTGGTGG ACGCAACAGC TTCTACTTCT GGGTTGAGGA AGATAATACC
CAGTTCACCG TCACCACTAA GGGCGGCTCT GGGGATGCCA ATCTCTACTT CAATGCCAGT
CAGTGGGCCG ACGCCGATAA TGCCGATGCC AAGAGCACTC AAGCAGGTAA TCAAGAGTCG
CTAAGTTTTA GTGCCAATCG CGGCTGGCGT TATATCACAG TGGATACTGC CACTGAGTTT
AGCGGCGTCA CACTCACCCT CAATACAGGC GCGAGCAATA CGCCTACGCC AGCTCTAATC
GCTAATGCCT GCACGACGCA ATCGCCCTTG AGTCATGGTG AGCTGAGTTC AGGCAAAGCC
ATCTGCACCG CCGATGGCCG CAGCGATTAC TATCTTTGGG TGCCAGAAGG TACGAGTCAG
TTAAGCATCA ACTCGGCCCA TGGCAGTGGT GATGTGAGCC TGTTTTCGGG GACAACTTGG
GCCAATGCTC AACACTTCGA TGCAGCGTCT GTCACGCCCG CTAACACTCA AGAAAGTATC
ACAGTCGATG CGCCAAATGT GGGTTGGTAT TACATCACAG TGCAGAGTGA ACCTCAGAGT
TCGGGTGTCG CACTGCAGGT TGATTTACGC TAG
 
Protein sequence
MRFDASFGST HQITPKEDLL CRLDNYGYTM KNTLLFAAIS LAFATPALAH DTHTSHPLGS 
NTPSNPQAQP APRPPSAPIS QIESHRPLLR ETPLMASPNE PTMTEQTMTE PPKMHSRKKR
EVSEQPPLKL QSRSAALQSN DCSDFVGKSG QALVDQLSQS TPECVGKLYS LKGSAATALF
SEANVISVAN AIATKAKDYT GVDVQHLESH IYFVRAALYV QFYSPNDVPA YSSAAKASLK
SALNALFANA AIWTVSDDNA GVLKEALILI DSAELGADFN HATIKVLMDY DANWQASFAM
NAAANSVFTT LFRAQWNDDM QALFARDQGI LDALNNFQLE HRDLLGTNAE YLLVNSVKEL
SRLYYIDSMR PRVTQLVKNI LSSTSKTEPS KVLWYAAAEM ADYYDRSHCN DYNICGFKAQ
LEADTLPFNW KCSDSLKIRA QDLYQDQAKW ACDVLTSQES YFHSKLETGM QPVGQDNNDD
LELVIFGSSS EYKSLANSIF GINTDNGGMY LEGSPAGLKN QARFIAYEAE WRTPDFHVWN
LQHEYVHYLD GRYNLFGDFS RGTSANTIWW IEGLAEYISY RDANTAAIAM GETGEFMLST
IFKNNYESGQ DRIYRWGYLA VRFMFEHHRD DVRQILAYLR NDQYAEYQTF MDGIGTRYDN
EWQGWLASGL STADDGIVDK GPSDVDAEPS GREGNWTGPA GTISKDYSPC QVTNEAYRYT
ESASLKIDVP MECIDSKLGR ASFSFTNTER SAQDLWIKIG GGWGDADIYF NSKSWASPEQ
NQGAGIGNGN YQVIKVQLNP DEYWHYITLS GDFGGVDMQV STTELFADVD PDLGDGGVDP
EPPANCGAVT LDYGQLTLGK NECISGGRNS FYFWVEEDNT QFTVTTKGGS GDANLYFNAS
QWADADNADA KSTQAGNQES LSFSANRGWR YITVDTATEF SGVTLTLNTG ASNTPTPALI
ANACTTQSPL SHGELSSGKA ICTADGRSDY YLWVPEGTSQ LSINSAHGSG DVSLFSGTTW
ANAQHFDAAS VTPANTQESI TVDAPNVGWY YITVQSEPQS SGVALQVDLR