Gene Information |
Locus tag | Sbal223_0653 |
Symbol | |
ID | 7089784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 787969 |
End bp | 791094 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643459566 |
Product | Microbial collagenase |
Protein accession | YP_002356596 |
Protein GI | 217971845 |
COG category | |
COG ID | |
TIGRFAM ID | |
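The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length, and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Sbal223_0653.
length = gene_length_bp(787969, 791094)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values in this record, the two results agree with the listed Gene Length (3126 bp) and Protein Length (1041 aa).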
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACA CACTGCTCTT CGCAGCTATT AGCTTAGCTT TTGCGACTCC CGCTCTCGCC CATGACACTC ATACGAGTCA CCCATTAGGC AGCAACACAC CCAGCAATCC ACAAGCGCAA CCGGCACCAC GACCACCCTC GGCGCCCATT AGCCAAATAG AATCCCATAG ACCGCTACTG CGGGAGACGC CATTGATGGC ATCCCCGAAT GAGCCAACTA TGACAGAACA GACAATGACA GAACCGCCTA AGATGCATTC ACGCAAGAAG CGAGAAGTAT CTGAGCAACC GCCACTTAAG TTACAGAGTC GCAGCGCCGC ACTGCAGAGC AATGACTGCA GTGACTTTGT CGGTAAATCG GGCCAAGCCC TAGTGGATCA ATTGAGCCAA TCGACACCCG AATGCGTCGG TAAGCTCTAT AGCCTCAAAG GCAGTGCTGC GACCGCGCTG TTTAGCGAGG CCAATGTGAT CAGTGTAGCT AACGCCATTG CCACTAAAGC CAAGGACTAC ACTGGGGTCG ATGTTCAGCA TCTTGAATCG CATATTTACT TTGTCCGCGC CGCCCTCTAT GTGCAGTTTT ACAGCCCAAA TGATGTACCA GCTTACAGCA GCGCCGCTAA GGCCAGCTTA AAATCGGCTC TCAACGCTTT ATTTGCTAAC GCTGCGATTT GGACTGTCTC CGATGACAAT GCTGGCGTGT TGAAAGAAGC CTTAATCCTA ATTGACTCTG CCGAGCTCGG CGCCGATTTC AATCATGTCA CCATAAAAGT GCTAACGGAC TACGACGCTA ACTGGCAAGC CAGTTTCGCC ATGAACGCCG CAGCCAACTC AGTGTTTACC ACCTTATTCC GCGCCCAGTG GAATGATGAC ATGCAGGCGC TGTTCGCACG TGACCAAGGC ATTTTAGATG CGCTGAATAA CTTCCAGCTC GAACACCGCG ACTTGTTAGG CACCAATGCA GAGTATCTGT TAGTCAACTC AGTCAAAGAG TTATCAAGAC TGTATTACAT TGATGCCATG CGCCCGCGAG TGACTCAGCT AGTTAAAAAC ATCCTCAGCA GCACCAGCAA AAGCGAACCG AGTAAGGTGC TTTGGTACGC GGCGGCAGAA ATGGCCGACT ATTACGATCG CAGCCATTGT AACGATTACA ATATCTGTGG TTTTAAGGCG CAGCTAGAGG CCGATGCCCT ACCGTTCAAC TGGAAATGCT CCGACAGTCT CAAGATCCGC GCCCAAGATC TCTATCAAGA TCAAGCCAAG TGGGCCTGTG ATGTGCTGAC CAGCCAAGAG AGCTATTTCC ACAGCAAACT CGAAACGGGC ATGCAACCTG TAGGGCAAGA TAACAACGAT GACTTAGAGC TAGTGATCTT TGGCAGCTCG TCTGAGTATA AATCCTTAGC CAACAGTATC TTTGGCATCA ATACCGACAA CGGCGGCATG TACCTTGAAG GCTCGCCCGC CGGACTCAAA AATCAGGCGC GTTTTATCGC CTATGAAGCC GAATGGCGCA CGCCGGATTT CCATGTTTGG AACCTACAAC ACGAATATGT GCACTACCTC GACGGGCGCT ATAACCTGTT TGGTGACTTT AGTCGCGGCA CGTCGGCCAA TACCATTTGG TGGATTGAAG GTCTAGCGGA ATACATTTCT TACCGTGACG CCAATACCGC GGCCATTGCC ATGGGCGAAA CCGGTGAATT TATGTTGTCG ACCATATTCA AAAACAACTA TGAATCGGGC CAAGACCGTA TCTACCGTTG GGGTTATCTG GCGGTGCGCT TTATGTTTGA GCATCACAGG 
GACGATGTAA GACAGATTCT CGCCTACCTA CGTAACGACC AATACGCGGA ATATCAAACC TTTATGGATG GTATCGGTAC ACGTTACGAC AACGAGTGGC AAGGTTGGCT CGCCAGCGGC TTGAGCACGG CTGACGATGG CATTGTCGAT AAAGGCCCAA GTGATGTCGA TGCTGAGCCC AGTGGCCGTG AAGGTAACTG GGCTGGTCCT GCGGGCACTA TCAGTAAGGA TTACTCGCCT TGCCAAGTGA CGAACGAAGC CTACCGTTAC ACAGAATCGG CCAGCCTTAA AATCAATGTG CCGATGGAAT GTATTGATTC TAAACTCGGC CGCGCCAGTT TCAGTTTCAC TAATACCGAT CGCTCGGCCC AAGATCTTTG GATCAAGATT GGCGGCGGTT GGGGTGATGC CGACATTTAT TTCAATTCTA AAGGTTGGGC AAGCCCAGAA CAAAACCAAG GCGCGGGGAT CGGCAACGGT AACTATCAAG TCATTAAAGT GCAGTTAAAC CCTGACGAAT ATTGGCACTA CATTACCTTG TCGGGGGATT TTGGCGGCGT CGATATGCAA GTCAGCACCA CTGAGTTGTT CGCTGATGTT GATCCCGATC TGGGAGATGG CGGCGTGGAT CCTGAGCCAC CAGCAAATTG CGGTGCAGTG ACCTTAGATT ACGGCCAGCT CACACTGGGC AAAAACGAGT GTATCAGTGG TGGACGTAAC AGCTTCTACT TCTGGGTTGA GGAAGATAAT ACCCAGTTCA CCGTCACCAC TAAGGGCGGC TCTGGGGATG CCAATCTCTA CTTCAATGCC AGCCAGTGGG CCGACGCCGA TAATGCCGAT GCCAAGAGCA CTCAAGTAGG TAATCAAGAG TCGCTAAGTT TTAGTGCCAA TCGCGGCTGG CGTTATATCA CAGTGGACAC TGCCACTGAG TTTAGCGGCG TCACTCTCAC CCTCAACACA GGCGCGAGCA ACACACCTAC GCCAGCTCTA ATCGCCAATG CCTGCGCGAC GCAATCGCCC TTGAGTCATG GTGAACTGAG TTCAGGCAAA GCCATCTGCA CCGCCGATGG CCGCAGCGAT TACTATCTTT GGGTGCCAGA AGGTACGAGT CAGTTAAGTA TCAACTCGGC CCACGGCAGT GGTGATGTGA GCCTGTTTTC GGGGACGACT TGGGCCAATA CTCAACACTT CGATGCAGCG TCTGTCACGC CTGCCAACAC TCAAGAAAGT ATCACAGTCG ATGCGCCAAA CGTGGGGTGG TATTACATCA CAGTGCAGAG TGAACCACAG AGTTCAGGTG TCGCACTGCA AGTCGATTTA CGCTAG
|
Protein sequence | MKNTLLFAAI SLAFATPALA HDTHTSHPLG SNTPSNPQAQ PAPRPPSAPI SQIESHRPLL RETPLMASPN EPTMTEQTMT EPPKMHSRKK REVSEQPPLK LQSRSAALQS NDCSDFVGKS GQALVDQLSQ STPECVGKLY SLKGSAATAL FSEANVISVA NAIATKAKDY TGVDVQHLES HIYFVRAALY VQFYSPNDVP AYSSAAKASL KSALNALFAN AAIWTVSDDN AGVLKEALIL IDSAELGADF NHVTIKVLTD YDANWQASFA MNAAANSVFT TLFRAQWNDD MQALFARDQG ILDALNNFQL EHRDLLGTNA EYLLVNSVKE LSRLYYIDAM RPRVTQLVKN ILSSTSKSEP SKVLWYAAAE MADYYDRSHC NDYNICGFKA QLEADALPFN WKCSDSLKIR AQDLYQDQAK WACDVLTSQE SYFHSKLETG MQPVGQDNND DLELVIFGSS SEYKSLANSI FGINTDNGGM YLEGSPAGLK NQARFIAYEA EWRTPDFHVW NLQHEYVHYL DGRYNLFGDF SRGTSANTIW WIEGLAEYIS YRDANTAAIA MGETGEFMLS TIFKNNYESG QDRIYRWGYL AVRFMFEHHR DDVRQILAYL RNDQYAEYQT FMDGIGTRYD NEWQGWLASG LSTADDGIVD KGPSDVDAEP SGREGNWAGP AGTISKDYSP CQVTNEAYRY TESASLKINV PMECIDSKLG RASFSFTNTD RSAQDLWIKI GGGWGDADIY FNSKGWASPE QNQGAGIGNG NYQVIKVQLN PDEYWHYITL SGDFGGVDMQ VSTTELFADV DPDLGDGGVD PEPPANCGAV TLDYGQLTLG KNECISGGRN SFYFWVEEDN TQFTVTTKGG SGDANLYFNA SQWADADNAD AKSTQVGNQE SLSFSANRGW RYITVDTATE FSGVTLTLNT GASNTPTPAL IANACATQSP LSHGELSSGK AICTADGRSD YYLWVPEGTS QLSINSAHGS GDVSLFSGTT WANTQHFDAA SVTPANTQES ITVDAPNVGW YYITVQSEPQ SSGVALQVDL R
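The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so no reverse complement is taken here. The helper name `translate` and the short test fragments are choices made for this example. This is not the database's own method.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAAAAACA CACTGCTCTT CGCAGCTATT"))
print(translate("GATTTA CGCTAG"))
```

The two fragments translate to the first ten residues (MKNTLLFAAI) and the last three residues (DLR) of the listed protein sequence. Passing the full gene sequence to the same function should reproduce the full protein, though that has not been verified here.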