Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal_3732 |
Symbol | |
ID | 4843317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS155 |
Kingdom | Bacteria |
Replicon accession | NC_009052 |
Strand | + |
Start bp | 4368279 |
End bp | 4371404 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640120999 |
Product | collagenase |
Protein accession | YP_001052075 |
Protein GI | 126175926 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACA CACTGCTCTT CGCAGCTATT AGCTTAGCTT TTGCGACTCC CGTTCTCGCC CATGACACTC ATACGAGTCA CCCATTAGGC AGCAACACAC CCAGTAATCC ACAAGCGCAA CCGGCACCAC GACCACCCTC GGCGCCCATT AGCCAAATAG AATCCCATAG ACCGCTACTG CGGGAGACGC CATTGATGGC ATCCCCGAAT GAGCCAACTA TGACAGAACA GACAATGACA GAACCGCCTA AGATGCATTC ACGCAAGAAG CGAGAAGTAT CTGAGCAACC GCCACTTAAG TTACAGAGTC GCAGCGCCGC ACTGCAGAGC AATGACTGCA GTGACTTTGT CGGTAAATCG GGCCAAGACC TAGTGGATCA ATTGAGCCAA TCGACACCCG AATGCGTCGG TAAGCTCTAT AGCCTCAAAG GCAGTGCTGC GACCGCGCTA TTTAGCGAGG CCAATGTGAT CAGTGTCGCC AACGCCATTG CCACTAAAGC CAAGGACTAC ACTGGGGTCG ATGTTCAGCA TCTTGAATCG CATATTTACT TTGTCCGCGC CGCCCTCTAT GTGCAGTTTT ACAGCCCAAA TGATGTACCC GCTTACAGCA GCGCCGCTAA GGCCAGCTTA AAATCGGCTC TTAACGCTTT ATTTGCTAAC GCTGCGATTT GGACTGTCTC CGATGACAAT GCTGGCGTGT TGAAAGAAGC CTTAATCCTG ATTGACTCTG CCGAGCTCGG CGCCGATTTC AATCATGTCA CCATAAAAGT GCTAACGGAC TACGACGCTA ACTGGCAAGC CAGTTTCGCC ATGAACGCCG CAGCCAACTC AGTGTTTACC ACCTTATTCC GCGCCCAGTG GAATGATGAC ATGCAGGCGC TGTTCGCACG TGACCAAGGC ATTTTAGATG CGCTGAATAA CTTCCAGCTC GAACACCGCG ACTTGTTAGG CACCAATGCA GAGTATCTGT TAGTCAACTC AGTCAAAGAG TTATCAAGAC TGTATTACAT TGATGCCATG CGCCCGCGAG TGACTCAGCT AGTTAAAAAC ATCCTCAGCA GCACCAGCAA AACCGAACCG AGCAAAGTGC TTTGGTACGC GGCGGCAGAA ATGGCCGACT ATTACGATCG CAGCCATTGT AACGATTACA ATATCTGTGG TTTTAAGGCG CAGTTAGAGG CCGATGCCTT ACCTTTCAAC TGGAAATGCT CCGACAGTCT CAAGATCCGC GCCCAAGATC TCTATCAAGA TCAAGCCAAG TGGGCCTGTG ATGTGCTGAC CAGCCAAGAA AGCTATTTCC ACAGTAAACT TGAAACGGGT ATGCAACCTG TAGGGCAAGA TAACAACGAT GACTTAGAGC TAGTGATATT TGGCAGCTCG TCTGAGTATA AATCCTTAGC CAACAGTATC TTTGGTATCA ATACCGACAA CGGCGGCATG TACCTTGAAG GCTCGCCCGC CGGACTCAAA AATCAGGCGC GTTTTATCGC CTATGAAGCC GAATGGCGCA CGCCGGATTT CCATGTCTGG AACCTGCAAC ACGAATATGT GCACTACCTC GACGGGCGCT ATAACCTGTT TGGTGACTTT AGCCGCGGCA CGTCAGCCAA TACCATTTGG TGGATTGAAG GCCTCGCAGA ATACATTTCT TACCGTGACG CCAATACCGC GGCGATTGCC ATGGGCGAAA CCGGTGAGTT TATGTTGTCG ACCATATTCA AAAACAACTA TGAATCGGGC CAAGACCGTA TCTACCGTTG GGGTTATCTG GCGGTGCGCT TTATGTTTGA GCATCACAGG GACGATGTAA GACAGATTCT CGCCTACCTA CGTAACGACC AATACGCTGA ATATCAAACC TTTATGGATG GTATTGGCAC ACGTTACGAC AACGAGTGGC AAGGTTGGCT CGCCAGCGGC TTGAGCACGG CTGACGATGG TATAGTCGAT AAAGGCCCAA GTGATGTAGA TGCTGAGCCC AGTGGCCGTG AAGGTAATTG GACTGGCCCT GCGGGCACTA TCAGTAAGGA TTACTCCCCT TGCCAAGTGA CGAACGAAGC CTACCGTTAC ACTGAATCGG CCAGCCTGAA AATCGATGTG CCGATGGAAT GCATCGACTC CAAACTCGGT CGCGCCAGTT TTAGCTTCAC TAATACCGAG CGCTCGGCCC AAGATCTTTG GATCAAGATT GGCGGCGGTT GGGGTGATGC CGACATTTAT TTCAATTCTA AAAGTTGGGC AAGCCCAGAA CAAAACCAAG GCGCGGGGAT CGGCAACGGT AACTATCAAG TCATTAAAGT GCAGTTAAAC CCTGACGAAT ATTGGCACTA CATTACCTTG TCGGGGGATT TTGGCGGCGT CGATATGCAA GTCAGCACCA CTGAGCTGTT CGCTGATGTT GATCCCGATC TGGGAGATGG CGGCGTGGAT CCTGAGCCAC CAGCAAATTG CGGTGCAGTG ACCTTAGATT ACGGCCAGCT CACACTGGGC AAAAACGAGT GTATCAGTGG TGGACGCAAC AGCTTCTACT TCTGGGTTGA GGAAGATAAT ACTCAGTTCA CCGTCACCAC TAAGGGTGGC TCTGGGGATG CCAATCTCTA CTTCAATGCC AGTCAGTGGG CCGACGCCGA TAATGCCGAT GCCAAGAGCA CTCAAGTGGG TAATCAAGAG TCGCTAAGTT TTAGTGCCAA TCGCGGCTGG CGTTATATCA CAGTGGATAC TGCCACTGAG TTTAGCGGCG TCACACTCAC CCTCAATACA GGCGCGAGCA ATACGCCTAC GCCAGCTCTA ATCGCTAATG CCTGCACGAC GCAATCGCCC TTGAGTCATG GTGAGCTGAG TTCAGGCAAA GCCATCTGCA CCGCCGATGG CCGCAGCGAT TACTATCTTT GGGTGCCAGA AGGTACGAGT CAGTTAAGCA TCAACTCGGC CCATGGCAGT GGTGATGTGA GCCTGTTTTC GGGGACAACT TGGGCCAATG CTCAACACTT CGATGCAGCG TCTGTCACGC CCGCTAACAC TCAAGAAAGT ATCACAGTCG ATGCACCAAG CCTGGGTTGG TATTACATCA CAGTGCAGAG TGAACCCCAG AGTTCAGGAG TCGCACTGCA GGTTGATTTA CGCTAG
|
Protein sequence | MKNTLLFAAI SLAFATPVLA HDTHTSHPLG SNTPSNPQAQ PAPRPPSAPI SQIESHRPLL RETPLMASPN EPTMTEQTMT EPPKMHSRKK REVSEQPPLK LQSRSAALQS NDCSDFVGKS GQDLVDQLSQ STPECVGKLY SLKGSAATAL FSEANVISVA NAIATKAKDY TGVDVQHLES HIYFVRAALY VQFYSPNDVP AYSSAAKASL KSALNALFAN AAIWTVSDDN AGVLKEALIL IDSAELGADF NHVTIKVLTD YDANWQASFA MNAAANSVFT TLFRAQWNDD MQALFARDQG ILDALNNFQL EHRDLLGTNA EYLLVNSVKE LSRLYYIDAM RPRVTQLVKN ILSSTSKTEP SKVLWYAAAE MADYYDRSHC NDYNICGFKA QLEADALPFN WKCSDSLKIR AQDLYQDQAK WACDVLTSQE SYFHSKLETG MQPVGQDNND DLELVIFGSS SEYKSLANSI FGINTDNGGM YLEGSPAGLK NQARFIAYEA EWRTPDFHVW NLQHEYVHYL DGRYNLFGDF SRGTSANTIW WIEGLAEYIS YRDANTAAIA MGETGEFMLS TIFKNNYESG QDRIYRWGYL AVRFMFEHHR DDVRQILAYL RNDQYAEYQT FMDGIGTRYD NEWQGWLASG LSTADDGIVD KGPSDVDAEP SGREGNWTGP AGTISKDYSP CQVTNEAYRY TESASLKIDV PMECIDSKLG RASFSFTNTE RSAQDLWIKI GGGWGDADIY FNSKSWASPE QNQGAGIGNG NYQVIKVQLN PDEYWHYITL SGDFGGVDMQ VSTTELFADV DPDLGDGGVD PEPPANCGAV TLDYGQLTLG KNECISGGRN SFYFWVEEDN TQFTVTTKGG SGDANLYFNA SQWADADNAD AKSTQVGNQE SLSFSANRGW RYITVDTATE FSGVTLTLNT GASNTPTPAL IANACTTQSP LSHGELSSGK AICTADGRSD YYLWVPEGTS QLSINSAHGS GDVSLFSGTT WANAQHFDAA SVTPANTQES ITVDAPSLGW YYITVQSEPQ SSGVALQVDL R
|
| |