Gene Information |
Locus tag | SO_0639 |
Symbol | |
ID | 1168501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella oneidensis MR-1 |
Kingdom | Bacteria |
Replicon accession | NC_004347 |
Strand | - |
Start bp | 664604 |
End bp | 667717 |
Gene Length | 3114 bp |
Protein Length | 1037 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637342626 |
Product | collagenase family protein |
Protein accession | NP_716272 |
Protein GI | 24372230 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
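The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes a single terminal stop codon; both assumptions are consistent with the values listed. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 664604
end_bp = 667717
gene_length_bp = 3114
protein_length_aa = 1037


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon.

    Assumption: the CDS length includes the stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Compare the computed values with the values reported in the record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions both comparisons hold: 667717 - 664604 + 1 = 3114 bp, and 3114 / 3 - 1 = 1037 aa.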
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGA CTCTACTTTT CGCGGCTATT AGCTTAGCCA TTGCAACACC TTCGCTCGCC CATAACCATT CAGTACAAGA CCCAAACAAT TCAGTAAAAG CGAAAAACAC AGCAATCCCT AGTACGCTAG CACCGTTAGC TTCACCGTCA GCGCCACTCA ATATTGATGT CAGCCATCGG CCGTTATTGC CCACTGACAG TTTGATAGCC TCACCTAACG AGCCAACACA CTCTGAGTAC CTCGATGCAC ATTCTTCACA ACAGCAAGCG TTGCTCGAGA ACTCTCCGAG ATCAAAAGTC AGCCGCTTTG CCGCCAATGC CGACTGTAGT AACTTTGTTG GTAAATCGGG CCAAGCCCTG TTAGATGAGC TAACGCAATC CACGCCACAG TGTGTCGGTA AACTATTTAA TCTCAAGGGA AGTGACGCAA GCAGCGTATT TAGTGAAGCG AATGTATCCA CAGTTGCCAA TGCAATTGCA GCCAAAGCGC CTCAATACAC AGGCGTGGAT AACCAAGGTA TTGAGTCACA TATTTACTTT GTGCGCGCCG CGCTTTATGT GCAGTTTTAC CACCCCAGCG ACGTACCCGC CTACAGCAGC GCGGTGAAAA ACAATTTAAA ATCAGCGCTT AATGCTCTCT TTGCCAATAA CGCCATTTGG ACACTCTCCG ATGCCAATGC CAGCGTTCTC AAAGAAGCAC TGATCCTGAT TGATTCCGCC GAACTGGGGG CTGATTTTAA CTTTGTCACC CTCAAAGTGC TCAATGACTA CAACAGCACA TGGCAAGCCA GCTTTGCGAT GAATGCGGCC GCCAATACCG TGTTTACCAC CTTATTCCGC GCCCAGTGGA ATACTGACAT GCAGGCACTG TTTGCCCGAG ATCATGCCAT TTTGGATGCG CTCAACCAGT TCCAACTTAA CCACAGGGAC TTACTCGGCA CCAATGCTGA ATATGTTTTA GTCAATGCTG TCAGAGAGTT ATCCAGACTG TACTACATTG ATGCCATGCG CCCTAAAGTC ACCCAATTGG TCAAAAATAT TTTAAACAGC ACCAGCAAAA ATGATTCCAG TAGAGTGCTG TGGTTTGCAG CTGCCGAGAT GGCAGATTAC TACGATCGCA GCAATTGTAA TGCTTACCAG ATCTGCGGTT TCAAAGCACA ATTGGCCGCC GATAGCTTAC CGTTCAATTG GAAATGCTCC GACAGCCTCA AGATCCGCGC CCAAGATCTT TACCAAGATC AAGCTAAATG GGCATGCGGA GTATTAAGTC AGCAAGAAAG CCATTTCCAC ACGATACTCG AAACGGGAAT GCAGGCGGTC GCACAGGATA ACAATGATGA TTTAGAGCTG GTGATTTTTG GCAGCTCGTC AGAATATCAA TCCCTTGCCA ACAGCATTTT TGGGATCAAT ACCAACAATG GCGGCATGTA TTTAGAAGGC TCTCCCGCTG GACTTAAAAA CCAAGCGCGT TTTATTGCCT ATGAGGCCGA GTGGCGACAA CCTGATTTTC ATGTGTGGAA CTTACAGCAT GAATACGTGC ATTACCTCGA TGGCCGCTAT AACTTGTTTG GCGACTTCAG CCGCAGCGTA TCCGCCAACA CAATTTGGTG GATTGAAGGA TTAGCAGAAT ATATTTCATA TCGAGATGCC AACCCCGCTG CCATCGCCAT GGGTGAAACC GGTGAGTTTA TGCTCTCAAC CATCTTTAAA AACACCTATG ACTCTGGCCA AGACCGCATT TACCGTTGGG GTTACCTCGC CGTACGGTTC ATGTTTGAAA ACCACCGCGA TGATGTTCGC 
CAGATCCTAA CCTTCCTGCG TAATAACCAA TATGCTGAAT ATCAAGCATT TATGGATTCC ATTGGCACCC GCTATGACAA CGAATGGCGC GGCTGGCTCA CCAGCGGTTT AAGCACTACC AACAATGGTA TTGTCGATAA AGGCCCAAGT GATGAACAGG CTAATGCCAG TGGTCGCGAA GGCAACTGGG CAGGCCCCGC GGGCACCATC AGCAAAGATT ACTCGCCCTG CCAAGTCAGT AACGAAGCAT ACCGCTACAG TGAGTCTGCC AGTCTCAGTC TTGAGGTACC CATGGAGTGT ATTGATGCCA AACAAGGCCG AGCCAGCTTT AGCTTTGCCA ATAGTGACCG CTCGGCCCAA GATATTTGGA TCAAAATTGG CGGTGGTTGG GGAGATGCCG ATATTTATTA CGATTCAAGG GGATGGGCAA GTGCTGAAAA AAATCAGGGC TATGGCATCG GCAACGGTAA TTACCAAGTG ATCAAAGTGA GCCTAAACCC CAATGAACTT TGGCACTATA TCACCCTAGA AGGGGATTTT GGTGGTGTTG ATATGCTGGT CAGTACCAGT GAGTTAGTTG CCGATACCGA TCCCGACTTA GGTGATGGCG ATACTGGCGG TGAAGTGCCC AGCAACTGTG GTGCCGCAAC CATCAATTAC GGCAAGCTAA CCCTCGGAAA AGATGAATGT ATCAGCGGTG GTCGTAATAG CTTTTATTTC TGGGTCGATG CTGACAACAG CCAATTTAGT GTCAGCACAA CCGGGGGAAC AGGCGATGCC AATATTTACT TTAATGCCAA TACTTGGGCA AGTGCGAGCA ATGCCCAAGC CAGCAGTGTA AATCAAGGCA ACAAAGAGTC CTTTAGTTTT ACGGCAAACC GTGGCTGGCG ATATATCACG GTTGATACCG CGAGTGAATT TAGTGGCGTG ACCTTCAACC TTAAAGCCGG TGGTGGTGGC AGTAGCGTTC CTAATCAAAT TGCCAATGCC TGTGCGACTA AATCACCCGT GAGCTATACC CAACTCACGC CGGGTGATGC CGTCTGCAGC GCCAATGGCC GTAATGACTA TTATCTTTGG ATTCCAGAAG GGACAAGCCA ACTAGAAGTA CGCTCGGCCC ACGGAACTGG AGATGTCAGC CTTTACTCGG GTCGCAGCTG GGCCAATGCC CAGCAATATG AAGCCGCCTC AACAAACGCT GGCAGCACTA AGGAACAAAT CAAGGTCAAT AATCCCAGTG CTGGCTGGTA TTACATTACG CTCCAAAGTG AAGGCCAAAG TGCAGGTGTG GCCCTTCAGG TAGATTTACG CTAA
|
Protein sequence | MKQTLLFAAI SLAIATPSLA HNHSVQDPNN SVKAKNTAIP STLAPLASPS APLNIDVSHR PLLPTDSLIA SPNEPTHSEY LDAHSSQQQA LLENSPRSKV SRFAANADCS NFVGKSGQAL LDELTQSTPQ CVGKLFNLKG SDASSVFSEA NVSTVANAIA AKAPQYTGVD NQGIESHIYF VRAALYVQFY HPSDVPAYSS AVKNNLKSAL NALFANNAIW TLSDANASVL KEALILIDSA ELGADFNFVT LKVLNDYNST WQASFAMNAA ANTVFTTLFR AQWNTDMQAL FARDHAILDA LNQFQLNHRD LLGTNAEYVL VNAVRELSRL YYIDAMRPKV TQLVKNILNS TSKNDSSRVL WFAAAEMADY YDRSNCNAYQ ICGFKAQLAA DSLPFNWKCS DSLKIRAQDL YQDQAKWACG VLSQQESHFH TILETGMQAV AQDNNDDLEL VIFGSSSEYQ SLANSIFGIN TNNGGMYLEG SPAGLKNQAR FIAYEAEWRQ PDFHVWNLQH EYVHYLDGRY NLFGDFSRSV SANTIWWIEG LAEYISYRDA NPAAIAMGET GEFMLSTIFK NTYDSGQDRI YRWGYLAVRF MFENHRDDVR QILTFLRNNQ YAEYQAFMDS IGTRYDNEWR GWLTSGLSTT NNGIVDKGPS DEQANASGRE GNWAGPAGTI SKDYSPCQVS NEAYRYSESA SLSLEVPMEC IDAKQGRASF SFANSDRSAQ DIWIKIGGGW GDADIYYDSR GWASAEKNQG YGIGNGNYQV IKVSLNPNEL WHYITLEGDF GGVDMLVSTS ELVADTDPDL GDGDTGGEVP SNCGAATINY GKLTLGKDEC ISGGRNSFYF WVDADNSQFS VSTTGGTGDA NIYFNANTWA SASNAQASSV NQGNKESFSF TANRGWRYIT VDTASEFSGV TFNLKAGGGG SSVPNQIANA CATKSPVSYT QLTPGDAVCS ANGRNDYYLW IPEGTSQLEV RSAHGTGDVS LYSGRSWANA QQYEAASTNA GSTKEQIKVN NPSAGWYYIT LQSEGQSAGV ALQVDLR