Gene Information |
Locus tag | Sfri_2078 |
Symbol | |
ID | 4278990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella frigidimarina NCIMB 400 |
Kingdom | Bacteria |
Replicon accession | NC_008345 |
Strand | + |
Start bp | 2466343 |
End bp | 2468583 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638134867 |
Product | peptidase U32 |
Protein accession | YP_750762 |
Protein GI | 114563249 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.292848 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGAA AAATCGAATT ATTGGCACCC GGTGGAGATG TCGATGCGAT CAAAGCTGCG ATAATAGCCG GTGCAAACGC CGTTTATTGT GGCTTAGATA CCTTCAACGC CCGTAACCGT GCAGCAAACC TATCATTCGA TGACTTAATT GGTGTTATCG CGCTTGCACA TCAATATCAG TGTGAAGTGT TTCTTACTAT GAACATCGTT ATTTTAGAAA ACGAAGTTCC GACATTGGTT AAATTGCTCA ATCAGCTTGT GAATACGGGT ATCGACGGTA TTATCGTCCA GGATTTAGGC CTGTTTAATC TAGTGACTAA ACATTTTCCA AGCCTAGCAA TTCATGCCTC AACCCAGCTG ACAACTCATA ACGAAGGTCA AATTGGCTTC TTAGCCAAAA TCGGCGCCAC TCGAGTCAAT TTATCTCGTG AGCTAAACCT TGGCGAAATT AAAAGCCTCA CCACCCTCGC CCACGATCAC GAAGTGCTTA CCGAAGTTTT TGTTCACGGT GCATTATGTA TCGCCTTCTC TGGTCAATGT TATTCAAGCT CAGTGAGTGT TGGTAATTCT GGCAACCGTG GCCGTTGTAG CCAGGCATGT CGTGACGAAT ATGAAGTCAC AGCTGCGGGC AACAAATTTC CGCTGAATCT AAAAGACAAC TCAGCTTACT TTGATTTACC AGAATTGGTT GATGCCGGTG TTGACTCACT CAAAATTGAA GGCCGCATTA AAGGTGCACA ATATGTGCAT ACCGTGGTGG ATAGCTGGCG TAAACAGATC GACAAATTTA TCGAAACTGG CAAGCTATTA GCCGATGACT CGAACTTACA TAAAGTATTC AACCGTGATT TTACCAACTC GTTTTTAAAG GGTAACCTCA CCAAAGACAT GTTTATCGAC AATCCACGCG ATAACAGCTT TAAACATGCT AACGATAAAA GTAACGCGAT ATCAGTGGTT CAAATTCAAG AAGCGCAACA AACACTGTCG TCAGAAAAAG ACGCCATTGT GCAACTGGTT GCCGAAAAAA TTAATCACCT GAGCATTGCC AAGCCAACGT TAGCACTGGC ATTTTCAGGT CAAGTTGATC AACCATTGTC GATCACGGTG ACCACGCCAG ACCAACACTT TGTGATTGAA TCATCTATCT CTCTAACCCA AGCGACAGAA TCGCGGGTTG ATGAAGCGGC CATTGAGAAA CGCTTTAAAA GCTTAAAAGG CAGTGGTTAC TTGTTACAAG CGTTTAATTA TGACGGCTTG CAGGCTGATT TAAGCCTACC ATTTAGTCAG TTAACTCAGC TTAAAAACCA ACTGTTATTA CAGCTAACAC AAAGAGAATA CATAGGTGCA GTAACCTTAC CCAAGTTACC TAAGCATCCT AAAGTAACAG AAACACCGAC ATTATCTTTG CTTATTAGTG ATGAAAAAGA CGTCAACTTA TGTGATGTTA CCGATGCCGA CATCTACTTT AAGCTGCCTG AAAGCTTTAA GAAAAACGAC AACAAGTATA TTGATATCTT TTTACGCAAC CCAAGATTAA TCCCGTGGTT TCCGGCTGTG CTTATTGGCA AAGATTACAT CGAAGCAGTA AGAGTGTTAG AAGTGGTTAA GCCTAAGCGA ATTGTTACTA ACAATACCGG CGTCGCGTTT AAAGCTTATG AAATGGACAT TGAGTGGATC GCAGGCCCGT TTTTAAACAC CACTAACTCT TATGCCTTGC TGACATTACA AGAGCAATTG AATTGTGCTG GTGCGTTTAT TTCGAACGAG ATTAATCGTA ATCAAATTAA AAACATCGCC 
AGGCCGGAAA ACTTCAAACT GCTTTACAGC ATCTATCACC CAATTTTGAT GATGACCAGC AGACAATGCT TTTTCCAACA AACGGTAGGC TGTAATAAGC CCAGCATTGA AGACGGTTGC ATGCTCAAGT GTGAAAAAGC CACTACCATC ACTAATGTGA AAGGTATTTC GTTTGCAGTA GACAAACAAA AAGGCGGTTA CCCGAGCATC TATAACCACG AGCAGTTTTT AAACATCGAA GCTGTTGAAG ATTTATCACA CTTATTCGAT GAGTTCTTTA TCGATCTCAC CAACATAGGA TCTGGCTCAA AAGCCGAAAT AGATAAAACT CAGCTGGTAA CCCACTTCGA AAATGTCCTT AAAGGCGTTA GCGAATCGAA ACTAGAGTTA AATCAACTGG TGACCATTTC GACTAATGCC CAGTATCAGC AAGGTTTATA A
|
Protein sequence | MNRKIELLAP GGDVDAIKAA IIAGANAVYC GLDTFNARNR AANLSFDDLI GVIALAHQYQ CEVFLTMNIV ILENEVPTLV KLLNQLVNTG IDGIIVQDLG LFNLVTKHFP SLAIHASTQL TTHNEGQIGF LAKIGATRVN LSRELNLGEI KSLTTLAHDH EVLTEVFVHG ALCIAFSGQC YSSSVSVGNS GNRGRCSQAC RDEYEVTAAG NKFPLNLKDN SAYFDLPELV DAGVDSLKIE GRIKGAQYVH TVVDSWRKQI DKFIETGKLL ADDSNLHKVF NRDFTNSFLK GNLTKDMFID NPRDNSFKHA NDKSNAISVV QIQEAQQTLS SEKDAIVQLV AEKINHLSIA KPTLALAFSG QVDQPLSITV TTPDQHFVIE SSISLTQATE SRVDEAAIEK RFKSLKGSGY LLQAFNYDGL QADLSLPFSQ LTQLKNQLLL QLTQREYIGA VTLPKLPKHP KVTETPTLSL LISDEKDVNL CDVTDADIYF KLPESFKKND NKYIDIFLRN PRLIPWFPAV LIGKDYIEAV RVLEVVKPKR IVTNNTGVAF KAYEMDIEWI AGPFLNTTNS YALLTLQEQL NCAGAFISNE INRNQIKNIA RPENFKLLYS IYHPILMMTS RQCFFQQTVG CNKPSIEDGC MLKCEKATTI TNVKGISFAV DKQKGGYPSI YNHEQFLNIE AVEDLSHLFD EFFIDLTNIG SGSKAEIDKT QLVTHFENVL KGVSESKLEL NQLVTISTNA QYQQGL
|
| |
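The coordinate and length fields in the Gene Information table are related by simple arithmetic, which the short sketch below checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single untranslated stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2466343
end_bp = 2468583
gene_length_bp = 2241
protein_length_aa = 746

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein is one residue shorter than the codon count.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print("span (bp):", span)
print("leftover bases after whole codons:", remainder)
print("expected protein length (aa):", expected_protein_length)
print("matches Gene Length field:", span == gene_length_bp)
print("matches Protein Length field:", expected_protein_length == protein_length_aa)
```

Under these assumptions the span works out to 2241 bp, which divides evenly into 747 codons, giving the 746 aa listed for the protein.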