Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr7_0828 |
Symbol | |
ID | 4256864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-7 |
Kingdom | Bacteria |
Replicon accession | NC_008322 |
Strand | + |
Start bp | 939902 |
End bp | 941818 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638121439 |
Product | peptidase U32 |
Protein accession | YP_736884 |
Protein GI | 114046334 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00452719 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGC CAGAGATTTA CACACCTGAG ACACTTTCTC ACGTTAATAA CCGCTTAGAG TTATTGGCGC CTGCAAAAAA TGCCGATTAT GGCATTGAAG CCATTCGCCA TGGTGCCGAC GCGGTTTATA TCGGTGGTCC CGCATTCGGC GCGCGTGCGA CCGCGGGTAA CAGTGTAGAA GATATCGCCC GTCTGTGTAC TTTTGCTCAT AAGTATCATG CGCAAGTGTT TGTGGCCATT AACACTATTC TGATGGACGA TGAACTCGAG ACCGCTGAAA AGCTGATTTG GGATGTGTAT AACGCGGGCG CCGATGCACT GATTGTGCAG GACATGGGCG TATTGCAACT TAACCTGCCG CCGATTGCGC TGCATGCTAG TACGCAAATG GATAACCGTA ATCCCGAGAA AGTCGCCTTC TTAGAGCAAG TGGGTTTCTC ACAAGTGGTA TTGGCGCGTG AGTTAGGTTT AAGCCAAATC CGTGAGGTTG CGGCTCACAC CAATATGCAG ATTGAGTTCT TTATTCACGG CGCCCTGTGT GTGGCCTACA GCGGCTTATG TAACTTAAGT CATGCCTTCA GTAACCGCAG TGCAAACCGC GGCGAATGTT CGCAAATGTG TCGTCTGCCG GGCAATCTTA AGACCCGCCA AGGGGATGTG TTGGCGCAAA ATGAGCACTT ACTCTCATTA AAAGACAATA ACCAAACCGA TAACCTCGAA GCCTTGATTG ATGCCGGCGT GCGCTCATTC AAAATTGAGG GGCGTTTAAA GGACTTAAGT TACGTTAAAA ACGTGACCGC CCATTATCGT CAAAAGCTCG ATGCCATTAT GGCGCGTCGC CCTGAGTTTG TGGCGTCATC CCATGGCCGT ACTGAACATA CCTTTACTCC GGATCCCGAA AAAACCTTTA ACCGTGGCAG CACAGATTAC TTTGTGAATG AGCGTAGCCA AGGGATTAAA GACTTCCGCT CGCCAAAATA TATCGGGCAA GATGTGGGTA AAGTGGTCGC CATCGGCAAA GACTTTATTC AAGTCAGTTC AACCCACGAG TTTAATAACG GCGATGGTTT AGCCTATTTC CCGCCAAACT ATGCGATGGC CAAACAGTCC GATGACAAAT TGCAGGGACT GCGTGTTAAC CGTGCCGAAG GTCATAAGCT GCATGTATTG CAGGTGCCGC GCGATCTGCG TGTTGGTATG ACCTTATACC GTAACCATAA CCAAGCTTTC GAGACGTTAC TTTCTAAGGA GTCAGCCAAG CGTATTATCG GCGTCGACAT GCGTTTAACC GATACTGCTA CGGGCGTGGC GCTGACCTTA ACGGATATTT ACGGCCTCAG TGCGACGGTT GAGCTTGCAG TCGAAAAGAC GCCCGCCACC GACGCTGAAA AAACCTTGCA GACTATCCGT ACTCAATTGT CTAAGCTGGG CAGTACCGAT TTTACTGCGC GCCAGATCAG TATCGAAACT GCCGAGCCTT GGTTCCTGCC TGCATCTGTC CTCAACGGCC TGCGCCGTGA TGCGGTCGCA GCATTAGAAC TTGCCCGTGT TGAAGGTTAC CAGCGTCCAA AACCTTGGAA ATATAATCAA GATGCCGTCT ATCCATTCAA ACACTTAAGT TACTTGGGTA ACGTGGCAAA CGAAAAGGCG AAGGACTTTT ATCAACGCCA TGGCGTGATT GAAATTCAGG ACACCTACGA GAAAAACGGC GTGACCGAAG ATGTGCCGTT AATGATCACT AAGCATTGCC TGCGATTTAA CTTCAATCTC TGTCCTAAGG AAGTGCCGGG CATTAAGGCG GATCCTATGG TGCTTGAGAT AGGTAACGAT GTGCTTAAGT TAGTTTTCGA TTGTCCAAAA TGCGAGATGA TGGTCGTCGG TGAAAACCGT CAGGTTCGCG GTCAAAAAGC CGTTTAA
|
Protein sequence | MSQPEIYTPE TLSHVNNRLE LLAPAKNADY GIEAIRHGAD AVYIGGPAFG ARATAGNSVE DIARLCTFAH KYHAQVFVAI NTILMDDELE TAEKLIWDVY NAGADALIVQ DMGVLQLNLP PIALHASTQM DNRNPEKVAF LEQVGFSQVV LARELGLSQI REVAAHTNMQ IEFFIHGALC VAYSGLCNLS HAFSNRSANR GECSQMCRLP GNLKTRQGDV LAQNEHLLSL KDNNQTDNLE ALIDAGVRSF KIEGRLKDLS YVKNVTAHYR QKLDAIMARR PEFVASSHGR TEHTFTPDPE KTFNRGSTDY FVNERSQGIK DFRSPKYIGQ DVGKVVAIGK DFIQVSSTHE FNNGDGLAYF PPNYAMAKQS DDKLQGLRVN RAEGHKLHVL QVPRDLRVGM TLYRNHNQAF ETLLSKESAK RIIGVDMRLT DTATGVALTL TDIYGLSATV ELAVEKTPAT DAEKTLQTIR TQLSKLGSTD FTARQISIET AEPWFLPASV LNGLRRDAVA ALELARVEGY QRPKPWKYNQ DAVYPFKHLS YLGNVANEKA KDFYQRHGVI EIQDTYEKNG VTEDVPLMIT KHCLRFNFNL CPKEVPGIKA DPMVLEIGND VLKLVFDCPK CEMMVVGENR QVRGQKAV
|
| |