Gene Information |
Locus tag | Shewmr4_1412 |
Symbol | |
ID | 4251990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 1647958 |
End bp | 1650747 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 638118011 |
Product | Insulysin |
Protein accession | YP_733547 |
Protein GI | 113969754 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
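
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the annotated CDS includes its stop codon; neither convention is stated in the record, but both are consistent with the listed Gene Length and Protein Length. Variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1647958
end_bp = 1650747
gene_length_bp = 2790
protein_length_aa = 929

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is the stop codon and is not translated.
expected_protein_length = codons - 1

print(span, codons, remainder, expected_protein_length)
```

Under these assumptions the span equals the listed gene length, divides evenly into codons, and leaves one codon (the stop) beyond the listed protein length.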
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.528026 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.896637 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGCAAGCCT CTCAGTCTAT TTACAAAAGT CCAAACGATC ATCGTCAGTA CCGTTATCTC GTGTTAGATA ATGCCCTCAG GGTATTATTG GTCGAAGATT TGGACGCCAG CCAAGCCGCC GCTTCCATGG CGGTGGCCGT AGGACATTTT GATGATCCTG TGGATAGACC CGGTATGGCA CATTTTTTAG AGCATATGCT CTTTTTAGGC ACTGAAAAAT TCCCTGACTC CGGTGAGTAT CATGCCTTTA TCAATCAGCA TGGAGGCAGT AATAATGCGT GGACCGGCAC AGAGCACACC AATTTCTTTT TTACCATCAA CGCCGACGTA TTTGCCGGCT CCCTAGACAG ATTTAGCCAA TTCTTTATCG CCCCCAAGTT TGACCTCGAC CTGGTCGATA GGGAGCGCCA AGCGATTGAA TCTGAATTTA GTTTAAAGCT GAAGGACGAC ATCCGCCGCA CCTATCAAGT GCTTAAAGAA ACCGTCAATC AACAGCATCC GTTTTCGAAA TTTTCTGTCG GCAATTTAGT CACCCTTGGC GGAGAGCAGG CGCAGGTCAG AAGCGAGCTA TTGGCGTTTT ATCAAAGCCA TTACAGTGCC AATTTAATGA CGCTCTGTTT AGTCGCCCCG ATGCCGTTAG ATGATCTTCA AGCGCTTGCG GCCCAATATT TCAGTGCGGT CCGCAACTTA AACTTAGTCA AACAGTATCC TGATGTGCCG CTGTTTTCAG AAAATGAACT GCTTAAGCAA ATCAATATTG TCCCCTTAAA GGAACAAAAG CGTTTAAGTA TCAGTTTTAA CTTCCCTGGC ATCGACCACT ACTACAAACG TAAGCCGCTG ACTTACATTA GCCATATCCT CGGCAATGAG AGTAAGGGTA GCCTGCTGTC GTATCTTAAA GAACAGGGGT TAGTGAATAA TCTCTCCGCG GGCGGCGGAG TCAATGGCTA TAACTTCAAG GACTACAGCA TAGGTCTGCA ACTGACCGAC AAAGGCGTGG CCAATATTGA CGATATTGTG TGCAGCTGCT TTGAGTATAT CGAGCTGATA AAAAACCAAG GCCTTGAGGA TTGGCGCTAT TTAGAACGAG CCAATCTGCT CAAGATGGCA TTTCGCTATC AGGAGCAAGT GAAATCCCTC GATTTAGCCA GCCACTTAAG CATCAACATG CACCACTACG AAGTGGAAGA CTTAGTCTTT GGTGACTATC GCATGGATGG CTTAGATATT AACGAAACCC TTGAGCTATT GAACTTAATG ACGCCGCAAA ATATGCGCTT ACAACTCATC GCACAGTCCG TTAAGACCGA CCGTAAAGCC AATTGGTACC ATACGCCATA TCAAGTCTTA CCGATAAAAC CAGAATCCCT TGCCCGTTGG CAAGTGACAC AAATTCGCCC TGAGTTGCAA TTGCCCGCAG CCAATCCCTT TATCGTGGCA GACTCCATTG CCCGCCCCGA TAAAAGTGAA GTCGCCGTGC CTGTGATTGT CGCTGAATCA ACAGGATACC GGATTTGGCA TAAAAAAGAC GATGAGTTTA ATGTCCCGAA AGGCCATATG TATCTATCGC TCGATTCGGA ACAGGCGAGT AAAACCCCAA AACATGCTGC CCTCACCCGC TTATACGTTG AGATGCTGCT CGATTATTTG ACTGAGCCAA CTTATCAGGC TGAAGTTGCT GGACTAAGTT ATAACATTTA CCCCCATCAG GGCGGGATCA CACTGCATCT ATCTGGTTTT ACGGGCAATC AAGAGACCCT GCTAGCCTTA CTTATCCAAA AGGCGAGAGA ACGTAATTTC 
ACTGAAGAAC GGTTCGCGTT AATTAAATCT CAGCTGCTGC GTAGCTGGCA AAACTTGGCG CAAGCCAAAC CCATTTCACA GCTATTTACT AGTCTGACCT CTACGTTGCA AAAACGCAGC TACGAGCCTG CTCGTATGGC ACAGCTGCTT GAAAATATTA CCTTAAATGA TCTCCATAAC CATGTACGCG CCTTTTACGA GAAGATTTAC CTCGAAGGGC TGATTTACGG CGATTGGCTT GTATCAGAGG CGCAAGCCTT AGGGAAACGA CTGGAACATA TCTTGTCCCT CGTCTCAAGC CCAAGCGCGG AGTCGACTCG AGAATTAATC AACTTAACAG GACAAGGCAC CCTCCTTCGG GAACTGGCAA TAGATCATCA AGACAGTGCA ATTATTGTGT ATTATCAATC AGCTATAGCA ACACCTGAGA AAATGGCGCT GTTTAGCTTA CTCAATCACA CTATGTCTTC GACCTTCTTC CATGAGCTAC GTACCGAAAA ACAATTGGGC TACATGGTCG GCACAGGTTA TCTGCCACTC AATCGCCATC CGGGACTCAT TTTCTATATT CAATCCCCGA CGACGGGACC ACTGCATTTA TTAGAGGCTA TTGATGAATT TATCGCGGAC TTTAATTATG CCGTGATGCA AATTACCAAT GAAGAATGGG AAAGCACCAA ACAGGGACTC ATCAATCAAG TGATGGAGCA TGATGCTAAC CTAAAAACTC GTGGCCAGCG TTATTGGGTG AGCGTGGGCA ACCGTGATTA TCAGTTTAAT CAGCGTGAAT TAGTGGTCGC GGAAATCAAC AAACTCACTC GACCAGATCT CATCAAGTTT ATGATGCGAA AAATGCGCAC TAAGCATAGC GATAGACTCG TGCTTTTTAG CACAGGTTCT GCCCATGCGG CGCAGTCAGC GCTCAAATCA GAGAATATGA TTACCGATCT TAAACTCTTT AAACAAAACA CTGAAAAGTT CAATTTTTAG
Protein sequence | MQASQSIYKS PNDHRQYRYL VLDNALRVLL VEDLDASQAA ASMAVAVGHF DDPVDRPGMA HFLEHMLFLG TEKFPDSGEY HAFINQHGGS NNAWTGTEHT NFFFTINADV FAGSLDRFSQ FFIAPKFDLD LVDRERQAIE SEFSLKLKDD IRRTYQVLKE TVNQQHPFSK FSVGNLVTLG GEQAQVRSEL LAFYQSHYSA NLMTLCLVAP MPLDDLQALA AQYFSAVRNL NLVKQYPDVP LFSENELLKQ INIVPLKEQK RLSISFNFPG IDHYYKRKPL TYISHILGNE SKGSLLSYLK EQGLVNNLSA GGGVNGYNFK DYSIGLQLTD KGVANIDDIV CSCFEYIELI KNQGLEDWRY LERANLLKMA FRYQEQVKSL DLASHLSINM HHYEVEDLVF GDYRMDGLDI NETLELLNLM TPQNMRLQLI AQSVKTDRKA NWYHTPYQVL PIKPESLARW QVTQIRPELQ LPAANPFIVA DSIARPDKSE VAVPVIVAES TGYRIWHKKD DEFNVPKGHM YLSLDSEQAS KTPKHAALTR LYVEMLLDYL TEPTYQAEVA GLSYNIYPHQ GGITLHLSGF TGNQETLLAL LIQKARERNF TEERFALIKS QLLRSWQNLA QAKPISQLFT SLTSTLQKRS YEPARMAQLL ENITLNDLHN HVRAFYEKIY LEGLIYGDWL VSEAQALGKR LEHILSLVSS PSAESTRELI NLTGQGTLLR ELAIDHQDSA IIVYYQSAIA TPEKMALFSL LNHTMSSTFF HELRTEKQLG YMVGTGYLPL NRHPGLIFYI QSPTTGPLHL LEAIDEFIAD FNYAVMQITN EEWESTKQGL INQVMEHDAN LKTRGQRYWV SVGNRDYQFN QRELVVAEIN KLTRPDLIKF MMRKMRTKHS DRLVLFSTGS AHAAQSALKS ENMITDLKLF KQNTEKFNF
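
The gene sequence begins with TTG while the protein begins with M. That is expected under translation table 11, where several alternative start codons are read as methionine when they initiate translation. The following is a minimal sketch of such a translation, not the database's own method: the function name `translate_cds` is hypothetical, the amino-acid string is the standard codon assignment in TCAG order, and the start-codon set is the one NCBI lists for table 11.

```python
from itertools import product

# Codon table built in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons as listed by NCBI for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence given in coding orientation.

    Whitespace (such as the 10-base grouping used above) is ignored.
    Translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in the first position is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGCAAGCCT CTCAGTCTAT TTACAAAAGT"))
```

Applied to the first 30 bases of the gene sequence, this yields the first ten residues of the listed protein sequence. The sequence is shown in coding orientation even though the gene lies on the minus strand, so no reverse complement is taken here.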