Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shel_20700 |
Symbol | |
ID | 8395959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Slackia heliotrinireducens DSM 20476 |
Kingdom | Bacteria |
Replicon accession | NC_013165 |
Strand | - |
Start bp | 2298537 |
End bp | 2301455 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644986819 |
Product | predicted Zn-dependent peptidase, insulinase |
Protein accession | YP_003144430 |
Protein GI | 257064758 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATTTG AAATCGGCGA GACCCTGCAC GGGTTTCGCG TCAGCTCGGT CGAGCCTCTT TCCGAAATCG ACGGCGAAGC CATCGTCATG CGTCACGAGC GCAGCGGTGC GCGTCTGCTG TTCCTGAAGA ACGAGGATGA AAACAAGGCG TTTTCCATCT CGTTCAAAAC GCCTCCGAAA GACAGCACGG GCGTGTTCCA CATCCTAGAG CATTCGGTGC TCTGCGGGTC CGAGAAGTTC CCGGTCAAGG AGCCGTTCGT CAATCTGCTC AAGACGTCCA TGCAGACGTT TTTGAACGCG ATGACCTTCC CGGACAAGAC GATGTACCCC GTGGCCAGCA CAAACATGCA GGACCTCATG AACCTGACGG ACGTCTACAT GGACGCCGTG CTTCGTCCTA ACATCTATCT GAAGCGTCAG CTGTTCGAGC AGGAAGGCTG GCACTACGAG CTGGACGAGG CCGATGAGGG CGCCGGGTCT CCCGAGCGCC TGCGCTACAA CGGTGTGGTG TTCAACGAGA TGAAGGGCGC GCTTTCCGAC CCCGAGGACG TGCTCAACTA CGAGCTCAAC AAAGCACTGT TCCCGAACAC CTGCTATGCG TTCGAGTCGG GCGGGCATCC GCGTAAGATT CCGACGCTCA CCTACGAGGA TTACCTGGAC ACCCACGCGC GCCATTACCG GCTGGACAAC TCCTACATCA TCCTGTACGG CGACATCGAC GCCGACCGCA TGCTGGGCCA TCTGGACGAA GAGTACCTGT CGGTCATCGA GCCGCGTGTG GAAGAAGGCC CCAATCCCAT TGGCATCCAG GAGCCGCTGG TCAACATGGA TGTGGTAGTG CCCATGGGCA CGGCTCCCGA AAACGCCTGC GTGGCCTTGG GATACGTGGT CGGCACTGCA CGCGATTTCG AACGTGTGCT GGCCACCGAC GTGCTGCTCG ATGCGCTGTT GGGCGGAAAC GAGTCGCCCA TCAAGCGTGC ACTGCTTGAC GAGGAGCTGG GCGGCAACGT GTTCTCGTAC CTGATGGATT CCCAGGCGCA GCCTGTGGCC ATGATCGGCG TCCGCAACGC CAAGCCCGGC ATCCGCACCC GCCTGCGCGA AGTGGTTGAG GAGCAGGCCG CCAAGCTGGT GCAGGAAGGC ATCCCCCGCG ACGTGCTGAA CGCGTCGCTT TCGCAGATCG CCTTCATGCT GCGCGAGCGC GACCGCGGCA TTGCCGACGG CGTGCCGCTG GCGATGAACG CCATGGCCGG CTGGCTGTAC GACGAGGACA TGCCCACCAC GTACCTTCGC TACGAGGAGC CGCTGGCCCA CATGCGCGAA GGTCTCGAGA ACGGGTACTT CGAGCGCCTG CTGGACGAGC TCATCGTCAA GAGCAACCAT AAGGCGCTGG TGGAGGTCCT TCCCACCGAG CCCGAAGGAG AAGGTGAAGA AGCCGCAGAG CTGGCCGAGA AGCTCGCGTC GATGACCGAA GCCGACAAGC AGGCCGTCCG CGACGACGTT GCGCTGCTGC GCAAGCACCA GGAGACGCCC GACGCACCGG AAGACGTGGC GAAGCTGCCC ATGCTGCACG TGTCCGACAT TGGACCTGCC AAGCCCGACC CAGCGTTCGA AGTCCTCGAG GACACGCCGT TGACCTGCCT GTTCCATGAG CTGCCGACGA GGCACATCGA CTACGTGTAC CACTATTTCG ACATCATGGA TCTGGATTGG GAGGACGTCC CGTACCTGAC GTTGCTGTCC GTGTTCACGG GCAGGCTGGC CACCGCAACC CGTTCGGCGG CTGAGGTGGA CGTATGGACG CGCCAGCATC TGGGCAGCCT TCATGTGGCC GCAGAGCCAC TGGTTGCTGA AGACGATCCT TCGAAAATCT CGTATCGCCT GGTGGTGGCC GCATCGGCCG TTGCCGAAGA AATCGAAAGC CTGGCTTCCA TCCCCATGGA GGTGTGCACG TCCATGCAGT TCGACGATGC CGGAAGGATG CGGGACATCC TTATCCAGCG TCGTGTCGGA CTGGAGCAGG CCTTTGCCAA CAACGGCCAT ATGTGCGCAT CGTCGCGCGT GGCGTCCTAT CTCATGCCGG CCGCCGTTCT GGCCGAGCAG AGCAACGGTG TGGACTACTA CAGGTTCCTG AAAGACCTGC TGGACCATTT CGACGAGCGT TTCGAGGGCC TGAAAGCGAA GCTCACCGAG CTGCAGAGCC GCATCTTCAC CAGGAACGGT CTGGTCACCA GCTTCGTGGG TTCGCGCGAG GAGCTTGACG CGTACTGGCG GGCCGCAGGG GATCTGGATC TTCCTGAAGG GGAGGAGAAG GTCCGGCGCC TGGTCATCCC CGAGCCCGTG GTGAAAAACG AGGCGTTCAT CGTGCCGACG GACGTGTGCT ATGTATCCAA GGGAACGATT GCATCTTCAG TGGGTTCCTA TTCGGGCTTG TGGCCGGTGG CATCGGCTGC TCTTTCTTAC AACTACCTGT GGAGCGAAGT CCGTGTGAAG GGCGGCGCCT ACGGTGTCGG ATTCCGCCGC ACGACCGCCG GTTTCGCACG ATTCCACACC TATAGGGACC CGAACATCGA CGAGAGCCTG CGCCGCTTCG ACGAGGCTGC CGCATGGCTG GCCGCCTTTG AGCCTACGCA AGACGAGATG GAGGGCTACA TCGTGAGCAC CGTGGCCACC CATGACTCGC CGGTCAAGCC CAAGCATATC GCCCGCCGGC AGGATACGGC CTATTTCAGG GACGACCCGA TGGACCTGCG CGAGCGTCGC CGCGAAGAGG AGCTCTCTGC CACGCCGCAG TCCATTCGCG ACTGCTCGGC GGTGCTGCGG AAGATCGCCG ATGAGGGTGC GTGGTGCGTG TTCGGAAACG AAAACATGAT TCGTTCGGCG ACCACGCCGT TGAATGTGAT TGACCTGCTG AACGAATAG
|
Protein sequence | MAFEIGETLH GFRVSSVEPL SEIDGEAIVM RHERSGARLL FLKNEDENKA FSISFKTPPK DSTGVFHILE HSVLCGSEKF PVKEPFVNLL KTSMQTFLNA MTFPDKTMYP VASTNMQDLM NLTDVYMDAV LRPNIYLKRQ LFEQEGWHYE LDEADEGAGS PERLRYNGVV FNEMKGALSD PEDVLNYELN KALFPNTCYA FESGGHPRKI PTLTYEDYLD THARHYRLDN SYIILYGDID ADRMLGHLDE EYLSVIEPRV EEGPNPIGIQ EPLVNMDVVV PMGTAPENAC VALGYVVGTA RDFERVLATD VLLDALLGGN ESPIKRALLD EELGGNVFSY LMDSQAQPVA MIGVRNAKPG IRTRLREVVE EQAAKLVQEG IPRDVLNASL SQIAFMLRER DRGIADGVPL AMNAMAGWLY DEDMPTTYLR YEEPLAHMRE GLENGYFERL LDELIVKSNH KALVEVLPTE PEGEGEEAAE LAEKLASMTE ADKQAVRDDV ALLRKHQETP DAPEDVAKLP MLHVSDIGPA KPDPAFEVLE DTPLTCLFHE LPTRHIDYVY HYFDIMDLDW EDVPYLTLLS VFTGRLATAT RSAAEVDVWT RQHLGSLHVA AEPLVAEDDP SKISYRLVVA ASAVAEEIES LASIPMEVCT SMQFDDAGRM RDILIQRRVG LEQAFANNGH MCASSRVASY LMPAAVLAEQ SNGVDYYRFL KDLLDHFDER FEGLKAKLTE LQSRIFTRNG LVTSFVGSRE ELDAYWRAAG DLDLPEGEEK VRRLVIPEPV VKNEAFIVPT DVCYVSKGTI ASSVGSYSGL WPVASAALSY NYLWSEVRVK GGAYGVGFRR TTAGFARFHT YRDPNIDESL RRFDEAAAWL AAFEPTQDEM EGYIVSTVAT HDSPVKPKHI ARRQDTAYFR DDPMDLRERR REEELSATPQ SIRDCSAVLR KIADEGAWCV FGNENMIRSA TTPLNVIDLL NE
|
| |