Gene Shel_20700 details

Gene Information

Locus tag: Shel_20700
Symbol:
ID: 8395959
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Slackia heliotrinireducens DSM 20476
Kingdom: Bacteria
Replicon accession: NC_013165
Strand:
Start bp: 2298537
End bp: 2301455
Gene Length: 2919 bp
Protein Length: 972 aa
Translation table: 11
GC content: 62%
IMG OID: 644986819
Product: predicted Zn-dependent peptidase, insulinase
Protein accession: YP_003144430
Protein GI: 257064758
COG category: [R] General function prediction only
COG ID: [COG1026] Predicted Zn-dependent peptidases, insulinase-like
TIGRFAM ID:
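
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the final codon of this unspliced CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(2298537, 2301455)
print(length, protein_length_aa(length))  # prints "2919 972"
```

Both values agree with the Gene Length and Protein Length fields of the record.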


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATTTG AAATCGGCGA GACCCTGCAC GGGTTTCGCG TCAGCTCGGT CGAGCCTCTT 
TCCGAAATCG ACGGCGAAGC CATCGTCATG CGTCACGAGC GCAGCGGTGC GCGTCTGCTG
TTCCTGAAGA ACGAGGATGA AAACAAGGCG TTTTCCATCT CGTTCAAAAC GCCTCCGAAA
GACAGCACGG GCGTGTTCCA CATCCTAGAG CATTCGGTGC TCTGCGGGTC CGAGAAGTTC
CCGGTCAAGG AGCCGTTCGT CAATCTGCTC AAGACGTCCA TGCAGACGTT TTTGAACGCG
ATGACCTTCC CGGACAAGAC GATGTACCCC GTGGCCAGCA CAAACATGCA GGACCTCATG
AACCTGACGG ACGTCTACAT GGACGCCGTG CTTCGTCCTA ACATCTATCT GAAGCGTCAG
CTGTTCGAGC AGGAAGGCTG GCACTACGAG CTGGACGAGG CCGATGAGGG CGCCGGGTCT
CCCGAGCGCC TGCGCTACAA CGGTGTGGTG TTCAACGAGA TGAAGGGCGC GCTTTCCGAC
CCCGAGGACG TGCTCAACTA CGAGCTCAAC AAAGCACTGT TCCCGAACAC CTGCTATGCG
TTCGAGTCGG GCGGGCATCC GCGTAAGATT CCGACGCTCA CCTACGAGGA TTACCTGGAC
ACCCACGCGC GCCATTACCG GCTGGACAAC TCCTACATCA TCCTGTACGG CGACATCGAC
GCCGACCGCA TGCTGGGCCA TCTGGACGAA GAGTACCTGT CGGTCATCGA GCCGCGTGTG
GAAGAAGGCC CCAATCCCAT TGGCATCCAG GAGCCGCTGG TCAACATGGA TGTGGTAGTG
CCCATGGGCA CGGCTCCCGA AAACGCCTGC GTGGCCTTGG GATACGTGGT CGGCACTGCA
CGCGATTTCG AACGTGTGCT GGCCACCGAC GTGCTGCTCG ATGCGCTGTT GGGCGGAAAC
GAGTCGCCCA TCAAGCGTGC ACTGCTTGAC GAGGAGCTGG GCGGCAACGT GTTCTCGTAC
CTGATGGATT CCCAGGCGCA GCCTGTGGCC ATGATCGGCG TCCGCAACGC CAAGCCCGGC
ATCCGCACCC GCCTGCGCGA AGTGGTTGAG GAGCAGGCCG CCAAGCTGGT GCAGGAAGGC
ATCCCCCGCG ACGTGCTGAA CGCGTCGCTT TCGCAGATCG CCTTCATGCT GCGCGAGCGC
GACCGCGGCA TTGCCGACGG CGTGCCGCTG GCGATGAACG CCATGGCCGG CTGGCTGTAC
GACGAGGACA TGCCCACCAC GTACCTTCGC TACGAGGAGC CGCTGGCCCA CATGCGCGAA
GGTCTCGAGA ACGGGTACTT CGAGCGCCTG CTGGACGAGC TCATCGTCAA GAGCAACCAT
AAGGCGCTGG TGGAGGTCCT TCCCACCGAG CCCGAAGGAG AAGGTGAAGA AGCCGCAGAG
CTGGCCGAGA AGCTCGCGTC GATGACCGAA GCCGACAAGC AGGCCGTCCG CGACGACGTT
GCGCTGCTGC GCAAGCACCA GGAGACGCCC GACGCACCGG AAGACGTGGC GAAGCTGCCC
ATGCTGCACG TGTCCGACAT TGGACCTGCC AAGCCCGACC CAGCGTTCGA AGTCCTCGAG
GACACGCCGT TGACCTGCCT GTTCCATGAG CTGCCGACGA GGCACATCGA CTACGTGTAC
CACTATTTCG ACATCATGGA TCTGGATTGG GAGGACGTCC CGTACCTGAC GTTGCTGTCC
GTGTTCACGG GCAGGCTGGC CACCGCAACC CGTTCGGCGG CTGAGGTGGA CGTATGGACG
CGCCAGCATC TGGGCAGCCT TCATGTGGCC GCAGAGCCAC TGGTTGCTGA AGACGATCCT
TCGAAAATCT CGTATCGCCT GGTGGTGGCC GCATCGGCCG TTGCCGAAGA AATCGAAAGC
CTGGCTTCCA TCCCCATGGA GGTGTGCACG TCCATGCAGT TCGACGATGC CGGAAGGATG
CGGGACATCC TTATCCAGCG TCGTGTCGGA CTGGAGCAGG CCTTTGCCAA CAACGGCCAT
ATGTGCGCAT CGTCGCGCGT GGCGTCCTAT CTCATGCCGG CCGCCGTTCT GGCCGAGCAG
AGCAACGGTG TGGACTACTA CAGGTTCCTG AAAGACCTGC TGGACCATTT CGACGAGCGT
TTCGAGGGCC TGAAAGCGAA GCTCACCGAG CTGCAGAGCC GCATCTTCAC CAGGAACGGT
CTGGTCACCA GCTTCGTGGG TTCGCGCGAG GAGCTTGACG CGTACTGGCG GGCCGCAGGG
GATCTGGATC TTCCTGAAGG GGAGGAGAAG GTCCGGCGCC TGGTCATCCC CGAGCCCGTG
GTGAAAAACG AGGCGTTCAT CGTGCCGACG GACGTGTGCT ATGTATCCAA GGGAACGATT
GCATCTTCAG TGGGTTCCTA TTCGGGCTTG TGGCCGGTGG CATCGGCTGC TCTTTCTTAC
AACTACCTGT GGAGCGAAGT CCGTGTGAAG GGCGGCGCCT ACGGTGTCGG ATTCCGCCGC
ACGACCGCCG GTTTCGCACG ATTCCACACC TATAGGGACC CGAACATCGA CGAGAGCCTG
CGCCGCTTCG ACGAGGCTGC CGCATGGCTG GCCGCCTTTG AGCCTACGCA AGACGAGATG
GAGGGCTACA TCGTGAGCAC CGTGGCCACC CATGACTCGC CGGTCAAGCC CAAGCATATC
GCCCGCCGGC AGGATACGGC CTATTTCAGG GACGACCCGA TGGACCTGCG CGAGCGTCGC
CGCGAAGAGG AGCTCTCTGC CACGCCGCAG TCCATTCGCG ACTGCTCGGC GGTGCTGCGG
AAGATCGCCG ATGAGGGTGC GTGGTGCGTG TTCGGAAACG AAAACATGAT TCGTTCGGCG
ACCACGCCGT TGAATGTGAT TGACCTGCTG AACGAATAG
 
Protein sequence
MAFEIGETLH GFRVSSVEPL SEIDGEAIVM RHERSGARLL FLKNEDENKA FSISFKTPPK 
DSTGVFHILE HSVLCGSEKF PVKEPFVNLL KTSMQTFLNA MTFPDKTMYP VASTNMQDLM
NLTDVYMDAV LRPNIYLKRQ LFEQEGWHYE LDEADEGAGS PERLRYNGVV FNEMKGALSD
PEDVLNYELN KALFPNTCYA FESGGHPRKI PTLTYEDYLD THARHYRLDN SYIILYGDID
ADRMLGHLDE EYLSVIEPRV EEGPNPIGIQ EPLVNMDVVV PMGTAPENAC VALGYVVGTA
RDFERVLATD VLLDALLGGN ESPIKRALLD EELGGNVFSY LMDSQAQPVA MIGVRNAKPG
IRTRLREVVE EQAAKLVQEG IPRDVLNASL SQIAFMLRER DRGIADGVPL AMNAMAGWLY
DEDMPTTYLR YEEPLAHMRE GLENGYFERL LDELIVKSNH KALVEVLPTE PEGEGEEAAE
LAEKLASMTE ADKQAVRDDV ALLRKHQETP DAPEDVAKLP MLHVSDIGPA KPDPAFEVLE
DTPLTCLFHE LPTRHIDYVY HYFDIMDLDW EDVPYLTLLS VFTGRLATAT RSAAEVDVWT
RQHLGSLHVA AEPLVAEDDP SKISYRLVVA ASAVAEEIES LASIPMEVCT SMQFDDAGRM
RDILIQRRVG LEQAFANNGH MCASSRVASY LMPAAVLAEQ SNGVDYYRFL KDLLDHFDER
FEGLKAKLTE LQSRIFTRNG LVTSFVGSRE ELDAYWRAAG DLDLPEGEEK VRRLVIPEPV
VKNEAFIVPT DVCYVSKGTI ASSVGSYSGL WPVASAALSY NYLWSEVRVK GGAYGVGFRR
TTAGFARFHT YRDPNIDESL RRFDEAAAWL AAFEPTQDEM EGYIVSTVAT HDSPVKPKHI
ARRQDTAYFR DDPMDLRERR REEELSATPQ SIRDCSAVLR KIADEGAWCV FGNENMIRSA
TTPLNVIDLL NE
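
The relationship between the two sequence blocks can be checked with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the sequences are pasted in as strings with the 10-character grouping intact, and it uses the standard codon assignments, which translation table 11 shares for internal codons. Table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG. All names are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGCATTTG AAATCGGCGA GACCCTGCAC"))  # prints "MAFEIGETLH"
print(translate("AACGAATAG"))                         # prints "NE"
```

The two printed fragments match the first ten and last two residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.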