Gene Shel_14440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShel_14440 
Symbol 
ID8395334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSlackia heliotrinireducens DSM 20476 
KingdomBacteria 
Replicon accessionNC_013165 
Strand
Start bp1651048 
End bp1653501 
Gene Length2454 bp 
Protein Length817 aa 
Translation table11 
GC content56% 
IMG OID644986198 
Productsubtilase family protease 
Protein accessionYP_003143813 
Protein GI257064141 
COG category[R] General function prediction only 
COG ID[COG5263] FOG: Glucan-binding domain (YG repeat) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCCG CAATACACTA CATAGGCGCC CCGATCCAGA AGATGTGCGT CTGTCTGGTT 
TGGCTGTGCA TGCTCGTTGT ACTGGGCGGA TGTTCTGTAC AGCAAGATTA TAATCCGAGC
GATATATCTT TTGACACTAT CGACGATGCC GCTTTGCAGG TTGTCGATGA GGGAGACTCA
AATGAATCCG ATGTTTATGA ACCACTCGCC GATCCAGCTG TCACCAAGGG TACCGACTCG
GTTGTAGAAG CGTCGGATGA GCCGGGACTA CCCGATAGCG AGATAGAGGA TTTCGAGCAG
CTGCTCGTGG GCGGCTCCGG GGAATCAGGC ACCGTCGCCG ATGTGCCAAC AGACGCGTAT
GATGGCTACA TCGTCTCCGT TGACGAAGAG CGGTTGGCAG ACTTCCAATT CGCAATGGAG
CTCGAAGCCG CTGTCGATGC TGGAGTGCTG ACACCTGTAG CGGGCAACTT CTACACCTGC
AACGATTACA GGAGCCTCCC CGGAACTATT CATGCCGACA TGGTCGAGTC GATTGAGCCT
GATTACTATG TCGAGGCGTT TGACGATGGA TCGGACGAAG GCATTGTTCC CGATGTGCGT
CCAGTGCTGG AGTCGGAAAG CGGCCTTGAT CAGGCTCCTG AAACCGCATC CGACACGGAA
GTGATGGGCG ACGCCGTTAC ATCGGATACG CACGAAGCAG ACGCTTCCGA CGAATCCTCG
GAGCCTGATA CGGACGAATC TGCTGACGTT TCCGAAGCTG CGAAGTCTGA AGCGCAGCCT
GAGTCAGAAG AGACGGACGA AACCTCGGAA GAGCCGGTGG AGACGCAAGC GATCAAGGGG
TACAACGATC CGCGCATATC GGAGCAATGG TACCTTACAT CTACAAATGC CTCCGCAGCG
TGGGCGGCGG GATACACCGG TGCGGTCAGC GAATCTGCTA AGGGCGTCAG ACCGCTGGTC
GCGATCGTCG ACACGGGATT GTGCGGAACG GGGTCTTCTT CCGTAAAGCA TGAAGACATC
AATTATGGCA ACGTTGTAGC TGGCTGGAAC GTCGTCGCAG GTTCGTACGA TACCGCACCG
GTGTCTCGTC ACGGCACCAT GTGCGCCGGC ATAATCGGTG CCGAACAGAA CAACGGCAAG
GGCATAGCAG GCCTGGTGCC CGATGCCGCT ATGGCTCCCG TAATGATCTT CGGCGATTCG
GGCAGCACGA GCGTGAGTAA CCTTATAGCC GGTATTTATG CTGGTGTCGA CTATACGGGT
GCAAATGTCA TGAACCTCAG CGTGGGTGTG TCCGAAATGT ACTTTGAGAA CCACAGCATC
TCGCCGTTGC AGGCGGCCGT CGATTATGCG GCATCCAAAA ACGTGCTTAT GGTTGCGGCT
GTCGGCAACT ACGGCACCGG GTCAAATCCC CTCATGTACC CTGCAGGGTT CAGCAACGTT
GTGGGTGTAG GCGGCGTTTC GCAGGGAAGC CTAGACCACT ATCCGTCATC CGAGTTCAAC
GATAGCGTTT ATTGCGTGGC GCCTGGCCAA AACATTCTCA GCACGGCAAT AGCGAATAAG
TCTGCGTATT CCGCAGGAAG CGGAACGTCC TACGCTGCGC CTATGGTGAG TGCTCTTGCC
GCCATGTGCA AATCCGTCGA CGGCAGCATG ACGGTCTCCG AATTCAAGAA GTTCATCAGG
TCGACCAGCA CCGATTTGGG TGCTTCGGGG TACGACATCT ATTACGGCTA CGGCGTTATC
AACTTCAACG CGGCGGCCAA GCAGCTGAAT GCTATGGGAG GCCACGTGAG CCGGGAGGGC
TGGGTCCTCG AAGACGGGGG CCAGTACTAT TATGTGAACG ACAGCCCCGT TTGCAACGCA
TGGAAGAGAA TCGACGGGTA CTGGTATTAC TTTAAGTCCG ATGGGCGCGC GGCTGCCAGC
GAATGGGAGA AAGTGGGTTC GTACTGGTAT CACTTCGACA GCAGCGGGCA CATGCAGAAG
AACCGCTGGC TTAAATCCGG TGGCGGCTGG TACTACCTTG GAAGCAACGG AGCTGCGCTT
ACGGGGTGGC AGAAGCTGTC CTACGGAGGC TCTTCGAAAT GGTTCTATTT CAACGGTGAC
TGCAAGATGC TGACCGGGTT GCAGACTATA CGATACGGCG GTTCTGACAA GGTGTTCTAT
TTCGACGGTT CGGGCGTCAT GGTCACTGGC TGGCAGAAGG CCGGCGGGCA TTGGTACTAC
TTCGGAGGCG ACGGTGCCGG TGTAACCGGT TGGCAGAAGC TGGCGTATGG GTCGTCTACC
AGGTGGTATT ACTTTAACGG CGACGCCACG ATGGCCACCG GATGGAAGAA GATCAGGTAT
GCGGGAGCTG ACACGTGGTT CTATTTCGGC GGCGACGGCG CCATGCGAAC CGGCACGCAG
ACGATAGGTG GAAAAACTTA TCGCTTCGCC TCCAACGGCG CGTGGATAGG GTAA
 
Protein sequence
MKPAIHYIGA PIQKMCVCLV WLCMLVVLGG CSVQQDYNPS DISFDTIDDA ALQVVDEGDS 
NESDVYEPLA DPAVTKGTDS VVEASDEPGL PDSEIEDFEQ LLVGGSGESG TVADVPTDAY
DGYIVSVDEE RLADFQFAME LEAAVDAGVL TPVAGNFYTC NDYRSLPGTI HADMVESIEP
DYYVEAFDDG SDEGIVPDVR PVLESESGLD QAPETASDTE VMGDAVTSDT HEADASDESS
EPDTDESADV SEAAKSEAQP ESEETDETSE EPVETQAIKG YNDPRISEQW YLTSTNASAA
WAAGYTGAVS ESAKGVRPLV AIVDTGLCGT GSSSVKHEDI NYGNVVAGWN VVAGSYDTAP
VSRHGTMCAG IIGAEQNNGK GIAGLVPDAA MAPVMIFGDS GSTSVSNLIA GIYAGVDYTG
ANVMNLSVGV SEMYFENHSI SPLQAAVDYA ASKNVLMVAA VGNYGTGSNP LMYPAGFSNV
VGVGGVSQGS LDHYPSSEFN DSVYCVAPGQ NILSTAIANK SAYSAGSGTS YAAPMVSALA
AMCKSVDGSM TVSEFKKFIR STSTDLGASG YDIYYGYGVI NFNAAAKQLN AMGGHVSREG
WVLEDGGQYY YVNDSPVCNA WKRIDGYWYY FKSDGRAAAS EWEKVGSYWY HFDSSGHMQK
NRWLKSGGGW YYLGSNGAAL TGWQKLSYGG SSKWFYFNGD CKMLTGLQTI RYGGSDKVFY
FDGSGVMVTG WQKAGGHWYY FGGDGAGVTG WQKLAYGSST RWYYFNGDAT MATGWKKIRY
AGADTWFYFG GDGAMRTGTQ TIGGKTYRFA SNGAWIG