Gene Snas_4892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4892 
Symbol 
ID8886099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5195535 
End bp5198804 
Gene Length3270 bp 
Protein Length1089 aa 
Translation table11 
GC content69% 
IMG OID 
Productpeptidase S41 
Protein accessionYP_003513626 
Protein GI291302348 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCG GTTACCCGCG TTTTCCCGCC ATTCACAACG ACACCATCGT CTTCGTCGCC 
GAGGACGACC TGTGGAGCGT CACGACGGCC GGCGGCACCG CCCGGCGCCT GACCGCGGGG
GTCGCGGCGG CAAGCTTCCC GCGGATCTCG CCGGACGGTT CGCTCATCGC CTACACCGGT
GCCGAGGAGG GTCCGCCGGA CGTGCACGTG GTGCCCACCG AGGGCGGCGA GACCCGGCGG
CTGACCTACG AGGGCGGCCG GATCAGCCGG GTCGCCGGTT GGGACCCCAC CGGCGAGTTC
GTCATCTACG CCACCAGCGC GCACTCGCCG GAACTGCTGG ACGTCCGGCT GCGCCGGGTG
TCCGCCGACG GCGGCGTGCC CGCCGAGCTG ACCTGGGGCC GCGCCACCGC GATCGCCCAA
GGGTCGAACG GCGTCGTGGT GCTGGGCCGC GAGTACTGGA AGGACCACGC CCACTGGAAG
CGCTACCGGG GCGGCACCGC CGGTCAACTG TGGATCGACG CGTCCGGTGA CGGCGAGTTC
CGCAAGCTGG TCTCCTTGGA CGGCAACCTG TCCTGCCCGC ACATCATCGG CGACCGCGTC
TACTTCCTGT CCGACCACGA AGGACACGGC AACGTCTACT CCACCGACTT CGACGGCGCG
AACCTGCGCC GCCACACCGA CCACGAGGAC TTCTACGCCC GCGGCCTGTC CGGTGACGGC
GCCAGGCTGG TGTACCACTG CGGCGGCAGG CTCTACCTGC TCGACCCGGC CGAGGAGCAC
CCCAGCGTCG TCGACGTCCA CATCCCGGTG ACCCGCACCC AGCGCGCCCG CCGCTTCGTC
GACGCCGCCG AGTACCTGGA CTCGGTGGAC CTGTCCGCCG ACGCCTCCCA CCTGGCCGTC
ACCACCCGGG GCAAGGCGTT CACCTTCGCC GACTGGGAGG GACCGGTCAC GCAGCACGGC
GAGGTCGACG GCGTCCGGTA CCGGCTGCTG ACCTGGCTGC ACGACCGCGA ACGCCTGATC
GCGGCGGTCG CCGACAACGG CCCCCGCGAG GTGCTCGTCG GCATCACCGC CGACGCCTCC
AAACCGCCGT GCCGCCTCGA CCACCTCGAC ACCGGCCGGG CCGGGGAACT GGTGGCCTCC
CCGACCGAGG GCAAGGTCGT CATCGCCAAC CACCGCAACG AACTGGTACT GGTCGATGTG
GACGGCGAGG AGGCCACGGC CACGGTCATT GACTCCAGCC GCCACGGCGA GATCTCCGAC
GTGGTGTTCT CCGCCGATGG CACCTGGCTG GCCTACGCCT GCCCCGAGTC CTCCGGCACC
GACAACGAGG ACGAGAACGT CGCCCGCTCG ACCATCAAGC TCCTCGAACT GTCCACCGGT
CGCAAAGCCG TGGCCGCCAA GCGAATCCTC AACGACTTCG GCCCGTCCTT CGACCCGGAC
GGCAAGTACC TGTACTTCGT CGGACAGCGC GAGTTCAACC CCGTCTACGA CTCGCTCCAC
TTCGACCTGA ACTTCCCGAT GGGCTCGCGC CCCTACGCGG TCGCGCTGCG CGCCGACGTC
TCCCCGCCCT TCGTCCCGCA ACCCAAACCG ATGCACGACA CCGGCGAGGA GTCCACCAAG
GACTCCGGAG ACGACGAGTC CAAAACCGAC TCCGACGACA CCACCGACGA GAGCCTCGTC
ATCGACCTGG ACGGCATCGA GAACCGCATC GTGCCGCTGC CGGTCTCCGA CGCCAAGTAC
ACCCAGGTCC TGGGCGTCAG CGGCAAGGTG CTGGTCCTGT CCCACCCGGT CGCGGGCCGC
ATCCGTCCAC ACCAGGTGGA CGACGAACCC GACGGCGTCC TGGACTCCGT CGACCTGGAG
ACCGGCAAGG TCGAGCGCTT CGCCGACGCG GTCAGCTGGG TGATGAGCAC CCCCGACGGC
AAGACCGTCC TGTACATGTC GGGCGACAAG ATGCGCGTCG TCAAGGCCAC CGAGAAGGCC
CCCGACGGCG CCGACCGCAA CCGCGAATCC GGCTGGATCG ACCTAGACCG GCTCAAGGTA
TCGGTACGCC CCGAACTGGA ATGGCCGCAG ATGTTCCGCG AGGCCTGGCG ACTGCTGAGC
GAGAACTTCT GGGTCGAGGA CATGTCCGGA GTGGACTGGA ACGCGATCTA CGAACGCTAC
GCCCCCCTGG TCGCCCAGCT CTCCACCCGT GGCGAACTGT CCGACCTGAT CTGGGAGATG
AACGGCGAAC TGGGCACCTC CCACGTCTAC GAAGCGCTCG GCGACTACCG TCCCGGCCCC
CACTACGGCC AGGGCTACCT CGGCGCCGAC TTCACAGTAG ACACCGACGG CGCCCACACG
ATCGCCAAGA TCTACACCGG CGACCCCTGG AAACCCGACG CCACCTCCTC CCTGCTGCGC
CCCGGCGTCG ACGCCCGCGT CGGCGACACG GTCGTCGCCG TCAACGGCCA ACCCGTCGGA
CCCACCACCT CGGTGGCCCA GCTCCTGGTC AACCAGGCCG ACCAGGAAGT CCGCCTCTCG
CTGCGCCGTG GCGAAGCCGA CCCCCACGTG GTCGTCGTCC GCGCCCTGTC CAACGAACAA
CCGCTGCGCT ACCGCGACTG GGTGGAAGCC AACCGCCGCG CCGTCTACGA CGCCAGCGGC
GGCCGACTGG GCTACATCCA CATCCCCGAC ATGATGACCG AGGGCTTCGC CGAATTCCAC
AGAGGATTCC TCAACGAGTA CGACCGCGAC GGTCTCGTCG TCGACGTCCG CTTCAACGGC
GGCGGCCACG TCTCCCCACT CCTGCTCGAA AAACTCGCGC GCCGCCGCAT CGGCTACAAC
TTCTCCCGCT GGAGCCGCCC CGCCCCCTAC CCCCGCGAAA GCCCGTGCGG AGCGATGGTC
GCGCTCATCA ACGAATCCGC CGGATCCGAC GGCGACATCT TCAGCCACGG CTTCCGCTCC
TACAACCTCG GCCCGCTGGT CGGCACCCGC ACCTGGGGCG GAGTCGTGGG CTACTTCCCG
TGGCGCCCCA ACCTGGCCGA CAACACCTTC CTGTCCCAAC CCGAGATCGC CTTCCACTTC
GACGACGCCA AATGGGGCGT CGAGAACTAC GGCGTCGCCC CCGACATCGA GGTCGAGTAC
GCGCCACAGG ACTACGCCGC CGGACGCGAC CCCCAACTGG AGGCGGGCAT CGCCGCGGCC
CTGAAGGAAC TGGAGAAGAA ACCCGCGCAC CGACCCGATC CGGCGGACCG TCCCAAGCTC
GCCGCCCCGA AGTTGCCGCC CCGTCCGTAA
 
Protein sequence
MTTGYPRFPA IHNDTIVFVA EDDLWSVTTA GGTARRLTAG VAAASFPRIS PDGSLIAYTG 
AEEGPPDVHV VPTEGGETRR LTYEGGRISR VAGWDPTGEF VIYATSAHSP ELLDVRLRRV
SADGGVPAEL TWGRATAIAQ GSNGVVVLGR EYWKDHAHWK RYRGGTAGQL WIDASGDGEF
RKLVSLDGNL SCPHIIGDRV YFLSDHEGHG NVYSTDFDGA NLRRHTDHED FYARGLSGDG
ARLVYHCGGR LYLLDPAEEH PSVVDVHIPV TRTQRARRFV DAAEYLDSVD LSADASHLAV
TTRGKAFTFA DWEGPVTQHG EVDGVRYRLL TWLHDRERLI AAVADNGPRE VLVGITADAS
KPPCRLDHLD TGRAGELVAS PTEGKVVIAN HRNELVLVDV DGEEATATVI DSSRHGEISD
VVFSADGTWL AYACPESSGT DNEDENVARS TIKLLELSTG RKAVAAKRIL NDFGPSFDPD
GKYLYFVGQR EFNPVYDSLH FDLNFPMGSR PYAVALRADV SPPFVPQPKP MHDTGEESTK
DSGDDESKTD SDDTTDESLV IDLDGIENRI VPLPVSDAKY TQVLGVSGKV LVLSHPVAGR
IRPHQVDDEP DGVLDSVDLE TGKVERFADA VSWVMSTPDG KTVLYMSGDK MRVVKATEKA
PDGADRNRES GWIDLDRLKV SVRPELEWPQ MFREAWRLLS ENFWVEDMSG VDWNAIYERY
APLVAQLSTR GELSDLIWEM NGELGTSHVY EALGDYRPGP HYGQGYLGAD FTVDTDGAHT
IAKIYTGDPW KPDATSSLLR PGVDARVGDT VVAVNGQPVG PTTSVAQLLV NQADQEVRLS
LRRGEADPHV VVVRALSNEQ PLRYRDWVEA NRRAVYDASG GRLGYIHIPD MMTEGFAEFH
RGFLNEYDRD GLVVDVRFNG GGHVSPLLLE KLARRRIGYN FSRWSRPAPY PRESPCGAMV
ALINESAGSD GDIFSHGFRS YNLGPLVGTR TWGGVVGYFP WRPNLADNTF LSQPEIAFHF
DDAKWGVENY GVAPDIEVEY APQDYAAGRD PQLEAGIAAA LKELEKKPAH RPDPADRPKL
AAPKLPPRP