Gene Snas_5347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5347 
Symbol 
ID8886556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5680466 
End bp5682784 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content71% 
IMG OID 
Productpeptidase M28 
Protein accessionYP_003514074 
Protein GI291302796 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.886373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.171469 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGGTG CGTTCATCCC CGACGGCCGC CGCCCGCTGG CCGCCGTCGC GGCGTTGCTG 
GTGCTGGCCG TGGCCGTCAC CGTGAACCTG ATCGTCGACG CGTCCAGCCC GGCACCCGCC
GACCCCGGCA GGACCGAGTT CAGCGCCGAA CGCGCCCGCG ACGTCCTGGA GGACATCGCG
ACCAAACCGC GTCCGCTGGG CAGCGAGGAG AGCGACCGGG TCCGCGACGA CCTCGCCGAC
AAGCTGCGCG AACTCGACTA CGACGTCGAC GTGACCGAGG ACGTCGGCGG CGAGGCACGG
GACAACGAGG TCGTGTTCGG CCGCGTCGAC AACGTCGTGG CCACACTGCC GGGCACCGAC
CCGACCGGGC GGGTGCTGCT GGTGTCGCAC TACGACTCCG TGGCCGCCGG TCCCGGCGCG
GGCGACGCGG GCACCCCGAC CGCCGCGGTC CTGGAGACCG CCCGCGCGCT GGCGGCGGGA
CCCAAGCCGC GCAACGACAT CGTGGTGCTG CTGACCGACG GCGAGGAGAC CGGCCTGCTG
GGCGCCGACG CGTACGCCCG CGAGCACCCG TCCAAGGGCA ACGACGTGGT GCTGAACTGG
GAGGCGCGCG GCACCGACGG CCCGTCGCTC ATGTTCGAGA CCTCCACCGG CAACTCCCGG
CTCATCGACG TCTACGCCGA TTCCGCGCCC CACACCACCG GCGATTCCTC CATGGTGGAG
GTATACCGGC ACATGCCCAA CGACACCGAC TTCACGAACT TCAGCGCCGC CGGGTACTCC
GGGCTGAACT CGGCGAACAT CGGCTCCCCC GCGTGGTACC ACACCCCGGG CGACTCACTG
GACCATGTGG ACCCCGCGAC CATGCAGCAC CACGGCGCGA ACATGCTCGG CCTCGCGGCA
GCGTTCGGCG ACACCGATCT GGCCACCATC CAGTCGGACT CCGACACCGT CTACTTCCAC
TTCCTCGGCC TGTTCGTGAG TTACACGCCC ACCTGGGGGA TCGTGCTGTC GGTGCTGAGC
CTGGGGCTGC TGGTGGCGGC CGTCGCCATC GCCCGGCGCC GCGACCTGCT CAGCCTGCCC
CGGTTCGGCA TCGCGGTCGC CACGATCCCG GTGGCACTGG CGGTTTCGGT GGCGCTGGCG
CAGGCGATGT GGATCGCGAT GACCCTGTCG CGGTCCGGGG TGGCCGACAC CCGCGGCATG
CTGCACGCCC CCGCCCCGTT CGTGTTCGCG ACCGGGCTGC TGGCGGTCGC GGCCGTGTTG
GCGTGGTACC TGACGCTGCG TCGGCGACTG GGCCCGGCCG CGCTGGCTTT GGGGGCGCTG
ACCTGGTTGG CGCTGTTGGG ATCGGTGCTG GCGTTCGTGG CTCCCGGGGC CGCGTTCCGG
CTGGCGATCC CCGCGCTGTT CGCCGCGGCG GGCATCGTCG GCGCGCTGCT GCTGGCACCG
CGTGCACCGA TGTGGACGGC GGTGGCCGCG CTGGCCGTCG GCATGGTCCC GGTCGCGGTC
ATCCTGGTGA GCACCGGCAG CAGCCTCGCC GAGGCGTTCG GGCTGTCGAT GGCGGGAGCG
CCCGCGGTGC TGTTCGCGTT GGCGGGGACG GTTTCGGTGC CACTGCTCGA GCTGTGGCTG
CCCTCTCCCG ATCGGCCGAG GCATCGTTTC GGCTGGGCGG TACCGGTCGG CGCCGCCGTC
GTGGCCGTCG TGGTCTGCGC GACGGGCCTG GCCGTCACCG GGTTCGACGA GGAGGAACCG
CGTCCCACCT ATCTGGCCTA TGTGCTCAAC GCCGACACCG GCAAGGCGAC CTGGGTGACC
CAGGACACCG ACACGTCACA GTGGACTTCC GAGTTCATCT CCACGGCCGG TGACCCCGAC
CGGGTGCCCG ACGGGTACCG GGAACCGAAG GTACCGATCA ACAACTGGGG CCGGGCACCC
GACGTCGACC TGCCAGCGCC GACCGCGACG GTGACCCGCG TCGGGGACAA GCTGCGGGTC
CACATGGAGT CCAAGCGCGG GGCGGACAAT CTGACCCTGC GCACGGTCGG TGCCGTCACG
AAGGTGACGG CCTGGGTGCC AGACGACAAG ACCGTCACCA AGAACCTGAC GGTCACGGAG
TCCGGGCCGT GGGGCAGTTC GATCAGCTTC CGCGATCCGC CGAAGGGCGG AGTCGACGTC
ACGCTGACCT TCGACGGGAA GGTCGAGCCG GAACTGATGC TCTACGACCG GTCCGACGGG
CTGGACGACA TCCCCGGCTA CCGGGACCGC CCCGACGACG AGATGCGCTC CCCGATCCGC
AGCAGCGACA CGGTAACCGT CGTGACCACG GTCGACTAG
 
Protein sequence
MRGAFIPDGR RPLAAVAALL VLAVAVTVNL IVDASSPAPA DPGRTEFSAE RARDVLEDIA 
TKPRPLGSEE SDRVRDDLAD KLRELDYDVD VTEDVGGEAR DNEVVFGRVD NVVATLPGTD
PTGRVLLVSH YDSVAAGPGA GDAGTPTAAV LETARALAAG PKPRNDIVVL LTDGEETGLL
GADAYAREHP SKGNDVVLNW EARGTDGPSL MFETSTGNSR LIDVYADSAP HTTGDSSMVE
VYRHMPNDTD FTNFSAAGYS GLNSANIGSP AWYHTPGDSL DHVDPATMQH HGANMLGLAA
AFGDTDLATI QSDSDTVYFH FLGLFVSYTP TWGIVLSVLS LGLLVAAVAI ARRRDLLSLP
RFGIAVATIP VALAVSVALA QAMWIAMTLS RSGVADTRGM LHAPAPFVFA TGLLAVAAVL
AWYLTLRRRL GPAALALGAL TWLALLGSVL AFVAPGAAFR LAIPALFAAA GIVGALLLAP
RAPMWTAVAA LAVGMVPVAV ILVSTGSSLA EAFGLSMAGA PAVLFALAGT VSVPLLELWL
PSPDRPRHRF GWAVPVGAAV VAVVVCATGL AVTGFDEEEP RPTYLAYVLN ADTGKATWVT
QDTDTSQWTS EFISTAGDPD RVPDGYREPK VPINNWGRAP DVDLPAPTAT VTRVGDKLRV
HMESKRGADN LTLRTVGAVT KVTAWVPDDK TVTKNLTVTE SGPWGSSISF RDPPKGGVDV
TLTFDGKVEP ELMLYDRSDG LDDIPGYRDR PDDEMRSPIR SSDTVTVVTT VD