Gene Snas_6094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_6094 
Symbol 
ID8887315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp6452913 
End bp6454562 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content69% 
IMG OID 
Producttype III restriction protein res subunit 
Protein accessionYP_003514811 
Protein GI291303533 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACG GGCCGTTGAT CGTCCAAAGT GATAAAACTC TCCTGCTCGA AACCGGCCAC 
CCGCTGGCCG CCGAGTGCCG GGTGGCCATC GCGCCGTTCG CGGAGCTGGA ACGCGCGCCC
GAACACATCC ACACCTACCG GCTGACGCCG CTGGGGCTGT GGAACGCGCG GGCCGCCGGA
CACGACGCCG AGGGGGTCGT GGACGCGCTG CTCAAATACG CCCGCTACCC GGTGCCGCAC
TCGCTGCTGC TGGACCTCAC CGAGACCATG GACCGCTACG GGCGGCTGCG GCTGCTCAAG
CACCCCGCGC ACGGGCTGGT GCTGCACGGA CTCGACCCGG CGGTGCTGGC CGAGGTGGCG
GGCAGCAAGA AGCTGGCCGG GATGCTGGGT ACCCGCATCG AGGAAGACAC CATCGTCGTG
CACGCCAGCG AGCGCGGTCG TCTGAAGCAG GCGCTGCTGA AACTGGGCTG GCCCGCCGAG
GACAACGCGG GCTATGTGGA CGGTGAAGCG CACCCCATCG GGCTCAAGGA GTCCGGCTGG
CAGCTGCGTC CGTACCAGAA GGAAGCGGTG GAGTCGTTCT GGTCGGGCGG TTCCGGGGTG
GTCGTGCTGC CCTGCGGCGC GGGCAAGACC CTGGTGGGCG CCGCCGTCAT GGCCCAGGCG
CAGAAGACGA CGCTGATCCT GGTCACCAAC ACCGTCTCGG TGCACCAGTG GCGGCGGGAA
CTGTTGGCGC GCACCACCTT GACCGAGGAC GAGATCGGCG AGTACTCCGG CGAGCGCAAG
GAGATCCGGC CGGTCACCAT CGCCACGTAC CAGGTGATGA CGGCTCGCAG CAAGGGCGAG
TTCCGGCACC TGGACCTGTT CGACGCCCGC GACTGGGGCC TGATCGTCTA CGACGAGGTG
CACCTGCTGC CCGCCCCGAT CTTCCGGTTC TCCGCCGACC TGCAGACCCG CCGCCGTCTC
GGTCTGACCG CGACCCTGGT GCGTGAGGAC GGCCGGGAGG CCGACGTGTT CTCCCTGATC
GGCCCGAAGC GTTACGACGC CCCCTGGCGC GACGTCGAGT CGCAGGGCTG GATCGCCCCG
GCCGAGTGCA CCGAGGTGCG GGTGACGCTC ACCGACGCCG AGCGGATGGC CTACGCGGTG
TGCGAGGAGA CCGACCGCTA CCGTGCCGCG GCCACCATGG ACGCGAAGCT CGACGCCGTC
GAGTCGATCG TCGGCAAACA CAAAGGCGAA CGGGTGCTCG TCATCGGCGC CTACCTCGAC
CAGCTGGAGG ACCTGTCCAA ACACCTGGAC GCCCCCGTCG TGCAGGGCTC GACCCGCACC
AAACAGCGCG AGGAACTGTT CGCGGCGTTC CGCTCGGGCG AACTGACGAC CCTCATCGTG
TCCAAGGTGG GCAACTTCTC GATCGACCTG CCGGAGGCGG CGGTGGCCAT CCAGGTCTCG
GGAACCTTCG GTTCCCGGCA GGAGGAGGCG CAGCGACTGG GACGGATCCT GCGGCCCAAA
TCGGACGGAC GCGGCGCGCA CTTCTACACC GTGGTCTCCC GCGACACCGT CGACACCGAG
TACGCCGCGC ACCGGCAGCG GTTTCTCGCC GAGCAGGGCT ACGCCTACCG GATCGTCGAC
GCCGAGGACC TGCGCGGCCG GGACAGCTGA
 
Protein sequence
MNDGPLIVQS DKTLLLETGH PLAAECRVAI APFAELERAP EHIHTYRLTP LGLWNARAAG 
HDAEGVVDAL LKYARYPVPH SLLLDLTETM DRYGRLRLLK HPAHGLVLHG LDPAVLAEVA
GSKKLAGMLG TRIEEDTIVV HASERGRLKQ ALLKLGWPAE DNAGYVDGEA HPIGLKESGW
QLRPYQKEAV ESFWSGGSGV VVLPCGAGKT LVGAAVMAQA QKTTLILVTN TVSVHQWRRE
LLARTTLTED EIGEYSGERK EIRPVTIATY QVMTARSKGE FRHLDLFDAR DWGLIVYDEV
HLLPAPIFRF SADLQTRRRL GLTATLVRED GREADVFSLI GPKRYDAPWR DVESQGWIAP
AECTEVRVTL TDAERMAYAV CEETDRYRAA ATMDAKLDAV ESIVGKHKGE RVLVIGAYLD
QLEDLSKHLD APVVQGSTRT KQREELFAAF RSGELTTLIV SKVGNFSIDL PEAAVAIQVS
GTFGSRQEEA QRLGRILRPK SDGRGAHFYT VVSRDTVDTE YAAHRQRFLA EQGYAYRIVD
AEDLRGRDS