Gene Snas_4538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4538 
Symbol 
ID8885743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp4842209 
End bp4843906 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content71% 
IMG OID 
Producttype III restriction protein res subunit 
Protein accessionYP_003513276 
Protein GI291301998 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.179492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTGA TCGACCGGGA CTCATTGCCT CCATTGCGGA CCTGGCAGCG CAAGGCCATG 
GTCGCGTACC ACCGCAACGC CGCCGAGGAC TTCCTCGCCG TGGCCACCCC CGGAGCCGGG
AAGACCACCT TCGCGCTGCG GATCGCCCTG GACCTGCTGG ACAGCGGCAT CGTGTCGCAG
GTGACGGTCG TGGCACCCAC CGAGCACCTG AAGACACAGT GGTCGCAGGC CGCGGCGAGG
GTGGGCATCA ACCTGGACCC CACCTTCCGC AACGCCGACG TCTTCCTGTC CAACGACTTC
CACGGCGCCG TGGTCACCTA CGCCCAGGTC GGCGCCGACC CGTCGGTCCA CAAACGCCGG
ACCCTGGCCA GGCCCACCCT GGTGATCCTG GACGAGATCC ACCACGCCGG TGACGCCCGC
TCCTGGGGCG AGGGCGTGCA GGCCGCGTTC ACGCCCGCCG TGCGCCGTCT GGCCCTGACC
GGTACCCCGT TCCGCTCCGA CGACAACCCG ATCCCGTTCG TGTCCTACGA GCACGGCGCC
GACGGCAGTC TGCGGTCCCG CCCCGACTCG GTCTACGGCT ACACCCACGC CCTGACCGAC
GGCGTGGTCC GCCCCGTCAT GTTCATCGCC TACTCCGGCC AGGCCCGCTG GCGCTCCAGC
GCGGGCGAGG AGCTGTCGGT GCGGCTGGGC GAACCGCTCA CCAAGGACCT CACCGCGCAG
GCCTGGCGCA CCGCCCTGGA CCCGCTGGGG GAGTGGATCC CGCAGGTGCT GCGGGCCGCC
GACTCGCGCC TCAACGCGCT GCGGGCCGCC GGAATGCCCG ACGCGGGCGG CCTGGTCATC
GCCTCCGACC AGAACTCGGC CCGCGCCTAC TCCAAGATCC TGGGCACCAT CACCGGTGAA
CGTCCCGCCC TGGTCCTTTC CGACGACGCC GGATCCAGCG ACCGGATCGC GAAGTTCACC
GAATCCGACG AACGCTGGAT GATCGCCGTG CGCATGGTCA GCGAGGGTGT CGACATCCCG
CGGCTCGCGG TGGGCGTCTA CGCCACCAAC GCCCACACCC CGCTGTTCTT CGCCCAGGCC
ATCGGACGCT TCGTGCGGTC CCGCCGCCGC GGCGAGACCG CCTCGGTGTT CGTGCCCAGT
GTCGCGCCCA TCCTGGAGCT GGCCGCCCAA CTGGAGGCCG ACCGCGACCA CGTCATCGCC
CCCGAGACCG ACACCGACCC CGAGGACCTG CTCGCCGAGG CGCAGCGCCA ACGCGACGAA
CCCGACGGTC TGCGCGGCGA ACGCGAGACC CTCGCGGCGA CCGCCGAGCT CGACCAGGTC
ATCTTCGACG GGGCCTCCTT CGGCACCCCG ATCCAGCCCG GCTCCCCCGA GGAGGAGGAG
TTCCTCGGCC TGCCGGGCCT GCTCACCCCC GAGCAGGTCT CGATGCTGTT GCAGAAACGC
CAGGCCGACC AACTGTCCAC CGCCCGCCCG GGCGCTCAGG CCAGCGTCAA GACCGAGACC
CGGCCGCTGT CGGCGGCCGA ACGCCGCATC GCGTTGCGCA AGCAGCTCAA CGCCCTGGTT
TCGGCCCAGC ACCACGCCAC CAACGTCCCG CACGGCAAGA TCCACGCCGA ACTGCGCCGC
CGCTGCGGCG GTCCCCCCAG CGCCCAGGCC ACCATCGAAC AACTCGAACA GCGCATCGAG
GCCATCAAGG ACCTCTGA
 
Protein sequence
MPVIDRDSLP PLRTWQRKAM VAYHRNAAED FLAVATPGAG KTTFALRIAL DLLDSGIVSQ 
VTVVAPTEHL KTQWSQAAAR VGINLDPTFR NADVFLSNDF HGAVVTYAQV GADPSVHKRR
TLARPTLVIL DEIHHAGDAR SWGEGVQAAF TPAVRRLALT GTPFRSDDNP IPFVSYEHGA
DGSLRSRPDS VYGYTHALTD GVVRPVMFIA YSGQARWRSS AGEELSVRLG EPLTKDLTAQ
AWRTALDPLG EWIPQVLRAA DSRLNALRAA GMPDAGGLVI ASDQNSARAY SKILGTITGE
RPALVLSDDA GSSDRIAKFT ESDERWMIAV RMVSEGVDIP RLAVGVYATN AHTPLFFAQA
IGRFVRSRRR GETASVFVPS VAPILELAAQ LEADRDHVIA PETDTDPEDL LAEAQRQRDE
PDGLRGERET LAATAELDQV IFDGASFGTP IQPGSPEEEE FLGLPGLLTP EQVSMLLQKR
QADQLSTARP GAQASVKTET RPLSAAERRI ALRKQLNALV SAQHHATNVP HGKIHAELRR
RCGGPPSAQA TIEQLEQRIE AIKDL