Gene Snas_2028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_2028 
Symbol 
ID8883221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp2148218 
End bp2149513 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content70% 
IMG OID 
ProductN-6 adenine-specific DNA methylase 
Protein accessionYP_003510816 
Protein GI291299538 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.144009 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGG GGGCGCGGCG GCCGCGATCG CCGCTGTACG ACCATCTGCG CGACAACTTC 
TGGTTCGCGC CGCTGGTCGC CCTGGCGGGC GCGGTGGTGG GTGCCCAGCT GGCGGTCAGA
CTGGACGAGT TCGTGATCGA GCTGGCCGAC ACCTGGCGCG ACACCGAGCT GCTGTACCTG
CTCCAGGCGG TCAACAAGTC GACTCGCGGC ATCATCTCCT CGGTCACCGG CGCGATGCTG
ACCTTCGTCG GCGTGGTGTT CTCGATCTCG CTGGTGGCGT TGCAGATGGC CTCCAGCCAG
TTCTCGCCGC GGGTGCTGCG GCTGTACATC CGCAGCCGGA TCACCAAGGC CACGCTGTCG
GTGGGTCTGG CGACCTTCCT GTTCTCGCTG CTGGTGCAGC TGGGTTTCGA CGACTCCGAC
ATCACCACCA CCGCCTCGGT GCCGCTGTTC TCCAGCCTCG GCTCGGTGGC GCTGGTCGTC
ACCAGCCTGG TGCTGTTCGT CTTCTACGTC AACGCGACGC TGCGACTGCT GCGGGTCAAC
CACGTCCTGG CCGAACTGGC CACCGAGACG CTGCGGGTCA TCGCCGCCCG GCGGTTCGAG
CCCCGGGACC ACGCCTTCGA CGCCGAGGTC GCCGCGACGG TGCGGTTCAC CGGAGGCCGC
TCCGGCGTGC TGCGCGACGT CAACCTGCCC CGGCTGATCC GGTTGGCGCG CAAGCACGAC
ACCGTGATCG AGGTGGTGCC GAAGGTGGGG GACTTCCTCA CCACCGGCAC CCCCGCGATC
AAAGTCCACG GTGGCCGGAC CCCGGAACTG TGGCGGGTGC GCGGCTGCCT GAGCGTCGGC
AGCGAACGCA GTCCGCGCCA GGACGTCGGC TTCGGGGTGC GGCAGATCGC CGACATCGGG
ATCCGGGCGC TGTCCCCGGC CGTCAACGAT CCGACCACGG CCGTGGCCGC GATCGACCGG
TTGTTGCAGA TCCTGGCCGG GCTGGTGTCC CGTCCGGACA GTCACTCCTG GTACCGCGAC
CGGGCCGGGC GGCTGCGGCT CATGGTCCCC GAACCGAGCG TGGCGGGCCT GTTGGACACG
GCGTTCACCG AGTTCCGGGT CTACGGGGCC GGATCGCCGC AGGTCACCCG ACGACTGTTG
TCCGCTTTGG ACGATCTGGC GGCCATCGCG ATCGACGCGC ACCGCCCCGC GATCCGGCGG
CACCGGCGAC TGCTGATGAC CGCCGTCGCG GCCACCACGA GCCGCTCCGA CGAACGCGAG
TTCGCGCTGA CCCCCGACCG GCAGGGCATC GGATAG
 
Protein sequence
MAEGARRPRS PLYDHLRDNF WFAPLVALAG AVVGAQLAVR LDEFVIELAD TWRDTELLYL 
LQAVNKSTRG IISSVTGAML TFVGVVFSIS LVALQMASSQ FSPRVLRLYI RSRITKATLS
VGLATFLFSL LVQLGFDDSD ITTTASVPLF SSLGSVALVV TSLVLFVFYV NATLRLLRVN
HVLAELATET LRVIAARRFE PRDHAFDAEV AATVRFTGGR SGVLRDVNLP RLIRLARKHD
TVIEVVPKVG DFLTTGTPAI KVHGGRTPEL WRVRGCLSVG SERSPRQDVG FGVRQIADIG
IRALSPAVND PTTAVAAIDR LLQILAGLVS RPDSHSWYRD RAGRLRLMVP EPSVAGLLDT
AFTEFRVYGA GSPQVTRRLL SALDDLAAIA IDAHRPAIRR HRRLLMTAVA ATTSRSDERE
FALTPDRQGI G