Gene Snas_1007 details

Gene Information

Locus tag: Snas_1007
Symbol:
ID: 8882192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 1068396
End bp: 1070405
Gene Length: 2010 bp
Protein Length: 669 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: glycosyl transferase group 1
Protein accession: YP_003509810
Protein GI: 291298532
COG category:
COG ID:
TIGRFAM ID:
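
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are inclusive and that the coding sequence ends with one stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 1068396
end_bp = 1070405

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, plus one terminal stop codon
# that does not contribute a residue to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "2010 669"
```

Both results agree with the Gene Length (2010 bp) and Protein Length (669 aa) fields.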


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.192205
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.137378
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATCA CTTTCTTGAC GCACTCGGTT GACCAGATCG GCGGCACGAT CCGCGCGACA 
CTGAACACGG CGTCGATACT GACCGACCTG GGGCACGAGG TCGACATCGT CAACGTCTTC
AAGTACCGCG ACGCTCCACA GTTCACGGCC GATCCGCGGG TGACGATGCG CAGCCTCATC
GACAAGACCG CGCCGCGGTC GCTCACCTCG AAGGTGGCGG GCATCCCGGA CCGGTTTCGG
ATGCGGATGA GCTCGAGGTT CTACCCGTCG GGCGACACCC GGGCCAAGCG TTTCAACCGG
CTCACCGACA AGCGGGTGCA GGAGTTCCTG CGTGACTGCG AGGCGGACGT CATCGTCGGT
ACCCGTCCCG GTCTCAACAT CTACCTGGCG CGGTTCTCGC CGCCCAAGGC GGTCACGGTC
GCGCAGGAGC ACCTGTTCTA CGACCACCAC AAGCAGCCGC TGCGTGATGC CATGGCGCGC
GACTTCGGCC AGCTGGACGC CGTGGTCACC GTCAGCCAGG CCGACGCGGA CAACTACCGG
CGCCACATGC CGCACCTGGC CGACAAGGTC TGGTTCATCC CCAACTCGAT CCAGCCGACC
CCGATCCCGC CGTCCGACGT GGACTCCAAG ATCATCGTCG CGGCCGGACG CATCGAACGC
CCCAAGCGTT TCGACATGTT GCTGCGCGTC TTCTCCAAGG TCCACAAGCG GCACCCGGAC
TGGCGGCTGC GCATCTACGG CAGCGGCAAG CGCATCAACG AGATCCGCGA CGTCGTCACC
GATCTGGACC TGGGCGACTC GGTGTCGCTC ATGGGCCGCG CCACCCCGCT GGACACCGAG
TGGGCCAAGG GTTCCATCGC CGCGGTCACC TCCAAGTACG AGTCCTTCGG CCTCACCCTC
GTCGAGGCCA TGAACTGCGG TCTGCCGGTC GTGTCGACGG CCTGCGACTA CGGCCCGCCG
GAGATCATCG ACCACGAGGT CGACGGCCTG CTGACGCCCG TCAAGGACGA GAACGCCGTC
GCCGAGGCGC TGTGCCGTCT CATCGAGGAC GAACGGCTGC GCAAACGCAT GTCCTCCAAC
GCGATCCGCA AGGCCCGCAA GTACCACCCG GACGACATCG GTGCCCGCTA CGAGCAGCTG
TTCGACAGCC TGGTCTCGCA GAAGGACCGC ACCCCGCCGC ACGGGGCCCG CATCCCCGTC
GGCGCCGGGG TCGGCGGCTA CGGCCTGCCC GACAAACCCG CGTCCTCGAC CTACGAACTG
GCCGACCACT CCGTCGACTG CCTGGTCAAG TCCTTCGAGG ACATGACCCT GACCGCCGAA
CTGGGTTTCA GCGGCCGTTA CACGCTCGAA TCCAGCGAAC ACCCGCCCAT CGAGGTCCCG
CTGGGCGCCG AGCTGCGGCT CGACCCGCCG TTCCTGAGCC GACTGCCCGA GGGCCGGTGG
CGCCTGTTCC GCGACGGCGA CCTCGTCGAG GCGGGTCACA TCGACTCGCG CGCCCTGCTG
CACCGGCCCT CGGTCCTGCC GGGTTCGGTC GTGGTCCCCT ATTCCTCCCG AGGCCAACTG
GCCCTTCGGG TCTGGCGTCG CGAAACCTAC GCCGAACTCA ACTCGGTGCG ATGGCACGAC
GGTCACCTCC TGCTGCACGG CGACGTACTC GGCCCACTGT GGGGCGACAC GCCCATCCAC
ATCCGGGGCC GACTGCGCGA AACCGAGGGC CCGGCCCAAC GCTGGGCCGC CGACATCGAC
GCGTCCGGCG CCTTCTTCGC CCGGCTCGAC GTGGACCGGC TCACCGACAT CCGCGTCGAT
CGCAAGGACC TGTGGGACCT GTGGCTGGCC GACGACAACG GCGAACACGC CCCGGTCCGG
CTGGCTCGGT TCTTCGACGA CATCGCCAAG CGCAAGAAGA CCCAGGCCTT CTCCCGCAAG
ACCGTCACCG ACGACAACGG GCGGCACCAC ATCCAGCCGT ACTACAACAC CCACAACGAA
CTGACGCTGA AGGTCACCGA GGACGGCTGA
 
Protein sequence
MKITFLTHSV DQIGGTIRAT LNTASILTDL GHEVDIVNVF KYRDAPQFTA DPRVTMRSLI 
DKTAPRSLTS KVAGIPDRFR MRMSSRFYPS GDTRAKRFNR LTDKRVQEFL RDCEADVIVG
TRPGLNIYLA RFSPPKAVTV AQEHLFYDHH KQPLRDAMAR DFGQLDAVVT VSQADADNYR
RHMPHLADKV WFIPNSIQPT PIPPSDVDSK IIVAAGRIER PKRFDMLLRV FSKVHKRHPD
WRLRIYGSGK RINEIRDVVT DLDLGDSVSL MGRATPLDTE WAKGSIAAVT SKYESFGLTL
VEAMNCGLPV VSTACDYGPP EIIDHEVDGL LTPVKDENAV AEALCRLIED ERLRKRMSSN
AIRKARKYHP DDIGARYEQL FDSLVSQKDR TPPHGARIPV GAGVGGYGLP DKPASSTYEL
ADHSVDCLVK SFEDMTLTAE LGFSGRYTLE SSEHPPIEVP LGAELRLDPP FLSRLPEGRW
RLFRDGDLVE AGHIDSRALL HRPSVLPGSV VVPYSSRGQL ALRVWRRETY AELNSVRWHD
GHLLLHGDVL GPLWGDTPIH IRGRLRETEG PAQRWAADID ASGAFFARLD VDRLTDIRVD
RKDLWDLWLA DDNGEHAPVR LARFFDDIAK RKKTQAFSRK TVTDDNGRHH IQPYYNTHNE
LTLKVTEDG
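
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough illustration of how the two relate, the sketch below joins the blocks and translates the first and last lines of the gene sequence. It assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation codons. The helper names (join_blocks, translate) are hypothetical and only the two lines copied from the record are used as input.

```python
# Standard codon assignments in NCBI order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def join_blocks(text):
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(text.split())


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAGATCA CTTTCTTGAC GCACTCGGTT GACCAGATCG GCGGCACGAT CCGCGCGACA"
last_line = "CTGACGCTGA AGGTCACCGA GGACGGCTGA"

print(translate(join_blocks(first_line)))  # prints "MKITFLTHSVDQIGGTIRAT"
print(translate(join_blocks(last_line)))   # prints "LTLKVTEDG"
```

The first result matches the opening two blocks of the protein sequence and the second matches its final block. The last line can be translated on its own because it begins at offset 1980 of the gene, which is a multiple of three.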