Gene Snas_4502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4502 
Symbol 
ID8885707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp4803482 
End bp4804777 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content71% 
IMG OID 
Productcysteine/1-D-myo-inosityl 2-amino-2-deoxy-alpha- D-glucopyranoside ligase 
Protein accessionYP_003513240 
Protein GI291301962 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00204978 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGAATCTT GGCAGGCACC CGACCTCAAG CGGCTCGGCG GCCAGGCGGT GCCGCTGCGG 
CTGTACGACA CCGCGCGGCA GGCCGTCCAC GCCACCGAAC CGCACCCCGA CGGCGCCCGC
ATGTACGTCT GCGGCATCAC CCCGTACGAC GCCACCCACC TGGGGCACGC GGCGACGATG
GTCACCTTCG ACGTGATCAA CCGGGTATGG CGCGACAACG GTCACGATGT CGCCTATGTC
CAAAATGTCA CCGATGTGGA CGAACCGCTC TTCGAACGGG CCGAACGCGA CGGTGAGGAC
TGGGTGGTGC TGGGCATGCG CGAGACCGCG CTGTTCCGCG AGGACATGGA GGCACTGTCC
ATAATCCCAC CCAAGGCCTA CGTCGGCGCG GTCGAGTCGA TGCCGACCAT CGCCGCGCTG
GTGGAGCGGC TGGTCGACGC CGGGGCCGCC TACACCGTCG ACGACGGCAC CGGCGACGTC
TACTTCCCGG TCACCGCCAC CGAGGGCTTC GGATCCGAGT CCCACTACGA CCGCGACACC
ATGCTGCGGT TCTTCGGCGA ACGCGGCGGC GACCCCGACC GCTCCGGCAA ACGCGACCCG
CTGGACGCGC TGCTGTGGCG CGGCGAACGC GAGGGCGAGC CGTCCTGGGA GTCGCGGCTG
GGCCGTGGCC GACCCGGCTG GCACATCGAG TGCGCCGCCA TCGCCCTGGC CCACCTGGGC
GACCGCATCG ACGTCAACGG CGGCGGCAAC GACCTGATCT TCCCGCACCA CGAGTTCTCG
GCCGCGCACG CCGAGGCCGC CACCAAGGCG GTGCCGTTCG CGAAGCACTA CGTGCACGCG
GGCATGATCG GCCTGGACGG CGAGAAGATG TCCAAGAGCC GCGGCAACCT GGTGTTCGTG
TCCCGGCTGC GCGCCGACGG CGTCGACCCG GCCGTGATCC GGCTGGCGCT GCTGGACGGG
CACTACCGCG AGGACCGGCC CTGGACCGCC GAGCTGCACG CCGCGGCCGC CGACCGGCTG
GCGCGCTGGC GCGAGGCGAT GGGCATGTCC TCGGGCGCGT CCGGCACCAC CACCGCGCAG
CGGGTGCGGG AACGGCTGTC CGACGACCTG GACACCCCGG GGGCGCTGCG GGCCGTCGAC
GAGTGGGCCG CGGCCAGTCT CACCGGCGCC CACCACGACG CTCACGCGCC GGCGCTGGTT
CGTGACACAG TGGAGAGCCT GCTCGGAGTC ACGCTGTTAC GGGCGGTGTT TCGGCAGCGC
ACAACCAGTG CGACGACATT GGATAGGTCA CGGTAG
 
Protein sequence
MESWQAPDLK RLGGQAVPLR LYDTARQAVH ATEPHPDGAR MYVCGITPYD ATHLGHAATM 
VTFDVINRVW RDNGHDVAYV QNVTDVDEPL FERAERDGED WVVLGMRETA LFREDMEALS
IIPPKAYVGA VESMPTIAAL VERLVDAGAA YTVDDGTGDV YFPVTATEGF GSESHYDRDT
MLRFFGERGG DPDRSGKRDP LDALLWRGER EGEPSWESRL GRGRPGWHIE CAAIALAHLG
DRIDVNGGGN DLIFPHHEFS AAHAEAATKA VPFAKHYVHA GMIGLDGEKM SKSRGNLVFV
SRLRADGVDP AVIRLALLDG HYREDRPWTA ELHAAAADRL ARWREAMGMS SGASGTTTAQ
RVRERLSDDL DTPGALRAVD EWAAASLTGA HHDAHAPALV RDTVESLLGV TLLRAVFRQR
TTSATTLDRS R