Gene Snas_4053 details

Gene Information

Locus tag: Snas_4053
Symbol:
ID: 8885254
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 4320902
End bp: 4322032
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 66%
IMG OID:
Product: glycoside hydrolase family 18
Protein accession: YP_003512798
Protein GI: 291301520
COG category:
COG ID:
TIGRFAM ID:
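
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4320902
END_BP = 4322032
GENE_LENGTH_BP = 1131
PROTEIN_LENGTH_AA = 376


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (an assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold: 4322032 - 4320902 + 1 = 1131 bp, and 1131 / 3 - 1 = 376 aa.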


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0226392
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGAA CCCGCGTCCT CAGCCTCGCC GCGATCCCGG TACTCGCGGC CGGGCTGGCC 
ATCACCACCA CCTCGGCCGG ACACGCCGAC CCCAACCCGC CCGACCGGGC CGAACTGCCG
AAGCACGCCC TCATCGGCTA CCTGCACTCC AGCTTCGCCA ACGGATCCGG CTACCTGCCG
ATGTCCGAGG TCCCCGACGA ATGGGACATC ATCAACCTCG CCTTCGGCGA ACCCACCTCC
GTCACCTCCG GCGACATCCA GTTCGACCTG TGTCCCAAGG AGGAATGTCC GAATGTGGAA
ACCGAGGACG AGTTCAAGGC CGCGATCAAG GACAAGCAGG CCAAGGGAAA GAAGGTGCTG
CTGTCCATCG GCGGCCAGAA CGGCCAGGTC CAGCTGACCA CCGCCGCCGC CCGGGACAAG
TTCGTCGAAT CCGTGGGCGG CATCATCGAC GAGTACGGCC TGGACGGTCT CGACGTCGAC
TTCGAGGGAC ACTCGCTGTA CCTCGACTCC GGCGACACCG ACTTCGAGAA CCCCAAGACC
CCGGTCATCG TCAACCTGAT CGACGCCCTG GACGCGCTCA AGGCCCGCTA CGGCGACGCC
TTCACCCTCA CCATGGCACC CGAGACCTTC TTCGTCCAGG TGGGACACCA GTTCTACGGC
GGAGCGGGCG GCGGCGACAA CCGCACCGGC GCTTACCTTC CGGTGATCCA CGCGGTGCGC
GACTACCTGA CCGTCCTGCA TGTACAGGAC TACAATTCCG GTCCCGTGAT GGGACTGGAC
GGCCAGTACC ACAACATGGG CAACGCCGAC TTCCACATCG CGATGACCGA CATGGTCAAG
GCGGGCTTCC CGGTGGCCAG CACCGGAAAG ACCTTCCCGG GCCTGCGCGA GGACCAGATC
GGCTTCGGCG TCCCGGCCGC CACCAGCGCC GGAAACGGCC ACACCTCACC CGAGGCGGTG
CAGCAGGCCC TCGGCTGTCT GGCCACAGGG GAGGACTGCG GCGGCTACGA ACTGCGCGGC
GGCCCGTCAC CCGCGATCCG CGGCCTGATG ACCTGGTCGA TCAACTGGGA CAACTACTAC
AAGTGGGAGT TCATGAACGC GCATGAGCCG TACCTGAACG GACTGCCGTA G
 
Protein sequence
MKRTRVLSLA AIPVLAAGLA ITTTSAGHAD PNPPDRAELP KHALIGYLHS SFANGSGYLP 
MSEVPDEWDI INLAFGEPTS VTSGDIQFDL CPKEECPNVE TEDEFKAAIK DKQAKGKKVL
LSIGGQNGQV QLTTAAARDK FVESVGGIID EYGLDGLDVD FEGHSLYLDS GDTDFENPKT
PVIVNLIDAL DALKARYGDA FTLTMAPETF FVQVGHQFYG GAGGGDNRTG AYLPVIHAVR
DYLTVLHVQD YNSGPVMGLD GQYHNMGNAD FHIAMTDMVK AGFPVASTGK TFPGLREDQI
GFGVPAATSA GNGHTSPEAV QQALGCLATG EDCGGYELRG GPSPAIRGLM TWSINWDNYY
KWEFMNAHEP YLNGLP
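
The sequences above are printed in blocks of ten letters, so the whitespace has to be removed before any computation. The following is a minimal sketch of how the gene sequence could be translated and its GC fraction computed. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons); the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the sketch does not handle alternative start codons or ambiguous bases.

```python
# Codon table built from the conventional TCAG ordering. Table 11 shares
# these amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on any codon containing a letter other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence in this record.
    first_line = "ATGAAACGAA CCCGCGTCCT CAGCCTCGCC"
    print(translate(first_line))
```

Run on the first line of the gene sequence, `translate` yields MKRTRVLSLA, which matches the first ten residues of the protein sequence listed above.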