Gene Snas_1931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_1931 
Symbol 
ID8883123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp2051678 
End bp2053150 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content70% 
IMG OID 
Productglycoside hydrolase family 3 domain-containing protein 
Protein accessionYP_003510720 
Protein GI291299442 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGACC GCCAAGAACT TCGCCAGCTC GCACTGCGCA CCCTGATTCC CGGTTTCAGC 
GGGACTACGG CGCCACGGTG GGCCCTTGAC CTCCTGGACG AGGGGCTAGG CGGATACTGC
CTCTTCGCCC ATAATGTCGC CGACCCACAG CAACTCTCCG AACTGAACTC AAGCCTCAGA
CAGGCCAATA AGGACGTCCT GATAGCAATC GATGAGGAGG GAGGCGATGT TACGCGCCTT
CACAGTGCCA GTGGCAGCCA CTACCCCGGT AACGCGGCCC TTGGGGCCGT AGACGATCCG
GAGTTGACCG CCGCGGTTCA CCAGTCCATG GGGGCCGAGT TGGCCGCCGC CGGAGTCACC
GTCGACTTCG CGCCGTCGGT GGACGTCAAT GTGGAGGACG ACAACCCCGT CATCGGCACC
CGCTCGTTCG GACGCGACCC GCAGGGCGTG GCCCGGCACG CGGTCGCGGC GGTGCGCGGG
CTGCACCGGG CCGGGATCAT CGCCTGCGCC AAACACTTCC CCGGTCACGG CGCCACCGTC
GTCGACTCCC ACTTCGAGGT GCCGACGGTG GACATCCCGC TGTCGGAGCT GCGGGAACGT
GAACTGGTGC CGTTCCGCGC CGTCATCGAC TCGGGCATCG AACTCATCAT GAGCGGCCAC
ATCCGGGTCC CCGAACTCAC CGGCGACGCG CCCGGCACCA TGTCCCGCGC GGCCATGCAC
GACCTGCTGC GCGGCGAACT GGGCTTCGAC GGCGCCATCG TCACCGACGC GATGGAGATG
CGCGGAGCCA GTGGCGCCAT CGGGATGCCC GAGGCCGTGG TCCGCGCGAT CGCGGCCGGG
TGCGACCTGA TCTGCACCGG CGGCGAACTC CAGAAACGCG GTCCCATGAC CGAGGTCGTC
AACGCTGTCG CCGACGCGGT CGCCACCGCC GTCATCGACG GTCGGCTGCC CTACGAGCGA
CTCGCCGACG CCGTGCGTCG TGGCGACGTG CTGCGGGAGT GGCAGCGCGA GAACCGGGGT
TCCCGAGCCC CGCACGCCCT GGGGCTTGCC GCCGCCCGAC GCGCCATCAC CGTGGAGGGA
ACCGTTCCGG CACTGAGCAA CCCGCTCATC GTCCAGATCG ACTCCACCGC CAACATCGCG
GTCGGCGAAT CGCAGTGGGG CGTCACGCCG TTGTTGGCCA AGCACCTGCC GCACGCCCAG
ATCCGCAACG TCACCCCCGA GAACGCCAGC GTCGCCCAGA TCTCCGCGCT GGCCGACAAC
CGCCCGGTCA TCGTCGTGGC CCGCGATACC CACCGGCGCC CGGCGTCCAA GGGCTTCGTC
GAGGGACTGG CGGCTTCCGG CCACGACGTG GTGCTCGTCG AGATGGGCTG GCCCGCGGCC
TGGCGGCCGG AGGGCGTGGC GGCCTACGTC GCCTCGTTCG GGGCGGCGGC GGTCAACGCC
GAGGCCACCG TCGAGGTGCT GTTGGGTTCG TGA
 
Protein sequence
MPDRQELRQL ALRTLIPGFS GTTAPRWALD LLDEGLGGYC LFAHNVADPQ QLSELNSSLR 
QANKDVLIAI DEEGGDVTRL HSASGSHYPG NAALGAVDDP ELTAAVHQSM GAELAAAGVT
VDFAPSVDVN VEDDNPVIGT RSFGRDPQGV ARHAVAAVRG LHRAGIIACA KHFPGHGATV
VDSHFEVPTV DIPLSELRER ELVPFRAVID SGIELIMSGH IRVPELTGDA PGTMSRAAMH
DLLRGELGFD GAIVTDAMEM RGASGAIGMP EAVVRAIAAG CDLICTGGEL QKRGPMTEVV
NAVADAVATA VIDGRLPYER LADAVRRGDV LREWQRENRG SRAPHALGLA AARRAITVEG
TVPALSNPLI VQIDSTANIA VGESQWGVTP LLAKHLPHAQ IRNVTPENAS VAQISALADN
RPVIVVARDT HRRPASKGFV EGLAASGHDV VLVEMGWPAA WRPEGVAAYV ASFGAAAVNA
EATVEVLLGS