Gene Snas_0202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_0202 
Symbol 
ID8881380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp215715 
End bp217232 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content69% 
IMG OID 
Productalpha-L-arabinofuranosidase domain-containing protein 
Protein accessionYP_003509014 
Protein GI291297736 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.957662 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0476441 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAACC GGCACAAGGC GAGCGTCGTG CTAGACCCGG CGTTCGCGGT GGCCCCCGTC 
GACCGGCGGC TGTTCGGGTC GTTCGTCGAA CACATGGGGC GGTGCGTGTA CGGCGGGATC
TACGACCCCG GCCATCCGTC CGCCGACGAG CACGGCCTGC GCACCGACGT CATCGACCTG
GTGCGGGAAC TGGGCGTGTC GGTGGTGCGC TACCCCGGTG GCAACTTCGT GTCCAGCTAC
CGCTGGGAGG ACGGCATCGG ACCGGTCGCC GACCGGCCCC GGCGGCTCAA CCTGGCCTGG
CGGTGCCTCG AGACCAACGA ATTCGGGCTC GGCGAGTTCA TGACCTGGGC GAGACTGGCC
GTCGTGGAAC CCATGATGAC GGTCAACCTC GGCACCCGGG GCGTCGCCGA GGCCTGCGAC
ATGATCGAGT ACTGCAACCA CCCCGGCGGC ACCGCGCTGT CCGACCTGCG CCGCAAACAC
GGCTCGGCCG ATCCCTACGA CATCAAACTG TGGTGCCTGG GCAACGAGAT GGACGGACCC
TGGCAGGTGG GCCAGAAGAC CGCCGCCGAA TACGGACGCA TCGCCGCCGA GACCGGCAAG
GCAATGCGCA TCGTGGACCC GTCCATCGAA CTGGTCGCCG CGGGCAGCTC CAACTCCCAG
ATGCCGACCT TCGGCGACTG GGAGGCCACT CTTCTGGAAC ACGCCTACGA CCAGGTCGAC
TACCTGTCGT TGCACCACTA TTTCGACCCC GCCAACCAGG ACCGCGACAG CTTCCTGGCC
TCGGGCACCG TCATGGACCG TTTCATCGAC GACGTCGTGT CCACGTGCGA CCACATCGGC
GCCAAACGCC GCAGCCGCAA GAAGATCAAG CTCAGCTTCG ACGAGTGGAA CGTGTGGTAC
CAAAGCCGCT TCACCGAACC CGGTGACCGG GAGTGGATCG AGTCGCCGCC GCTGATCGAG
GACGACTACG ACGCCACCGA CGCCGTGGTC GTCGGCGACC TGCTCATCAC GCTGCTGCGG
CACGCCGACC GGGTCTCGAT CGCCAACCAG GCCCAGCTCG TCAACGTCAT CGCCCCCATC
CGCACCGCCC CGGACGGACC GGCCTGGCGG CAGTCGATCT TCCACCCGTT CGCGCTGACC
TCCCGGCTGG CCCGCGGCAC CGTGCTGCGC ACCGAGACCG CGGGCCCCCG GCACGAAACC
CCGCGCCACG GCGAGGTGCC GACCCTGAGC ACCACCGCCA CCCACGACGC CGCCACCGGC
CAGACCGTCC TGTTCGCCGT GAACCGCGCC GAGCACCCGG TGGAACTGGC GGTGGACGCG
CGCGCCCTGT CCGGCGTCCG GCTCGCCGAA CACCTCACGA TCGCCGAAGA CGACCCCACG
GCGATCAACA CCCCCGCCGA CCCCGACCGG GTCGGACCCC GTCGACTACC ACCATCCGTT
ATGGACAACG GACGCTGTCT GGTGCGGCTG CCCGCGCTGT CCTGGAACGC CCTGCGTTTG
AGTGAAGAGA AAGAGTGA
 
Protein sequence
MDNRHKASVV LDPAFAVAPV DRRLFGSFVE HMGRCVYGGI YDPGHPSADE HGLRTDVIDL 
VRELGVSVVR YPGGNFVSSY RWEDGIGPVA DRPRRLNLAW RCLETNEFGL GEFMTWARLA
VVEPMMTVNL GTRGVAEACD MIEYCNHPGG TALSDLRRKH GSADPYDIKL WCLGNEMDGP
WQVGQKTAAE YGRIAAETGK AMRIVDPSIE LVAAGSSNSQ MPTFGDWEAT LLEHAYDQVD
YLSLHHYFDP ANQDRDSFLA SGTVMDRFID DVVSTCDHIG AKRRSRKKIK LSFDEWNVWY
QSRFTEPGDR EWIESPPLIE DDYDATDAVV VGDLLITLLR HADRVSIANQ AQLVNVIAPI
RTAPDGPAWR QSIFHPFALT SRLARGTVLR TETAGPRHET PRHGEVPTLS TTATHDAATG
QTVLFAVNRA EHPVELAVDA RALSGVRLAE HLTIAEDDPT AINTPADPDR VGPRRLPPSV
MDNGRCLVRL PALSWNALRL SEEKE