Gene Snas_3073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3073 
Symbol 
ID8884272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp3242398 
End bp3244173 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content69% 
IMG OID 
ProductGlycoside hydrolase family 20, catalytic core 
Protein accessionYP_003511837 
Protein GI291300559 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.221724 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.79825 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCGC CGACCGAGCT CGTCCTGCTG CCCCGCCCCC GCCACCTCGA CGTCACCGGC 
GACGGTCCGC CCACCACCAC CGAAGCCGTC GAGCACCAGG CCACCGACCT CGAACCGCAG
GGCTTCGAAC TCCACATCAG CGACACGACG ATCAACCTGC GCTACCGCGA CGACGCCGGA
CTGCGCTACG GCCGCGCCAC CCTCGCCCAG CTGCGCGCCC AGTTCCCCGA CCGGCTGCCC
GCCCTGTCCA TCAAGGACTG GCCCGACTTC GCCACCCGCG GCTACATGCT CGACGTCAGC
CGAGGCCGGG TCCCCACCCG CGACACCCTC GAACGCATCG TCGGCCTGCT GGAACTGCTG
CGCGTCAACC ACTTCCAGCT CTACACCGAG CACACCTTCG CCTACACGGC CCACGAAACC
GTGTGGCGCG ACGCCAGCCC CATGACCCCC GACGACATCC GCTGGCTCGA CGAACGCTGC
ACCGAAGCCG GTATCGAACT GGCCGCCAAC CAGAACTGCT TCGGCCACTT CGAACACTGG
CTCAAACACG ACGCCTACCG CGACCGCGCC GAACTGGCCG AGGGCTTCGA ACTGATGGGC
CGCCACCGCC CCGCCACCAC CCTCGCCCCC ACACAGGACA ACGCCGACTT CGCGCTGGAC
CTGGTCCGCG AACTGGTCCC CAACTTCTCC AGCCGCCGCG TCAACATCGG CTGCGACGAA
ACCTGGGAAC TGGGCCGGGG CGTATCGAAG GCGGCGGCCG ACGCCAAAGG CAAGGGCCGG
GTCTACCTGG AACACCTCCA CCGCCTGGTC AACCCGCTGA TCGCCGACGG CCTCGACGTC
CAGTTCTGGG GCGACATCAT CGCCAACCAC CCCGAACTGG CCACCGAACT GCCACAAGGC
GCCACCGCCG TGGCCTGGTG GTACGAGGCC CCCTGGGACC CCGAAGAGCA GGAACGACTG
CTGTGGGAAG TCGGCGACCG TTTCCTGGAC GCCGGGGTCG ACCTCAGCAA GAAACTGGGC
GGCTTCGCGG GCGAGGCCGC ACCCTTCCTC GAATCCAGCT ACCCACTGTG GGTCGCACCC
GGCACCGGCA CCTGGCGTTC CCTGCTGGGC CGACTCCGCA CCGCTTACGG CAACATGCTC
GACACCGTCA ACGTCGGCCT GTCCGGCGGC GTCGAAGGCG TCCTCATCAC CGACTGGGGC
GACGGCGGCC ACCCGCAACC CCCCTCAGTC ACCTTCCCGC CCCTGGCCTA CGGCGCCGCA
CTCTCCTGGT GCCGCGACGC CAACCACGAC CTCAACGTCC CCGCCGTCCT CGACCACTTC
GTCTTCCCCG GCACCAACAT CGGCGCGGCA CTCGAAGCCC TGGGCCACCT CAACGACCGC
ACCGGCCAGA TCGCCTTCAA CTCCAGCCCC CTCCAACTGG CCCTGGAACC CAACGCCCAC
CACGTCGGAG TAGGCGACCC GGACCCGAAG GCCCTGGCGG GAGTCGTCGA CGACATCGAC
GCCCTCATCG CCACCGTCAA CTCCGGCAGC GGCGCCTTCG GCGACCACGA GATCGTCCAA
AACGAACTCA CCGCCGCCGC CCGCCAAGCC CGCCATGGAG CCTGGCGCCT ACTGCGCCAA
GCCGGAGCCC CGGCCCCCGA CACCGCCGCC ATGCGCGCCG ACCTGGCCGA AACCACCGAA
CTCCACCGTC GAGCCTGGCT CTCCCGCGCC CGACCGGGTG GCATGGAAAC CTGGATGACA
ACCCTGACAG CGCTGGCCAA GACCTACGAC TCCTGA
 
Protein sequence
MIAPTELVLL PRPRHLDVTG DGPPTTTEAV EHQATDLEPQ GFELHISDTT INLRYRDDAG 
LRYGRATLAQ LRAQFPDRLP ALSIKDWPDF ATRGYMLDVS RGRVPTRDTL ERIVGLLELL
RVNHFQLYTE HTFAYTAHET VWRDASPMTP DDIRWLDERC TEAGIELAAN QNCFGHFEHW
LKHDAYRDRA ELAEGFELMG RHRPATTLAP TQDNADFALD LVRELVPNFS SRRVNIGCDE
TWELGRGVSK AAADAKGKGR VYLEHLHRLV NPLIADGLDV QFWGDIIANH PELATELPQG
ATAVAWWYEA PWDPEEQERL LWEVGDRFLD AGVDLSKKLG GFAGEAAPFL ESSYPLWVAP
GTGTWRSLLG RLRTAYGNML DTVNVGLSGG VEGVLITDWG DGGHPQPPSV TFPPLAYGAA
LSWCRDANHD LNVPAVLDHF VFPGTNIGAA LEALGHLNDR TGQIAFNSSP LQLALEPNAH
HVGVGDPDPK ALAGVVDDID ALIATVNSGS GAFGDHEIVQ NELTAAARQA RHGAWRLLRQ
AGAPAPDTAA MRADLAETTE LHRRAWLSRA RPGGMETWMT TLTALAKTYD S