Gene Snas_2947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_2947 
Symbol 
ID8884146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp3105924 
End bp3107282 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content65% 
IMG OID 
ProductCellulose 1,4-beta-cellobiosidase 
Protein accessionYP_003511715 
Protein GI291300437 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000458427 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00017321 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCCAAC TCAGACGTTA TTGGACGCGG TTCGCCGTGA TGTTGTTGTC GATCGGGGCG 
GTGCTGGTGT CTGGTTCTAC GGCCAGCGCT ATCCCTGCCG ACGGCCCGGC AGTTCAGGCT
CGCGTCGACA ACCCGTATGC GGGGGCTCGG CCTTACGTAA ATCCGGAATG GTCGGCCAAA
GCGGCCGCCG AGCCCGGTGG CAGCGCCATC GCGGACCAGC CGACCGGCGT GTGGCTGGAC
CGCATCGCCG CCATCGAGGG CGCGGGCAGT GCCATGGGCC TGCGCGATCA CCTGGACGCG
GCCCTCGCCC AGGACGCGAA CCTGGTGCAG CTGGTCGTGT ACGACCTTCC TGGACGCGAC
TGTTCCGCAC TGGCTTCGAA CGGAGAACTC GCTCCCGACG AGATCGGCAG GTACCGCGAC
GAGTTCATCG ATCCGATCGC GGCGATTCTC GCCGACCCGG CCTACGCGGG GTTGCGCATC
GTGACCGTTG TGGAGATCGA TTCGCTGCCC AACCTCGTGA CGAATGTCAG CCCGCGTCCC
ACGGCGACGC CGGAATGCGA TGTGATGGCC GCGAACGGCA ATTACGTCAA CGGCATCGGT
TACGCGCTGC GGCAGTTCGG CGCGATCGAC AACGTCTACA ACTACCTGGA TGTCGGCCAT
CACGGTTGGC TGGGTTGGGA TGACAACTTC GCGCCCGGTG CACGGAAGTT GCTGGAGGGC
GCTCAGGCTT CCGGCAGCGT GGACAATGTC CACGGGTTCA TCACCAACAC CGCCAACTAC
GGTGCCCTCA AGGAGCCGTA CTTCACGATC AATGACACCG TGAACGGCCA GACCGTGCGC
CAGGCGAAGT GGATCGACTG GAACCGTTAC GTGGACGAGC TGTCCTACGC GCAGGCGTTC
CGCGCCGAAC TGGTCCGGAT CGGCTTCAAC TCGGATATCG GCATGTTGAT CGACACCGGC
CGTAACGGCT GGGGTGGTTC CGCGCGTCCG GCCGGGCCTG GCCCGACTAC TTCGGTGGAC
GCTTATGTCG ATGGTGGACG CCTCGATCGC AGGATCCACC TCGGTAACTG GTGCAACCAG
TCCGGGGCCG GGCTGGGGGA GCGTCCTACC GCCGCTCCCG AGTCGGGGAT CGACGCTTAT
GTGTGGATGA AACCGCCGGG CGAGTCCGAC GGTTCCAGCA AGGAGATCCC CAACGACGAG
GGCAAGGGCT TTGACCGGAT GTGCGATCCG ACTTATGAGG GGAACATCCG CAATGGGTTC
AACCCGCCCG GATCGCTTCC CGACGCCCCG CTGTCGGGGC ACTGGTTCGG CGCGCAGTTC
CGTGAGCTGC TGGCCAACGC CCATCCGCCG CTGACCTGA
 
Protein sequence
MSQLRRYWTR FAVMLLSIGA VLVSGSTASA IPADGPAVQA RVDNPYAGAR PYVNPEWSAK 
AAAEPGGSAI ADQPTGVWLD RIAAIEGAGS AMGLRDHLDA ALAQDANLVQ LVVYDLPGRD
CSALASNGEL APDEIGRYRD EFIDPIAAIL ADPAYAGLRI VTVVEIDSLP NLVTNVSPRP
TATPECDVMA ANGNYVNGIG YALRQFGAID NVYNYLDVGH HGWLGWDDNF APGARKLLEG
AQASGSVDNV HGFITNTANY GALKEPYFTI NDTVNGQTVR QAKWIDWNRY VDELSYAQAF
RAELVRIGFN SDIGMLIDTG RNGWGGSARP AGPGPTTSVD AYVDGGRLDR RIHLGNWCNQ
SGAGLGERPT AAPESGIDAY VWMKPPGESD GSSKEIPNDE GKGFDRMCDP TYEGNIRNGF
NPPGSLPDAP LSGHWFGAQF RELLANAHPP LT