Gene Snas_2698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_2698 
Symbol 
ID8883896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp2839317 
End bp2840426 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content66% 
IMG OID 
Productglycoside hydrolase family 43 
Protein accessionYP_003511468 
Protein GI291300190 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.898004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.338928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGCGC GCGATGTGAT GTGGCTCATA ATCGCGGCCG ATTGTCTGCC CGCGGAGCCG 
GAGTTCGCGT CTCTAGCACT GAAGCCCACC GAGGAAGAAA GTGAACCGCC GATGTTCACC
AGTGCGTTGT CATTCGTCCG TCGCCGTAAG GTTCTGGCCA TCGGGGTCGT GGTCGCCGTG
GCCGCGTCCA CAGTAGCCTG TATCGGTCTG GGCGCGGTCA CCCAAGCCGA GGACGCCCAG
ACCAAGGCCG CCAAGGTTCA GAAGGTCATC GACAAGAACT TCGCCGACCC CGAGGTCATC
AAGGTCGGCA ACCGGTACTA CGCCTACGCC ACCAACAACA AGCGGAACCT GCCGGTGGCC
ACTGCGGACA AACCGCGCGG TCCGTGGAAG TTCACCGGCG CCGACGGGAT GCCGAAACTC
GGCGCCTGGG CCAAACCCGG ACGCACCTGG GCGCCGGACG TGTCCCGTCG CCCCGACGGC
AGCTTCCTGC TCTACTACAC CGCGCACCAC GCCAAGTCGG GCCAGCAGTG CATCGGCGCG
GCCACCGCGT CCAAACCGGA GGGACCCTTC ACCGCGGTGG GGAACGCGCC GCTGGTGTGT
CCCACCAAGC AGGGCGGCGC GATCGACGCG GGCAGCTACT CCGAGGGCGG CAAGCACTGG
ATCATGTACA AGGCCGAGAA CAACGTGATC GGCAAGCCGC CTGTCATCTA CATGCACCAG
ACCGGCCCCA AAGGACTGAA GCTCCTCGGT AAGCGGTTCG CGATTCTGCG CAACGACCGG
GCCGTGGAAC GCGGCATCAT CGAGGCGCCG GTGATGGTCA AGCGCGGCGG CAACTACATC
CTCTTCTACG CGGGCGACCA TTTCGCGCGC AACACCTACT TCACCGGTTA CGCGGTCGCC
AAGAAGATCA CCGGGCCGTT CAAGAAGGCC GACAAGCCGC TGCTGAGCAT GAAGGGCCTG
GGTGGGGCCG TGAAGGGACC CGGTGGCGCC GACGTGGTGA CCGGCCCCAA CGGTGTCGAC
CACATCTTCT TCCACGGCCT GGTCGGCAAG GCCCGGCATA TGTACCGGGC CGAGTTGGGT
TGGGTCAAGG GCCGTCCGGT GCTACGCTGA
 
Protein sequence
MSARDVMWLI IAADCLPAEP EFASLALKPT EEESEPPMFT SALSFVRRRK VLAIGVVVAV 
AASTVACIGL GAVTQAEDAQ TKAAKVQKVI DKNFADPEVI KVGNRYYAYA TNNKRNLPVA
TADKPRGPWK FTGADGMPKL GAWAKPGRTW APDVSRRPDG SFLLYYTAHH AKSGQQCIGA
ATASKPEGPF TAVGNAPLVC PTKQGGAIDA GSYSEGGKHW IMYKAENNVI GKPPVIYMHQ
TGPKGLKLLG KRFAILRNDR AVERGIIEAP VMVKRGGNYI LFYAGDHFAR NTYFTGYAVA
KKITGPFKKA DKPLLSMKGL GGAVKGPGGA DVVTGPNGVD HIFFHGLVGK ARHMYRAELG
WVKGRPVLR