Gene Sare_2410 details

Gene Information

Locus tag: Sare_2410
Symbol:
ID: 5703694
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2775135
End bp: 2776514
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 65%
IMG OID: 641271887
Product: glycoside hydrolase family protein
Protein accession: YP_001537258
Protein GI: 159038005
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
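
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon; the record states neither convention, so both are assumptions. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2775135, 2776514)
    print(length)                  # 1380, matching "Gene Length"
    print(protein_length(length))  # 459, matching "Protein Length"
```

Under these assumptions the reported gene length (1380 bp) and protein length (459 aa) agree with the start and end coordinates.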


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.840511
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.905848
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTGG GCGTGAGAAT TGCGGCCCTG GTCGGCGTCG TCGCGGTGAC AGCTGCGGGG 
GCCATGGTGG CCACCGCGAC CGCGACCGCA GCGGCCTCGC CGACCGCGAC GTTCGTCAAG
GTTTCCGACT GGGGCACCGG CTGGGAGGGT CGGTACACCA TCACCAACGG GGGAAGCAGC
ACCCTGAACT CCTGGCAGGT CGAGTTTGAC CTACCGACGG GCACCACCGT CGGTTCGTAC
TGGAACGCGC TGATGAACCA CGATGGGCAG CACTACCGCT TCACCAACCA GCACTGGAAC
GGCACGATCA CGCCGGGCGC CTCGGTGACG TTCGGCTTCC TCGGTGCCGG CTCCGGCAGT
CCGAGCGGTT GCCGGCTCAA CGGACAGCCG TGCACACCAA CAGCTCCTCC GACGACGAGT
CCGCCACCGA CCACCGCGCC CCCCAGCACC ACGCCGGTTG CAGCGAACGG GCAGTTGCGG
GTCTGCGGAC AGCGCCTCTG TAACGAGGAC GGCAAGCAGA TCCAACTCCG GGGCATGAGC
ACGCACGGAC TCCAGTGGTA CGCCAACTGC GCGAACACGG CCTCGCTCGA CGTACTCGCC
CAGGAGTGGG GTGCGGACGT TCTGCGAATC TCGATGTACA TCCAGGAAGG CGGCTACGAA
ACTGACCCGC GCCGATTCAC CGACCTGGTC CACGACTACA TCGAACTGGC CACCGCCCGC
GGCCTCTACG CGGTCGTCGA CTGGCACATG CTCACCCCTG GAGACCCGAA CTACAACCTC
TCGCGAGCGC GAACCTTCTT CGCGGAGATC GCCGACCGCC ACCGGGACAA GGTCAACGTC
CTGTACGAGA TCGCGAACGA ACCGAACGGT GTCAGTTGGG GAGCCATCAA GAGCTACGCC
GACCAGGTAA TCCCGGTCAT CCGGGAACGG GATCCGGAAG CCGTGGTGCT TGTCGGCACA
CCCGACTGGT CGTCGCTCGG TGTGTCTGGC AGTGGCGGCG GCGTCGATGC CATCCTCGCC
GATCCGGTGG CAGCGAGCAA TCTCATGTAC GTCTTCCACT TCTACGCGGC ATCACACGGC
GACCCGTACT ACAACACCTT GGCCGACGCG GCCGACCGGC TTCCGATCTT TGTGACCGAG
TTCGGAACCC AGCAGTACAC CGGCGACGGC CCGAACAACT TCACCATGTC CCAGCGCTAC
CTCGACCTCA TGGCGAACAA GAAGATCAGT TGGGTCAACT GGAACTACTC CGACGACTTC
CGCTCTGGCG CGGTCTTCAC GACTGGCACG TGCGCCGCCG GCGAGTTCAG CGGTACGGGC
CCCCTCAAAC CGGCCGGCGG TTGGATACGC GAACGCATGC GTACCGCGGA CGACTTCTGA
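
The GC content field of this record (65%) can be recomputed from the gene sequence. This is a minimal sketch; `gc_fraction` is a hypothetical helper name, and it assumes the sequence is pasted as shown here, in blocks of ten bases separated by spaces and line breaks.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence only; paste the full
    # sequence to compare against the reported GC content.
    print(gc_fraction("ATGAAGCTGG GCGTGAGAAT"))
```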
 
Protein sequence
MKLGVRIAAL VGVVAVTAAG AMVATATATA AASPTATFVK VSDWGTGWEG RYTITNGGSS 
TLNSWQVEFD LPTGTTVGSY WNALMNHDGQ HYRFTNQHWN GTITPGASVT FGFLGAGSGS
PSGCRLNGQP CTPTAPPTTS PPPTTAPPST TPVAANGQLR VCGQRLCNED GKQIQLRGMS
THGLQWYANC ANTASLDVLA QEWGADVLRI SMYIQEGGYE TDPRRFTDLV HDYIELATAR
GLYAVVDWHM LTPGDPNYNL SRARTFFAEI ADRHRDKVNV LYEIANEPNG VSWGAIKSYA
DQVIPVIRER DPEAVVLVGT PDWSSLGVSG SGGGVDAILA DPVAASNLMY VFHFYAASHG
DPYYNTLADA ADRLPIFVTE FGTQQYTGDG PNNFTMSQRY LDLMANKKIS WVNWNYSDDF
RSGAVFTTGT CAAGEFSGTG PLKPAGGWIR ERMRTADDF
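
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own method. It uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). It does not handle alternative start codons; that is enough here because this gene begins with ATG. The `translate` function and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a forward-strand CDS up to the first stop codon.

    Whitespace is ignored and a trailing partial codon is dropped.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First and last few codons of the gene sequence in this record.
    print(translate("ATGAAGCTGG GCGTGAGA"))  # MKLGVR
    print(translate("GACGACTTCTGA"))         # DDF
```

The two short fragments reproduce the first six and last three residues of the protein sequence in this record.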