Gene Sare_4050 details

Gene Information

Locus tag: Sare_4050
Symbol:
ID: 5706313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4606710
End bp: 4608149
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 69%
IMG OID: 641273476
Product: carbohydrate-binding family 6 protein
Protein accession: YP_001538831
Protein GI: 159039578
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2273] Beta-glucanase/Beta-glucan synthetase
TIGRFAM ID:
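
The coordinate and length fields above are consistent with each other, which a short check can confirm. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length_bp(4606710, 4608149)
print(length)                     # 1440
print(protein_length_aa(length))  # 479
```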


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.567483
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0446183
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACACGC CACCCGGGGC GCTTGCCCCC CTACCCGCGA GACGCCGGAA GCGACTGGCG 
CTGGCGCTGT TCACTACCCT CACCACCACC GCGCTGGCCG GACTCGCGGT CACCGCGAAT
GCCGCCGTAC CCGCCCCTGC GCCCGGCTGG AGTCTGGTCT GGAGTGACGA CTTCGACGGC
GCGGCCGGTA CTCTGCCCTC CTCGGCCAAC TGGATCATCG ACGTCGGCAC CAGCTACCCT
GGCGGGCCGC CCAACTGGGG TACCGGTGAG ATCCAGACGT ACACCGACAG CACCGCCAAC
ATCAGTCACG ACGGTGCCGG GAACCTGCGA ATCACCCCAC TGCGAGACTC GTCGGGCGGG
TGGACCTCGG CCCGGATCGA GACCGTGCGT ACCGACTTCA AACCGCCGTC CGGTGGCGTC
CTCGCGATCG AGGGACGGCT CCAGGTGCCG AACGTGACCG GCGCCCAGGC GGCCGGCTAC
TGGCCGGCGT TCTGGGCGCT CGGGTCACCG TACCGAGGCA ACTACCAGAA CTGGCCCAGC
ATCGGCGAGT TCGATGTGAT GGAGAACGTC AACGGGATCA ACTCCGTGTG GGGTGTGCTG
CACTGCGGCT ACGCGCCGGG CGGGCCGTGC GACGAGTTCA ATGGCATCGG TGCCTCCCGG
ACCTGCCCGG GGGCGACCTG CCAGTCGGCG TTTCACACCT ACCGGTTCGA GTGGGACGCC
TCGGTCAGTC CACAGGTGCT GCGCTGGTAC GTCGACGGCG AGCTCTACCA CACGGTGACC
GAGACCCGGG TCGGTGAGCC GGCCTGGTCG CAGATGACCG GCCACGCCGG CTACTTCCTG
CTGCTCAATG TGGCGATGGG AGGCGCGTTC CCGAACGGTG TCGCCGGGGG AACCACCCCG
ACCGCCGCGA CGGTGCCGGG TCGACCAATG GTCGTCGACT ACGTCGCCGT CTACAGCCGT
GGTGGGGGCA CCGCGCCGCC GACCACCGCA CCGCCGACCA CCGCGCCACC GACCACTGCG
CCGCCCGGCG GGGTGCGGGA TGCCTACGGG AGGATCGAGG CCGAGTCGTT CAACGGTCAG
AGTGGGGTCA GGGCGGAGGA CTGCTCCGAG GGCGGACAGA ACATCGGGTA CCTGCGTGAC
GGTGACTGGG CCCGGTACGA CAACGTCGAG TTCGGAACAA CGCCACCACG GGACTTCGTC
GCTCGGGCCG CCTCCGGCGC CGGAGACGGG GTGAGCGGCT TGGTCGAGGT ACGACTGGGA
AGTCCGACCA GCCCGCCGAT CGGTAGCTTC GCGATCGGTG ACACCGGCGG CTGGCAGAGC
TGGCGTTCGG TGCCCGGTAA CGTCGCCGGA CCCACCGGCC GCCACACGGT CTACCTGACC
TTCACCAGCG GCCAGCCGAA CGACTTCGTC AACATCAACT GGTTCTCCTT CCGCCGCTGA
 
Protein sequence
MHTPPGALAP LPARRRKRLA LALFTTLTTT ALAGLAVTAN AAVPAPAPGW SLVWSDDFDG 
AAGTLPSSAN WIIDVGTSYP GGPPNWGTGE IQTYTDSTAN ISHDGAGNLR ITPLRDSSGG
WTSARIETVR TDFKPPSGGV LAIEGRLQVP NVTGAQAAGY WPAFWALGSP YRGNYQNWPS
IGEFDVMENV NGINSVWGVL HCGYAPGGPC DEFNGIGASR TCPGATCQSA FHTYRFEWDA
SVSPQVLRWY VDGELYHTVT ETRVGEPAWS QMTGHAGYFL LLNVAMGGAF PNGVAGGTTP
TAATVPGRPM VVDYVAVYSR GGGTAPPTTA PPTTAPPTTA PPGGVRDAYG RIEAESFNGQ
SGVRAEDCSE GGQNIGYLRD GDWARYDNVE FGTTPPRDFV ARAASGAGDG VSGLVEVRLG
SPTSPPIGSF AIGDTGGWQS WRSVPGNVAG PTGRHTVYLT FTSGQPNDFV NINWFSFRR
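
The sequences are printed in space-separated blocks of ten, so the whitespace has to be stripped before any computation. The sketch below is a minimal illustration, not the database's own method. It shows how the GC content and the protein translation could be recomputed from the gene sequence. The helper names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares with table 1; table 11's alternative start codons are not handled, which is harmless here because this gene begins with ATG.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above (60 nt, i.e. 20 codons).
first_line = ("ATGCACACGC CACCCGGGGC GCTTGCCCCC "
              "CTACCCGCGA GACGCCGGAA GCGACTGGCG")
print(translate(first_line))  # MHTPPGALAPLPARRRKRLA
```

Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and protein sequence listed in this record to be checked in the same way.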