Gene Sare_0525 details

Gene Information

Locus tag: Sare_0525
Symbol:
ID: 5707127
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 596413
End bp: 598788
Gene Length: 2376 bp
Protein Length: 791 aa
Translation table: 11
GC content: 68%
IMG OID: 641270051
Product: Kojibiose phosphorylase
Protein accession: YP_001535445
Protein GI: 159036192
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1554] Trehalose and maltose hydrolases (possible phosphorylases)
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
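
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 596413
end_bp = 598788

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2376
print(protein_length)  # 791
```

Under these assumptions the result agrees with the listed gene length (2376 bp) and protein length (791 aa).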


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.659551
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0300138
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGAG AACGCGCATA TCCGGTTGAA CCATGGCATG TCCGAGAAAC CCGGCTGGAC 
ATGGACGTGC TGGCCCAGTC CGAATCGGTG TTCGCCCTGT CCAATGGACA CATCGGGCTG
CGCGGCAACC TCGACGAGGG GGAACCGTAC GGCCTTCCCG GCACCTATCT CAACTCCTTT
TACGAGCTGC GCCCCCTACC GCACGCCGAG GCTGGATTCG GCTTCCCCGA ATCCGGGCAG
ACCATCGTCA ACGTGACCAA CGGCAAGCTC ATCCGGCTGC TGGTCGACGA CGAACCACTC
GATGTGCGCT ACGGCGAACT CCTCTCCCAC GAGCGGGTGC TCGACCTGCG CGAGGGCACG
CTGCATCGGA CCGTCCACTG GCGCTCGCCG GCCGGCCGGG AGGTGTGCAT TCGCAGCACG
CGACTGGTCT CCTTCCGCCA GCGGTCCGCA GCCGCCATCA ACTACGAGGT CGAAGTCGTC
GACAACAAGC CGCTGCGGCT GATCGTTCAG TCGGAGTTGG TGGCCAACGA GACACTGCCG
GCCCAGAGCA AGGACCCGCG GGTGGCGGCA GTGCTGGAGT CGCCCCTGCT GGCCGAGGAG
GAACTGACCA CCGACGACGG CGGCCTGCTG ATCCACCGTA CGAAGGTCTC CGGGCTGCGC
CTGGCCGCGG CCATGGCACA CGAGGTGCGG ACCACCGCAC GGACCAACAT CGAGTCCGAG
GGCTACGAGG ACTGGGTGCG CACCACCACC GCCTGCGTGC TCAAGCCCGG AGAGACGCTA
CGGGTCGTGA AGTACCTGTC GTACGGGTGG TCCAGCCGCC GGTCGCTACC GGCGCTGCGG
GACCAGGTCG GTGCCGCGCT GGCCGCCGCC CGTCTGGACG GGTGGGACGG GTTGCTCCGG
GAACAGCGGG AGTACCTCGA GGAGTTCTGG GACTCCGCCG ACGTGCTGGT GGAGGGTGAC
CCCGAGGTGC AGCAGGCAGT ACGCTTCGGT CTCTTCCACG TGCTCCAGGC AGGAACCCGG
GCCGAGCGCC GGCCGATCTC GGCGAAGGGG CTCACCGGGC CGGGGTACGA CGGGCACGCC
TTCTGGGACA CCGAGATGTT CGTGTTGCCG GTGCTCACGT ACACCCATCC CACCGCGGTG
CGTGATGCCC TGTACTGGCG GCATCACACC CTGCCAGCTG CCCGGGACCG GGCCCGCACG
CTCAACCTGG CGGGCGCCGC TTTTCCCTGG CGCACGATCG ACGGGCCGGA ATCCTCCGGC
TACTGGCCGG CCGGCACCGC CGGCTTCCAC GTCGCCGCCG ACATCGCCGA CGCGGTGCGC
CGTTACGTGT ACGCCACCAA GGACACCAGC CTGGAGCGGG AGATCGGCCT GGAACTGCTG
GTGGAAACCG CGCGGCTGTG GCGGTCGCTG GGCCACCACG ACCGCCACGG GCAGTTCCAC
ATCGACGGGG TGACCGGCCC GGACGAGTAC ACGGCCATCA AGAACGACAA CATCTACACC
AACCTCATGG CCCAACGGAA CCTCATCACC GCCGCGGACA CGGCGATGCG GTACCGGGAC
GAGGCGGGGC ACCTCGGCGT CACCGACGAG GAGGCCGCGG CCTGGCGGGA CGCGGCCGCC
GACATCCACA TACCGTACGA CGAGGAGATC GACGTACATG AGCAGGTTGA GGGTTTCACC
CGGTTCCAGG AGTGGGACTT CCTCCACACG CCTCCGGAGA AGTACCCGCT GCTGCTGCAC
TATCCGTACT TCGATCTGTA TCGCAGGCAG GTGATCAAGC AGGCCGACCT GGTGTTGGCG
ATGCACTGGC GGGGGGACGC GTTCACCCGG GAGGAGAAAC TGCGCAACTT CCTCTACTAC
GAGCGCCGTA CCACCCGTGA CTCGTCGCTG TCAGCCTGCA CCCAGGCTGT GTTGGCGGCC
GAGGTGGGGT ACCCGGAACT GGCCCACACC TACCTGCGTG AGGCCGCGCT GATGGACCTG
CACGACATCA ACGAGAACAC CAGGGACGGC GTGCACATCG CCTCGCTGGC CGGCGCGTGG
ATCGCCCTCG TCGCCGGGTT CGGCGGGCTG CGCGACCACG ACGGAACTCT GCGGTTCGCG
CCGCGGCTCC CCCACCGGCT GGGCCGGTTG GAGTTCTCGC TGCAGTGGCG GGGCACCCAG
CTCCGGGTCG ACATCCAGCC GCACCAGACC ACGTACGAAC TTCGGCACAG CGATCCCGAC
GAGGCCCTGG AACTGGTCCA CCACGGTGAG CGGATTCGGG TCACCTGTGC CGAGCCGGTG
ACCCGGCCGA CCCCACCACC CGGTAAGCCT GGCCCAACCC CGGAGCAACC TCCGGGTCGC
GATCCACTCA TCCGTCTACC CGAACGGGCA CCATAG
 
Protein sequence
MIRERAYPVE PWHVRETRLD MDVLAQSESV FALSNGHIGL RGNLDEGEPY GLPGTYLNSF 
YELRPLPHAE AGFGFPESGQ TIVNVTNGKL IRLLVDDEPL DVRYGELLSH ERVLDLREGT
LHRTVHWRSP AGREVCIRST RLVSFRQRSA AAINYEVEVV DNKPLRLIVQ SELVANETLP
AQSKDPRVAA VLESPLLAEE ELTTDDGGLL IHRTKVSGLR LAAAMAHEVR TTARTNIESE
GYEDWVRTTT ACVLKPGETL RVVKYLSYGW SSRRSLPALR DQVGAALAAA RLDGWDGLLR
EQREYLEEFW DSADVLVEGD PEVQQAVRFG LFHVLQAGTR AERRPISAKG LTGPGYDGHA
FWDTEMFVLP VLTYTHPTAV RDALYWRHHT LPAARDRART LNLAGAAFPW RTIDGPESSG
YWPAGTAGFH VAADIADAVR RYVYATKDTS LEREIGLELL VETARLWRSL GHHDRHGQFH
IDGVTGPDEY TAIKNDNIYT NLMAQRNLIT AADTAMRYRD EAGHLGVTDE EAAAWRDAAA
DIHIPYDEEI DVHEQVEGFT RFQEWDFLHT PPEKYPLLLH YPYFDLYRRQ VIKQADLVLA
MHWRGDAFTR EEKLRNFLYY ERRTTRDSSL SACTQAVLAA EVGYPELAHT YLREAALMDL
HDINENTRDG VHIASLAGAW IALVAGFGGL RDHDGTLRFA PRLPHRLGRL EFSLQWRGTQ
LRVDIQPHQT TYELRHSDPD EALELVHHGE RIRVTCAEPV TRPTPPPGKP GPTPEQPPGR
DPLIRLPERA P
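
The gene and protein sequences above are printed in blocks of ten characters, so they need to be cleaned before use. The following is a minimal sketch, not the database's own tooling, of how the blocks could be joined, the GC fraction computed, and the gene translated. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in permitted start codons). The function names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helpers introduced for illustration.

```python
# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence as printed in this record.
first_line = "ATGATCCGAG AACGCGCATA TCCGGTTGAA CCATGGCATG TCCGAGAAAC CCGGCTGGAC"
last_line = "GATCCACTCA TCCGTCTACC CGAACGGGCA CCATAG"

print(translate(clean_sequence(first_line)))  # MIRERAYPVEPWHVRETRLD
print(translate(clean_sequence(last_line)))   # DPLIRLPERAP
```

The two translated fragments match the start and end of the listed protein sequence. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.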