Gene Sare_3919 details


Gene Information

Locus tag: Sare_3919
Symbol:
ID: 5703770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4459686
End bp: 4462289
Gene Length: 2604 bp
Protein Length: 867 aa
Translation table: 11
GC content: 70%
IMG OID: 641273344
Product: HAD family hydrolase
Protein accession: YP_001538701
Protein GI: 159039448
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1877] Trehalose-6-phosphatase
        [COG3387] Glucoamylase and related glycosyl hydrolases
TIGRFAM ID: [TIGR00685] trehalose-phosphatase
            [TIGR01484] HAD-superfamily hydrolase, subfamily IIB
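
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive, and that the gene length counts the stop codon; the page itself does not state either convention. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4459686, 4462289)
print(length)                     # 2604
print(protein_length_aa(length))  # 867
```

Both values match the Gene Length (2604 bp) and Protein Length (867 aa) reported in the record.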


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.936294
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.284272
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCACCA GTACCGAACT CGTCAACCCG GTCGGTGGCG GCCTGGACCA GGAGCTACGC 
ACCGCGATCG GTCGGGTCGC CCGCGTCCCG CAACTGCTGA TCGCCTGCGA CTACGACGGC
ACCCTCGCCC CGATCGTCGA GGATCCGAGC ACGGCCGTAC CGTTGCCCGA GGCGGCGGCT
GCGGTACGCG CGCTCGCCTC GCTACCGCAG ACCACCGTGG CGGTTGTCTC CGGACGGGCG
CTGCGTGACC TGGCTGCGCT GTCCCGGCTG CCCAGCGAGG TCCATCTGGT CGGCAGCCAC
GGCTCCGAGT TCGACATCGG CTTCGTCGAG CGGCTCTCCC CCGAGCTGGT CGCCGTCCGC
ACCCGGCTAC GCGACGCGCT GCGCGAGATC GTCGCCGCCC ACCCCGGGAT CCGGCTGGAA
CGCAAGCCGG CCAGTGTCGC CGTACACACC CGCGGCGTTG ACCCGCAGGT CGCCGCCGCA
GCCGTCGAGG CGGTTCGCAA CGGCCCCGCG ACCTGGGACG ACGTCACCGT CACCCAGGGC
AAGGAAGTCA TCGAGTTGTC GGTGGTCGCC ACCCACAAGG GCACCGCCGT CGACCAGCTA
CGTACCCAGC GCTCCGCCAG CGCGGTACTG TTCATCGGCG ACGACGTCAC CGACGAGAAC
GCGTTCGGCA ACCTGCACGG GCCGGACGTC GGAATCAAGA TCGGGCCGGG CGAGACCAAG
GCACAGTACC GGGTCACCGA GCCGATCGAG GCGGCCCGCG CCCTCGGCCT GCTGCTGGAG
ACCCGACGGC ACTGGCTGTT CGGCGAGCGG GCGGTGCCGA TCGAACGGCA CTCGATGCTG
GCCAACGGCC GGACGGTGGC GCTGCTCACT CCCGAGGCCA AGGTCACCTG GCTGTGCAAC
CCGAAGCCCG ACTCCGCGGC GATCTTCGCC GACCTGGTCG GCGGCAGCCC CGCCGGCTAC
TTCAGCGTCG TGCCGGAGCG CGGCGGCATC CCGCTCGGCC AGCGCTACCG CTCGAACACC
ATGACCGTGG AGACCCGGTG GTCCGGCCTG ACCGTGACCG ACTGGCTCGA CACGCCGGAC
ACCGAGACCA CACCCGACGG CCCGGCGATC GTCAGCGCCG ACTCGATCCT GGTCCGGGTG
CTCACCGGAC GCGGCCGAGC CCACCTCGAG TTCGCCCCCC GGCCCGAGTT CGGCCAGGTC
GCCGTTCAGC TGCAACCGCT CGATGACGGG CTGCTGGTAC TCGGTTCCAA CGAGCCGGTC
GCGCTGTACT CGCCCGGTGT GACGTGGGAA GTGACCAGCG ATGGCGTGTA CGAGACGGCG
AAGGCCGTCG TCGACCTGTC CACCGCCGGC GGACCGGTCG TGCTGGAGAT GCGTTTCGGC
ACGCACAGTC TGGAACACCA CCGACTGCCG ATCCACGAAC GGCAGGCTGC GGCGGAACAG
CCCTGGAAGG ACTGGGTGTC CACGCTGCGG CTCCCGAACA CCGGCCGGGA CTTGGTCGCC
CGCAGCGCGC TCACCCTACG CGGGCTGTGC CACCAGCCGA GCGGTTCGAT CCTGGCCGCC
GCGACCACCT CACTGCCCGA GGAACTGGGC GGCGTGCGCA ACTGGGACTA CCGCTACTGC
TGGCTGCGCG ACGCGGCGCT GACCGCCCGC GCTCTGGTCG ATCTCGGCTC CACCGGTGAG
GCCGAGGCGC TGCTGCGTTG GATCGACGGT GTCGTCGAGC GCACCGGTGG GCACCCCGAA
CGGCTGCATC CGCTCTACAC GGTCGACGGT TACGAACTGG GCGCCGAGGC CGTTATCGAC
ACACTTCCCG GCTACGCCGG TTCCCGGCCC GTCCGGGTCG GGAACCTCGC CAACCACCAG
CTCCAACTCG ATGTCTTCGG TCCGATCGCC GACCTGATCG CGGCCGTGGC CGACGCTCGC
GGCTCCGTGC GCGACGACGA GTGGCGGGTG CTGGAGAACA TGGTGGAAGC GGTCCGCCGC
CGCTGGCACG AGCCGGACCA TGGCATCTGG GAGGCGCGCC TGGCGCCCCG ACATCACATC
TTCTCGAAGG TGATGTGCTG GCTGACCGTG GACCGGGCGC TGCACGTCGT ACGTCAGCAC
GGCGGCGAGG ATCAGCCCGA GTGGGTGGAA CTACGCGACC GAATCGGGGC CAACGTGCTC
GAGTTCGGCT GGCACGAGCA GGCCGAGGCG TACAGCGTCG CGTATGGGCA CGAGGACAGT
GACGCCTCCT CGCTCTGGAT CGGACTGTCC GGCCTGCTGC CCGGGGACGA CCCACGCTTC
GTGTCCACCG TGCTCAAGAT CGAGGCGGAC CTGCGCAGTG GCCCGGTCGT CTACCGCTAC
CACTGGGAGG ACGGCCTGCC CGGCCGGGAG GGCGGCTTCC ACATCTGCAC CTCGTGGCTG
ATCGAGGCGT ACCTGCGTAC CGGCCGTCGA GGGGACGCGG AGGAACTGTT CGCCCAGATG
ATCGACACCG CCGGCCCGAC TGGGCTGCTG CCCGAGCAGT ATGACCCGCT GGCCGAGCGC
GGGCTGGGCA ATCATCCACA GGCCTACAGT CACCTCGGCG TGATCCGCTG CGCCCTCCTG
TTGGACAACA TGCTCAAGCA GTGA
 
Protein sequence
MSTSTELVNP VGGGLDQELR TAIGRVARVP QLLIACDYDG TLAPIVEDPS TAVPLPEAAA 
AVRALASLPQ TTVAVVSGRA LRDLAALSRL PSEVHLVGSH GSEFDIGFVE RLSPELVAVR
TRLRDALREI VAAHPGIRLE RKPASVAVHT RGVDPQVAAA AVEAVRNGPA TWDDVTVTQG
KEVIELSVVA THKGTAVDQL RTQRSASAVL FIGDDVTDEN AFGNLHGPDV GIKIGPGETK
AQYRVTEPIE AARALGLLLE TRRHWLFGER AVPIERHSML ANGRTVALLT PEAKVTWLCN
PKPDSAAIFA DLVGGSPAGY FSVVPERGGI PLGQRYRSNT MTVETRWSGL TVTDWLDTPD
TETTPDGPAI VSADSILVRV LTGRGRAHLE FAPRPEFGQV AVQLQPLDDG LLVLGSNEPV
ALYSPGVTWE VTSDGVYETA KAVVDLSTAG GPVVLEMRFG THSLEHHRLP IHERQAAAEQ
PWKDWVSTLR LPNTGRDLVA RSALTLRGLC HQPSGSILAA ATTSLPEELG GVRNWDYRYC
WLRDAALTAR ALVDLGSTGE AEALLRWIDG VVERTGGHPE RLHPLYTVDG YELGAEAVID
TLPGYAGSRP VRVGNLANHQ LQLDVFGPIA DLIAAVADAR GSVRDDEWRV LENMVEAVRR
RWHEPDHGIW EARLAPRHHI FSKVMCWLTV DRALHVVRQH GGEDQPEWVE LRDRIGANVL
EFGWHEQAEA YSVAYGHEDS DASSLWIGLS GLLPGDDPRF VSTVLKIEAD LRSGPVVYRY
HWEDGLPGRE GGFHICTSWL IEAYLRTGRR GDAEELFAQM IDTAGPTGLL PEQYDPLAER
GLGNHPQAYS HLGVIRCALL LDNMLKQ
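
The relationship between the gene sequence and the protein sequence above can be reproduced with a small translator. This is a minimal sketch, not the database's own pipeline. It assumes translation table 11 uses the standard codon-to-amino-acid assignments, and that an alternative start codon at the first position is read as methionine. `START_CODONS` is a simplified subset chosen for illustration, and all function names are hypothetical. The helper also strips the 10-base grouping spaces used in the listing.

```python
BASES = "TCAG"
# Standard codon assignments, ordered by first, second, third base over T, C, A, G.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: simplified subset of bacterial start codons, for illustration only.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    # A start codon at position 1 is read as Met even if it normally encodes V or L.
    if residues and dna[:3] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the listing above.
print(translate_cds("GTGAGCACCA GTACCGAACT CGTCAACCCG"))  # MSTSTELVNP
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` would be the way to check the complete protein and the reported GC content.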