Gene Sare_2336 details

Gene Information

Locus tag: Sare_2336
Symbol:
ID: 5707964
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2688195
End bp: 2689268
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 66%
IMG OID: 641271814
Product: dTDP-glucose 4,6-dehydratase
Protein accession: YP_001537185
Protein GI: 159037932
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1088] dTDP-D-glucose 4,6-dehydratase
TIGRFAM ID: [TIGR01181] dTDP-glucose 4,6-dehydratase
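
The start, end, gene length, and protein length fields in this record agree with one another, and a short arithmetic check makes that explicit. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention) and a CDS whose final codon is a stop codon; the function names are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(2688195, 2689268)
print(length)                  # 1074
print(protein_length(length))  # 357
```

Both results match the Gene Length (1074 bp) and Protein Length (357 aa) fields.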


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.930883
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.268504
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGATCC TGGTCACCGG CGGTGCCGGC TTTATCGGCT CACACTATGT CCGGTCCCTG 
TTGGCTGGTG AGTACGCTGC GGAGGACGCT GTCGGAGCGG TGGACGCCTC CGTCACTGTC
CTGGACAAAC TCACCTACGC CGGCAATCGC GCCAACCTGC CGGCCACCCA CCCACGCCTG
ACCTTCGTGC AGGGTGACAT CTGCGACCAG GGACTGCTGC GGGACCTGTT CAAGGGCCAT
GACGCGATTC TGCACTTCGC CGCTGAGTCA CACGTGGATC GGTCGGTCGA CGGTGGAGCA
ACCTTCGTAC GCACCAATGC GTTGGGAACC CAGTCGGTGC TGGAAGCCGC TGTCGCGGTC
GGCGTACCGC GGGTCGTACA CGTCTCGACC GACGAGGTCT ATGGCTCCAT CGAACACGGG
GCGTGGACCG AGACGTGGCC GCTGCTGCCG AACTCCCCGT ATGCGGCTTC CAAGGCGAGC
GCGGACCTCA TCGCCCGTTC CTACTGGCGA ACCCATGGCC TGGACGTTTC GATCACCCGC
TGTTCCAACA ACTACGGGCC CTACCAGCAT CCGGAAAAGC TCGTCCCGCT GTTCATCACC
AACCTGCTCG AGGGATTGCC GGTGCCCGTG TACGGCACCG GGGTGAACGT GCGGGAGTGG
CTGCACGTGG CCGACCACTG CCGCGCACTC CACCGGGTAC TCACCCGAGG GCGTCCCGGT
GAGGTCTACA ACGTGGGCAG CGGAAACGAG TTGACGAACC TGGAGCTGAC CGAGCGCCTG
CTCCACCTGT GCGGCGCGGA CCGCTCCATG ATCCGTTACG TCACCGACCG CAAGGGGCAC
GACCAGCGGT ACTCCCTTGA CGACACCAAG ATCCGCGAGT GTCTCGGTTA CCGGCCCCGC
GTCTCCTTCG CCGATGGGCT GGCCGAAACC GTCGCCTGGT ACCGCGACAA CCCACAGTGG
TGGAAGGCGG CCAGGCACGG CGCTCCCAAG ACCACCGAGG CCGCGCCTCG GGCGACCACC
GGAGTGGTCC GTGGAACGCA CAACCGTATC GACGACCAGG AGCCACCAAA ATGA
 
Protein sequence
MRILVTGGAG FIGSHYVRSL LAGEYAAEDA VGAVDASVTV LDKLTYAGNR ANLPATHPRL 
TFVQGDICDQ GLLRDLFKGH DAILHFAAES HVDRSVDGGA TFVRTNALGT QSVLEAAVAV
GVPRVVHVST DEVYGSIEHG AWTETWPLLP NSPYAASKAS ADLIARSYWR THGLDVSITR
CSNNYGPYQH PEKLVPLFIT NLLEGLPVPV YGTGVNVREW LHVADHCRAL HRVLTRGRPG
EVYNVGSGNE LTNLELTERL LHLCGADRSM IRYVTDRKGH DQRYSLDDTK IRECLGYRPR
VSFADGLAET VAWYRDNPQW WKAARHGAPK TTEAAPRATT GVVRGTHNRI DDQEPPK
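
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The sketch below strips that grouping, checks the first and last codons, and computes a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the gene sequence are used as input. Under translation table 11, GTG is an accepted alternative start codon read as methionine at initiation, which is consistent with the gene starting with GTG while the protein starts with M.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from this record.
first_line = "GTGAGGATCC TGGTCACCGG CGGTGCCGGC TTTATCGGCT CACACTATGT CCGGTCCCTG"
last_line = "GGAGTGGTCC GTGGAACGCA CAACCGTATC GACGACCAGG AGCCACCAAA ATGA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))               # 60
print(head[:3])                # GTG (alternative start codon in table 11)
print(tail[-3:])               # TGA (stop codon)
print(gc_fraction(head[:10]))  # 0.6
```

Running `gc_fraction` on the full cleaned gene sequence gives the value to compare with the GC content field.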