Gene Sde_3381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3381 
Symbol 
ID3965930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4312100 
End bp4314049 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content53% 
IMG OID637922478 
Product1-deoxy-D-xylulose-5-phosphate synthase 
Protein accessionYP_528848 
Protein GI90023021 
COG category[H] Coenzyme transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1154] Deoxyxylulose-5-phosphate synthase 
TIGRFAM ID[TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.240721 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGACC AAATTCCGAC GCAGCGCCCG AATACCCCTC TTCTGGATGT CGTAGACACC 
CCAGCACGCC TGCGTGAATT AAGCGAAAAG CAACTCCCCC AGCTAGCCAA AGAACTGCGT
GAATATTTGC TGTATACCGT AGGTCAAACC GGCGGCCATT TCGGCGCAGG TTTAGGGGTT
GTAGAGCTAA CCGTCGCTTT GCACTATGTG TTAAACACCC CAGATGACAG GCTAGTATGG
GATGTAGGCC ACCAAACTTA CCCCCATAAA ATTCTTACCG GCCGCCGCGA GCAAATGTCC
AGCATTCGCC AACTAGACGG CCTATCGGGT TTCCCCAAGC GCAGCGAAAG CGAGTTCGAT
ACCTTTGGCG TTGGCCATTC AAGTACGTCG ATAAGCGCAG CCCTAGGCAT GGCACTCGCC
GCCGAAATGA CCGACAACCA GCAACAAACA GTGGCGGTAA TAGGCGACGG CTCCATGACC
GCAGGCATGG CCTTCGAAGC GCTAAACCAC GCCGCGCACG CCGACACCAA TATGATGGTG
ATATTGAACG ACAACAATAT GTCGATCTCT AAAAATGTGG GCGGGCTGGC CAATTACTTC
TCTAAAATTT GGGCAAGTAA AACATACTGC GCCTTGCGCG AGGGTAGTAA GCGCGTACTT
ACCAAAATTC CACAGGCTTG GGAACTCGCT CGAAAAACCG AAGAGCACAT GAAAGGCATG
GTATCCCCGG GTACCCTATT CGAAGAATTG GGGTTTTACT ATGTGGGCCC CATCGACGGC
CACGACTTAG AGCGCTTGGT ACACGATATT CGCAATATGC TCGCCATCCC CGGCCCCAAG
CTGCTGCACA TCATTACTCA AAAGGGCAAG GGTTTTACCC CCGCCGAAAA AGACCCTGTT
GGCTACCACG CACTCAATAA AATAGAGCCT AAAGCCAGCA TCACCCCCAT TAGCGCCAGC
GGCGGCGCTG AAGCCCCCGC AGCTAGCACA ATTAAAAAGC CCAAATACCA AACCGTATTT
GGCGACTGGT TATGCGATCT CGCAGAAGTC GACCCCTTCG TATTAGGTAT TACACCAGCA
ATGTGCGATG GCAGCGGCAT GGTAGAATTT GCCGAGCGCT TCCCCGACCG CTTTCACGAT
GTCGCCATTG CCGAACAACA CGCCGTAACC CTAGCCGCAG GCTTAGCATG TGAAAAATTC
AAACCTGTTG TAGCCATATA TTCCACCTTT TTACAGCGCG CGTACGATCA ACTGGTGCAC
GACGTAGCAT TGCAAAACCT CGATGTTACC TTTGCCATCG ACCGCGCCGG CCTAGTAGGT
GAAGACGGCC CAACACATGC GGGCGCATTC GATATAAGCT TTTTGCGCTG CATACCCAAA
ATAATTATTG CCACGCCTAG CGACGAAAAC GAATGTCGCC AATTGCTGTT TAGCGCTTAC
CACCACCCAG GTGCAGCCGC CGTACGCTAC CCGCGCGGCA CAGGCCCCGG CGCCGTCATC
GAATTAGAAA ACCAGCACTG GCCCATAGGC AAAGGTCGCG AGCTGCGCCA AGGCAAAACC
GTGTGTTTTA TTAATTTTGG TGTGTTACTA CCCGACGCCA TAGCCGTTGC AGAAGCTAAT
AATTACGGCG TATGCGATAT GCGCTGGGCC AAACCGCTAG ATAAAGACCT ACTGCTAAAC
ATGGCAGAGC AATACGATTA CCTAGTAACC CTCGAAGAAA ACGCCGTCGC CGGCGGTGCG
GGTGCAGGCG TAATGGAGCT TCTGGCCGCC GAAGGCATTA GCACCCCAGT ATTGCCACTG
GGCCTACCCG ACGAGTACTT AGACCACGGC AAACGCAGCC AGCTTTTACA AGCCGCCGGC
CTAGACAGAG CGAGCATTAA CCAGCGGATA AATCAATGGC TAAGCCGCCA TAATGGCGCT
GCGCACGATT CGCAAATACA CAGCCTGTAA
 
Protein sequence
MFDQIPTQRP NTPLLDVVDT PARLRELSEK QLPQLAKELR EYLLYTVGQT GGHFGAGLGV 
VELTVALHYV LNTPDDRLVW DVGHQTYPHK ILTGRREQMS SIRQLDGLSG FPKRSESEFD
TFGVGHSSTS ISAALGMALA AEMTDNQQQT VAVIGDGSMT AGMAFEALNH AAHADTNMMV
ILNDNNMSIS KNVGGLANYF SKIWASKTYC ALREGSKRVL TKIPQAWELA RKTEEHMKGM
VSPGTLFEEL GFYYVGPIDG HDLERLVHDI RNMLAIPGPK LLHIITQKGK GFTPAEKDPV
GYHALNKIEP KASITPISAS GGAEAPAAST IKKPKYQTVF GDWLCDLAEV DPFVLGITPA
MCDGSGMVEF AERFPDRFHD VAIAEQHAVT LAAGLACEKF KPVVAIYSTF LQRAYDQLVH
DVALQNLDVT FAIDRAGLVG EDGPTHAGAF DISFLRCIPK IIIATPSDEN ECRQLLFSAY
HHPGAAAVRY PRGTGPGAVI ELENQHWPIG KGRELRQGKT VCFINFGVLL PDAIAVAEAN
NYGVCDMRWA KPLDKDLLLN MAEQYDYLVT LEENAVAGGA GAGVMELLAA EGISTPVLPL
GLPDEYLDHG KRSQLLQAAG LDRASINQRI NQWLSRHNGA AHDSQIHSL