Gene Shel_10200 details


Gene Information

Locus tag: Shel_10200
Symbol:
ID: 8394911
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Slackia heliotrinireducens DSM 20476
Kingdom: Bacteria
Replicon accession: NC_013165
Strand:
Start bp: 1162729
End bp: 1163739
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 62%
IMG OID: 644985778
Product: 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase
Protein accession: YP_003143404
Protein GI: 257063732
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
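
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1162729
end_bp = 1163739
gene_length_bp = 1011
protein_length_aa = 336

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue, plus one terminal stop
# codon that does not contribute a residue.
expected_protein_aa = gene_length_bp // 3 - 1

print(span_bp, expected_protein_aa)  # prints: 1011 336
```

Under these assumptions both derived values agree with the lengths reported in the record.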


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00171089
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000737673
Fosmid hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATTGTTA TATTGAAGCA GCATGCGACG GAAGAGGATC GCCAGAACTT GATCGAGGAG 
CTTGAGCGCT ATGGCGTGAA GGTGCATCTT TCCGAAGGCG ACCAGCGGAC CGTGCTGGGT
CTGGTGGGCG ACACCACGAA GATCGACAAA TCCGTCCTCG AGGCGAATGA TGCCGTGCAC
GCAGTAAAGC GGGTGGCTGA ACCGTACAAG AAGGCGAACC GCAAGTTCCA CCCGGAAGAC
ACGGTCGTAA AGGTGGGCAA CACCCAGGTC GGCGGCGGGA ACCTCACGGT CATGGCGGGT
CCCTGCAGCG TCGAGAGCCG CGAGCAGATC CTGGCTGTCG CCCATGCTGT CAAAGCGGCC
GGTGCGACCA TGTTGCGCGG CGGCGCCTTC AAGCCCCGCA CCAGCCCCTA CAGCTTCCAG
GGCATGGGTC CCCAGGGGCT CGACCTGCTG CGCGAAGCCA AAGAGGAGAC GGGTCTGCCC
ATCGTGACCG AAATCATGGA TCCCAATGAC CTGAAATACT TCCGCGATAT CGACGTGCTG
CAGGTGGGCG CGCGCAACAT GCAGAACTTC GACCTGCTGA AAGCCCTGGG CAAGACCGAC
CAGCCCATCC TTTTGAAGCG CGGTCTTTCG GCAACCTACG AGGAGCTTCT GATGAGCGCC
GAATACATTA TGAGCGAAGG GAACCCCAAC GTCATCCTGT GCGAACGAGG CATCCGCACC
TTCGAAACCT ACACCCGCAA CACCTTGGAC GTAGCGGCTG TGCCCGTGCT GCGCAAGCTG
ACGCATCTGC CGATCATCAT CGACCCCAGT CACTCGGGCG GCCGCCGCGA GCTGGTGGTG
CCGCTGTCGC TGGCGGGTAT AGCCGCCGGG GCCGACGGCA TCGAGGTGGA GGTCCACAAC
GATCCCGCCC ACGCCCTGTC CGACGGTCCC CAGCAGCTGC TGCCCGAAGC CTTCGAGGTC
CTAATGGCGC AGCTCAAGAC GGTTCATGCC GCAGTGCATG CAGAGGCGTA A
 
Protein sequence
MIVILKQHAT EEDRQNLIEE LERYGVKVHL SEGDQRTVLG LVGDTTKIDK SVLEANDAVH 
AVKRVAEPYK KANRKFHPED TVVKVGNTQV GGGNLTVMAG PCSVESREQI LAVAHAVKAA
GATMLRGGAF KPRTSPYSFQ GMGPQGLDLL REAKEETGLP IVTEIMDPND LKYFRDIDVL
QVGARNMQNF DLLKALGKTD QPILLKRGLS ATYEELLMSA EYIMSEGNPN VILCERGIRT
FETYTRNTLD VAAVPVLRKL THLPIIIDPS HSGGRRELVV PLSLAGIAAG ADGIEVEVHN
DPAHALSDGP QQLLPEAFEV LMAQLKTVHA AVHAEA
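
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates that relationship on the first and last stretches of the gene sequence. It assumes the sequence is given 5' to 3' in the coding frame, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG ordering.
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence in tens.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final line of the gene sequence in this record.
first_bases = "ATGATTGTTA TATTGAAGCA GCATGCGACG"
last_line = "CTAATGGCGC AGCTCAAGAC GGTTCATGCC GCAGTGCATG CAGAGGCGTA A"

print(translate(first_bases))  # prints: MIVILKQHAT
print(translate(last_line))    # prints: LMAQLKTVHAAVHAEA
```

Both outputs match the corresponding ends of the protein sequence listed above; the final TAA codon is a stop and contributes no residue.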