Gene Shel_00140 details


Gene Information

Locus tag: Shel_00140
Symbol:
ID: 8393906
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Slackia heliotrinireducens DSM 20476
Kingdom: Bacteria
Replicon accession: NC_013165
Strand:
Start bp: 21130
End bp: 22116
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 62%
IMG OID: 644984789
Product: 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase
Protein accession: YP_003142439
Protein GI: 257062767
COG category: [I] Lipid transport and metabolism
COG ID: [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase
TIGRFAM ID: [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
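
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 21130
end_bp = 22116
gene_length_bp = 987
protein_length_aa = 328

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute
# an amino acid to the protein.
codons = span // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length is consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks hold for this record: 22116 - 21130 + 1 = 987, and 987 / 3 - 1 = 328.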


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCACG TGCGCGTGTT CTCGCCGGCC AAGGTGAATC TGCATCTTGA TATCAGCGAG 
CGCCGGCCGG ACGGCTACCA TGGGGCTTAT TCCATCATGC ATGCGCTGTC GATGCACGAC
ATGCTGACCA TGCGCCGTGA GTTTGCGGCG CCGGGCTCGG GCTTGGTTGT GGATGTGCAT
TGCGCCACCC ATGGCGATAT TGCTGAGCTG AACATTCCCG CAGAATCGAA CATCGCTTAC
AAGGCCGTCG TCCGTCTTGC CGAGGCTCTT GGACGGACTG GGGATGATTC CGTTCTGATA
GGTATCGACA AGAACATTCC CCATGCGGCG GGGCTAGGCG GCGGTTCGTC AAACGCCGCT
GCAGCGCTGC TGGGTGCCTG TGCGCTATGG GATATAGACT TGGCCGACAC TGGTGTTCGC
GCGATCGTGG AGCAGGTGGC ATCAGGATTG GGTGCCGACG TGCCTTTCTT CCTGCACGGC
GGGTGCGTGG CCCTTACCGA TAAGGGCGAC ATATACGAGC GTGACCTGGT GCCTAGCAAA
CGCAACGTGG TCATTGTGCG GCCTGAGGAG GGTGTTTCCA CCGGCGCTGC CTATGCGGCA
TTCGACGCGA ATCCGCCTCT GTCCAGCGAC GAGGTCAAAG CGGACGCCCG TGCGGCTGAA
TCCGCCGACG ACCTGCATCT TTTCAACAAC CTGGCTCCCG CTTCCGAAGG CCTGCTGCCG
GTTCTGACCG ATATTCGCGA GTGGTTGTCG GGCCATGCGG GCGTAGCGCA TGATGCAACC
ACAGGTGCGC CCCAGGTCCT TCTGTGCGGC AGTGGTTCGT CCACCTTCGC CATCTGCGAC
GACTTCGATG CGGCCTACAA GCTTGTGGGT GACGCTCGAT TGAACGGCTG GTGGGCCCGC
AGCTGCAATT TCACCAGCGC AGGAGCGCGC GTGCTGCCCA CGGCGGGTCA GGCCACCAAC
CTTGGTGCCG TGCAAAAGTC CTGGTAG
 
Protein sequence
MNHVRVFSPA KVNLHLDISE RRPDGYHGAY SIMHALSMHD MLTMRREFAA PGSGLVVDVH 
CATHGDIAEL NIPAESNIAY KAVVRLAEAL GRTGDDSVLI GIDKNIPHAA GLGGGSSNAA
AALLGACALW DIDLADTGVR AIVEQVASGL GADVPFFLHG GCVALTDKGD IYERDLVPSK
RNVVIVRPEE GVSTGAAYAA FDANPPLSSD EVKADARAAE SADDLHLFNN LAPASEGLLP
VLTDIREWLS GHAGVAHDAT TGAPQVLLCG SGSSTFAICD DFDAAYKLVG DARLNGWWAR
SCNFTSAGAR VLPTAGQATN LGAVQKSW
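
The sequences above are listed in blocks of 10 characters, 60 per line. The following is a minimal sketch of joining such a listing into a single string and computing its GC percentage, so that the result can be compared with the Gene Length and GC content fields. The helper names `clean_sequence` and `gc_percent` are hypothetical and not part of the source database. Only the first and last lines of the gene sequence are included here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked, multi-line sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence listed above. Paste the
# full listing in their place to check the whole record.
first_line = "ATGAACCACG TGCGCGTGTT CTCGCCGGCC AAGGTGAATC TGCATCTTGA TATCAGCGAG"
last_line = "CTTGGTGCCG TGCAAAAGTC CTGGTAG"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))            # number of bases on a full line
print(head[:3], tail[-3:])  # first codon and final codon
print(round(gc_percent(head), 1))
```

Running `clean_sequence` on the complete listing and taking `len` of the result gives a value to compare with the Gene Length field, and `gc_percent` gives a value to compare with the GC content field.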